Index based processing of semi-restrictive temporal joins

Donghui Zhang1, V.J. Tsotras1
1Computer Science Department, University of California, Riverside, CA, USA

Tóm tắt

Temporal joins are important but very costly operations. While a temporal join can involve the whole time (and/or key) domain, we consider the more general case where the join is defined by some time-key rectangle from the whole space (i.e., when the user is interested in joining portions of the-usually large-temporal data). In the most restrictive join, objects (within this rectangle) are joined together based on key equality and interval intersection. This paper concentrates on semi-restrictive joins, i.e., when either the key equality (equi-join) or the interval intersection (time-join) predicates are used. Given the large relations created by the ever increasing time dimension, we assume that each temporal relation is indexed and examine efficient ways to process semi-restrictive temporal joins. Utilizing an index is helpful since it directs the join towards the objects that are within the time-key rectangle. A straightforward approach is to perform an unsynchronized join. An index selection query on each relation identifies all objects within the time-key rectangle which are then joined. Although simple, this approach ignores the data distribution in the other relation. Instead, in a synchronized join, both indices are concurrently traversed as the join is computed. Synchronized semi-restrictive join algorithms can be performed utilizing traditional indices like B+-trees or R-trees. The drawback of this approach is that traditional indices do not achieve good temporal data clustering. Better clustering is achieved by temporal indices through record copying. Nevertheless, record copies can greatly affect the correctness and effectiveness of join performance.

Từ khóa

#Clustering algorithms #Computer science #Concurrent computing #Indexing #Robustness #Finishing

Tài liệu tham khảo

huang, 1997, Spatial Joins Using R-trees: Breadth-First Traversal with Global Optimizations, Proc of VLDB 10.1109/69.755613 10.1145/115790.115807 kline, 1998, Time-IT, the Time-Integrated Testbed 10.1109/69.667079 leung, 1993, Stream Processing: Temporal Query Processing and Optimization, Temporal Databases Theory Design and Implementation lu, 1994, On Spatially Partitioned Temporal Join, Proc of VLDB 10.1145/67544.66956 10.1142/9789814503730_0049 ramaswamy, 1996, I/O-Efficient Join Algorithms for Temporal, Spatial, and Constraint Databases, Tech Report 10.1145/170035.170075 zhang, 2002, Efficient Temporal Join Processing using Indices, Proc of ICDE becker, 1996, An Asymptotically Optimal Multiversion B-Tree, VLDB Journal, 5, 10.1007/s007780050028 van den bercken, 1996, Query Processing Techniques for Multiversion Access Methods, Proc of VLDB 10.1145/93597.98741 10.1109/ICDE.1993.344078 elmasri, 1990, The Time Index: An Access Structure for Temporal Data, Proc of VLDB arge, 2000, A Unified Approach For Indexed and Non-Indexed Spatial Joins, Proc of EDBT 10.1109/ICDE.1991.131481 arge, 1998, Scalable Sweeping-Based Spatial Join, Proc of VLDB son, 1996, Efficient Temporal Join Processing using Time Index, Proc of SSDBM segev, 1988, The Representation of a Temporal Data Model in the Relational Environment, Proc of SSDBM 10.1109/ICDE.1994.283041 10.1145/319806.319816 10.1109/ICDE.1994.283042 zurek, 1997, Optimization of Partitioned Temporal Joins 10.1109/69.599929