A fast LSH-based similarity search method for multivariate time series

Information Sciences - Tập 476 - Trang 337-356 - 2019
Chenyun Yu1, Lintong Luo2, Leanne Lai-Hang Chan3, Thanawin Rakthanmanon4,5, Sarana Nutanong5
1Department of Computer Science, City University of Hong Kong, Hong Kong, China
2Department of Computer Science, University of California, Irvine, USA
3Department of Electrical Engineering, City University of Hong Kong, Hong Kong, China
4Department of Computer Engineering, Kasetsart University, Thailand
5School of Information Science and Technology, VISTEC, Thailand

Tài liệu tham khảo

Bahadori, 2015, Functional subspace clustering with application to time series, 228 Bankó, 2012, Correlation based dynamic time warping of multivariate time series, Expert Syst. Appl., 39, 12814, 10.1016/j.eswa.2012.05.012 Chen, 2013, DTW-D: time series semi-supervised learning from a single example, 383 Datar, 2004, Locality-sensitive hashing scheme based on p-stable distributions, 253 Deng, 2016, Piecewise two-dimensional normal cloud representation for time-series data mining, Inf. Sci. (Ny), 374, 32, 10.1016/j.ins.2016.09.027 Ding, 2008, Querying and mining of time series data: experimental comparison of representations and distance measures, Proc. VLDB Endowment, 1, 1542, 10.14778/1454159.1454226 Ding, 2017, Query suggestion to allow intuitive interactive search in multidimensional time series Drosou, 2012, Disc diversity: result diversification based on dissimilarity and coverage, Proc. VLDB Endowment, 6, 13, 10.14778/2428536.2428538 Fu, 2008, Scaling and time warping in time series querying, The VLDB Journal-The International Journal on Very Large Data Bases, 17, 899, 10.1007/s00778-006-0040-z Gan, 2012, Locality-sensitive hashing scheme based on dynamic collision counting, 541 Graves, 2009, A novel connectionist system for unconstrained handwriting recognition, IEEE Trans. Pattern Anal. Mach. Intell., 31, 855, 10.1109/TPAMI.2008.137 Guttman, 1984, R-trees: A dynamic index structure for spatial searching, 47 Hjaltason, 1999, Distance browsing in spatial databases, ACM Trans. Database Syst. (TODS), 24, 265, 10.1145/320248.320255 Hoeffding, 1963, Probability inequalities for sums of bounded random variables, J. Am. Stat. Assoc., 58, 13, 10.1080/01621459.1963.10500830 Holmes, 2002, A probabilistic nearest neighbour method for statistical pattern recognition, J. R. Stat. Soc., 64, 295, 10.1111/1467-9868.00338 Huang, 2015, Query-aware locality-sensitive hashing for approximate nearest neighbor search, Proc. VLDB Endowment, 9, 1, 10.14778/2850469.2850470 Indyk, 1998, Approximate nearest neighbors: Towards removing the curse of dimensionality, 604 Itakura, 1975, Minimum prediction residual principle applied to speech recognition, IEEE Trans. Acoust. Speech Signal Process., 23, 67, 10.1109/TASSP.1975.1162641 Kale, 2014, An examination of multivariate time series hashing with applications to health care, 260 Keogh, 2005, Exact indexing of dynamic time warping, Knowl. Inf. Syst., 7, 358, 10.1007/s10115-004-0154-9 Kim, 2007, Performance bottleneck of subsequence matching in time-series databases: observation, solution, and performance evaluation, Inf. Sci. (Ny), 177, 4841, 10.1016/j.ins.2007.06.032 Kim, 2001, An index-based approach for similarity search supporting time warping in large sequence databases, 607 Krawczak, 2014, An approach to dimensionality reduction in time series, Inf. Sci. (Ny), 260, 15, 10.1016/j.ins.2013.10.037 Längkvist, 2014, A review of unsupervised feature learning and deep learning for time-series modeling, Pattern Recognit. Lett., 42, 11, 10.1016/j.patrec.2014.01.008 LeCun, 2015, Deep learning, Nature, 521, 436, 10.1038/nature14539 Liao, 2005, Clustering of time series data - a survey, Pattern Recognit., 38, 1857, 10.1016/j.patcog.2005.01.025 Lim, 2007, Using multiple indexes for efficient subsequence matching in time-series databases, Inf. Sci. (Ny), 177, 5691, 10.1016/j.ins.2007.07.004 Lin, 2007, Experiencing SAX: a novel symbolic representation of time series, Data Min. Knowl. Discov., 15, 107, 10.1007/s10618-007-0064-z Liu, 2016, Complex activity recognition using time series pattern dictionary learned from ubiquitous sensors, Inf. Sci. (Ny), 340–341, 41, 10.1016/j.ins.2016.01.020 Luo, 2017, SSH (Sketch, shingle, & hash) for indexing massive-scale time series, 38 Lv, 2007, Multi-probe LSH: efficient indexing for high-dimensional similarity search, 950 Mikolov, 2010, Recurrent neural network based language model, 2, 3 Peng, 2012, Attribute-based subsequence matching and mining, 989 Petropoulos, 2017, A hidden markov model with dependence jumps for predictive modeling of multidimensional time-series, Infornation Science, 412, 50, 10.1016/j.ins.2017.05.038 Pravilovic, 2017, Using multiple time series analysis for geosensor data forecasting, Inf. Sci. (Ny), 380, 31, 10.1016/j.ins.2016.11.001 Rakthanmanon, 2012, Searching and mining trillions of time series subsequences under dynamic time warping, 262 Sakoe, 1978, Dynamic programming algorithm optimization for spoken word recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, 26, 43, 10.1109/TASSP.1978.1163055 Schäfer, 2016, Scalable time series classification, Data Min. Knowl. Discov., 30, 1273, 10.1007/s10618-015-0441-y Shieh, 2008, iSAX: indexing and mining terabyte sized time series, 623 Shokoohi-Yekta, 2015, On the non-trivial generalization of dynamic time warping to the multi-dimensional case, 289 Tao, 2010, Efficient and accurate nearest neighbor and closest pair search in high-dimensional space, ACM Trans. Database Syst. (TODS), 35, 20, 10.1145/1806907.1806912 Ulanova, 2015, Scalable clustering of time series with U-Shapelets, 900 Vlachos, 2003, Indexing multi-dimensional time-series with support for multiple distance measures, 216 Yi, 1998, Efficient retrieval of similar time sequences under time warping, 201 Yoon, 2005, Feature subset selection and feature ranking for multivariate time series, IEEE Transactions on Knowledge and Data Engineering, 17, 1186, 10.1109/TKDE.2005.144 Yu, 2017, A generic method for accelerating lsh-based similarity join processing, IEEE Trans. Knowl. Data Eng., 29, 712, 10.1109/TKDE.2016.2638838 Zhang, 2017, Dynamic time warping under limited warping path length, Inf. Sci. (Ny), 393, 91, 10.1016/j.ins.2017.02.018 Zhu, 2012, A novel approximation to dynamic time warping allows anytime clustering of massive time series datasets, 999