A Large Comparison of Normalization Methods on Time Series

Big Data Research - Tập 34 - Trang 100407 - 2023
Felipe Tomazelli Lima1, Vinicius M.A. Souza1
1Pontifícia Universidade Católica do Paraná, Rua Imaculada Conceição 1155, 80215-901 Curitiba, PR, Brazil

Tài liệu tham khảo

Bagnall, 2017, The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances, Data Min. Knowl. Discov., 31, 606, 10.1007/s10618-016-0483-9 Abanda, 2019, A review on distance based time series classification, Data Min. Knowl. Discov., 33, 378, 10.1007/s10618-018-0596-4 Parmezan, 2022, Time series prediction via similarity search: exploring invariances, distance measures and ensemble functions, IEEE Access, 10, 78022, 10.1109/ACCESS.2022.3192849 Parmezan, 2019, Evaluation of statistical and machine learning models for time series prediction: identifying the state-of-the-art and the best conditions for the use of each model, Inf. Sci., 484, 302, 10.1016/j.ins.2019.01.076 Liao, 2005, Clustering of time series data—a survey, Pattern Recognit., 38, 1857, 10.1016/j.patcog.2005.01.025 Mueen, 2014, Time series motif discovery: dimensions and applications, Wiley Interdiscip. Rev. Data Min. Knowl. Discov., 4, 152, 10.1002/widm.1119 Blázquez-García, 2021, A review on outlier/anomaly detection in time series data, ACM Comput. Surv., 54, 1, 10.1145/3444690 Freeman, 2021, Experimental comparison and survey of twelve time series anomaly detection algorithms, J. Artif. Intell. Res., 72, 849, 10.1613/jair.1.12698 Batista, 2014, Cid: an efficient complexity-invariant distance for time series, Data Min. Knowl. Discov., 28, 634, 10.1007/s10618-013-0312-3 Slijepcevic, 2021, Explaining machine learning models for clinical gait analysis, ACM Trans. Comput. Healthc., 3, 1, 10.1145/3474121 Yamada, 1991, Phonetic typewriter based on phoneme source modeling, 169 Alpaydin, 1996, Comparison of statistical and neural classifiers and their applications to optical character recognition and speech classification, 61 Ding, 2008, Querying and mining of time series data: experimental comparison of representations and distance measures, 1542 Mueen, 2011, Logical-shapelets: an expressive primitive for time series classification, 1154 Rakthanmanon, 2013, Fast shapelets: a scalable algorithm for discovering time series shapelets, 668 Euachongprasit, 2008, Accurate and efficient retrieval of multimedia time series data under uniform scaling and time warping, 100 Rakthanmanon, 2012, Searching and mining trillions of time series subsequences under dynamic time warping, 262 Li, 2012, Visualizing variable-length time series motifs, 895 Rakthanmanon, 2013, Addressing big data time series: mining trillions of time series subsequences under dynamic time warping, ACM Trans. Knowl. Discov. Data, 7, 1, 10.1145/2500489 Paparrizos, 2015, k-shape: efficient and accurate clustering of time series, 1855 Parmezan, 2015, A study of the use of complexity measures in the similarity search process adopted by knn algorithm for time series prediction, 45 Yeh, 2016, Matrix profile i: all pairs similarity joins for time series: a unifying view that includes motifs, discords and shapelets, 1317 Schäfer, 2016, Scalable time series classification, Data Min. Knowl. Discov., 30, 1273, 10.1007/s10618-015-0441-y Wang, 2017, Time series classification from scratch with deep neural networks: a strong baseline, 1578 Gogolou, 2018, Comparing similarity perception in time series visualizations, IEEE Trans. Vis. Comput. Graph., 25, 523, 10.1109/TVCG.2018.2865077 Valovage, 2018, Enhancing machine learning classification for electrical time series applications, 8042 Karim, 2019, Insights into lstm fully convolutional networks for time series classification, IEEE Access, 7, 67718, 10.1109/ACCESS.2019.2916828 Chen, 2022, Learning-based shapelets discovery by feature selection for time series classification, Appl. Intell., 1 Liu, 2022, 1d convolutional neural networks for chart pattern classification in financial time series, J. Supercomput., 1, 10.1007/s11227-021-03859-5 Xiao-Xia, 2022, Time series based data explorer and stream analysis for anomaly prediction, Wirel. Commun. Mob. Comput. Zanella, 2022, Ts-dense: time series data augmentation by subclass clustering, 1 Bountrogiannis, 2022, Distribution agnostic symbolic representations for time series dimensionality reduction and online anomaly detection, IEEE Trans. Knowl. Data Eng., 35, 5752 Keogh, 2002, On the need for time series data mining benchmarks: a survey and empirical demonstration, 102 Panigrahi, 2013, Effect of normalization techniques on univariate time series forecasting using evolutionary higher order neural network, Int. J. Eng. Adv. Technol., 3, 280 Bhanja, 2019, Impact of data normalization on deep neural network for time series forecasting, 27 Al-Ghamdi, 2021, Evaluation of artificial neural networks performance using various normalization methods for water demand forecasting, 1 Höppner, 2014, Less is more: similarity of time series under linear transformations, 560 Lines, 2015, Time series classification with ensembles of elastic distance measures, Data Min. Knowl. Discov., 29, 565, 10.1007/s10618-014-0361-2 Ismail Fawaz, 2019, Deep learning for time series classification: a review, Data Min. Knowl. Discov., 33, 917, 10.1007/s10618-019-00619-1 Jiang, 2020, Time series classification: nearest neighbor versus deep learning models, SN Appl. Sci., 2, 1, 10.1007/s42452-020-2506-9 Berndt, 1994, Using dynamic time warping to find patterns in time series, 359 Vlachos, 2003, Indexing multi-dimensional time-series with support for multiple distance measures, 216 Chen, 2005, Robust and fast similarity search for moving object trajectories, 491 Marteau, 2009, Time warp edit distance with stiffness adjustment for time series matching, IEEE Trans. Pattern Anal. Mach. Intell., 31, 306, 10.1109/TPAMI.2008.76 Giusti, 2013, An empirical comparison of dissimilarity measures for time series classification, 82 He, 2016, Deep residual learning for image recognition, 770 Zhong, 2022, Combining filtering and cross-correlation efficiently for streaming time series, ACM Trans. Knowl. Discov. Data, 16, 1, 10.1145/3502738 Ioffe, 2015, Batch normalization: accelerating deep network training by reducing internal covariate shift, 448 Kvalheim, 1994, Preprocessing of analytical profiles in the presence of homoscedastic or heteroscedastic noise, Anal. Chem., 66, 43, 10.1021/ac00073a010 Singh, 2020, Investigating the impact of data normalization on classification performance, Appl. Soft Comput., 97, 10.1016/j.asoc.2019.105524 Latha, 2011, Efficient approach to normalization of multimodal biometric scores, Int. J. Comput. Appl., 32, 57 Han, 2022 Bolstad, 2003, A comparison of normalization methods for high density oligonucleotide array data based on variance and bias, Bioinformatics, 19, 185, 10.1093/bioinformatics/19.2.185 Yeo, 2000, A new family of power transformations to improve normality or symmetry, Biometrika, 87, 954, 10.1093/biomet/87.4.954 Lima Dau Dau, 2019, The ucr time series archive, IEEE/CAA J. Autom. Sin., 6, 1293, 10.1109/JAS.2019.1911747 Guyon, 2007, Competitive baseline methods set new standards for the nips 2003 feature selection benchmark, Pattern Recognit. Lett., 28, 1438, 10.1016/j.patrec.2007.02.014 Souza, 2013, Classification of data streams applied to insect recognition: initial results, 76 Silva, 2015, Exploring low cost laser sensors to identify flying insect species, J. Intell. Robot. Syst., 80, 313, 10.1007/s10846-014-0168-9 Jiménez, 2019, Prediction of mosquito species and population age structure using mid-infrared spectroscopy and supervised machine learning, Wellcome Open Res., 4 Rossi, 2015, The network data repository with interactive graph analytics and visualization Souza, 2021, Efficient unsupervised drift detector for fast and high-dimensional data streams, Knowl. Inf. Syst., 63, 1497, 10.1007/s10115-021-01564-6 Wolpert, 1996, The lack of a priori distinctions between learning algorithms, Neural Comput., 8, 1341, 10.1162/neco.1996.8.7.1341 Lubba, 2019, catch22: canonical time-series characteristics: selected through highly comparative time-series analysis, Data Min. Knowl. Discov., 33, 1821, 10.1007/s10618-019-00647-x Large, 2019, On time series classification with dictionary-based classifiers, Intell. Data Anal., 23, 1073, 10.3233/IDA-184333 Baydogan, 2013, A bag-of-features framework to classify time series, IEEE Trans. Pattern Anal. Mach. Intell., 35, 2796, 10.1109/TPAMI.2013.72