Metric-based model selection for time-series forecasting
Tóm tắt
Metric-based methods, which use unlabeled data to detect gross differences in behavior away from the training points, have recently been introduced for model selection, often yielding very significant improvements over alternatives (including cross-validation). We introduce extensions that take advantage of the particular case of time-series data in which the task involves prediction with a horizon h. The ideas are: (i) to use at t the h unlabeled examples that precede t for model selection, and (ii) take advantage of the different error distributions of cross-validation and the metric methods. Experimental results establish the effectiveness of these extensions in the context of feature subset selection.
Từ khóa
#Predictive models #Linear regression #Input variables #Testing #Training data #Machine learningTài liệu tham khảo
10.2307/1913610
foster, 1994, The risk inflation criterion for multiple regression;’ Annals of Statistic, 22, 1947
schuurmans, 1997, A new metric-based approach to model selection, Proceedings or the National Conference on Artificial Intelligence (AAAI-97), 552
10.1214/aos/1176350051
vapnik, 1982, Estimation of Dependences Based on Empirical Data
10.1023/A:1013947519741
campbell, 1997, The Econometrics of Financial Markets, 10.1515/9781400830213
bengio, 2002, Special Issue on New methods for model selection and model combination, Machine Learning, 48, 10.1023/A:1013921901994