Metric-based model selection for time-series forecasting

Y. Bengio1, N. Chapados2
1Department IRO, Université de Montréal, Montreal, QUE, Canada
2Department IRO, Université de Montreal, Montreal, QUE, Canada

Tóm tắt

Metric-based methods, which use unlabeled data to detect gross differences in behavior away from the training points, have recently been introduced for model selection, often yielding very significant improvements over alternatives (including cross-validation). We introduce extensions that take advantage of the particular case of time-series data in which the task involves prediction with a horizon h. The ideas are: (i) to use at t the h unlabeled examples that precede t for model selection, and (ii) take advantage of the different error distributions of cross-validation and the metric methods. Experimental results establish the effectiveness of these extensions in the context of feature subset selection.

Từ khóa

#Predictive models #Linear regression #Input variables #Testing #Training data #Machine learning

Tài liệu tham khảo

10.2307/1913610 foster, 1994, The risk inflation criterion for multiple regression;’ Annals of Statistic, 22, 1947 schuurmans, 1997, A new metric-based approach to model selection, Proceedings or the National Conference on Artificial Intelligence (AAAI-97), 552 10.1214/aos/1176350051 vapnik, 1982, Estimation of Dependences Based on Empirical Data 10.1023/A:1013947519741 campbell, 1997, The Econometrics of Financial Markets, 10.1515/9781400830213 bengio, 2002, Special Issue on New methods for model selection and model combination, Machine Learning, 48, 10.1023/A:1013921901994