Comparison of feature selection methods using ANNs in MCP-wind speed methods. A case study

Carta JA; Cabrera P; Matias JM; Castellano F

Applied Energy, Vol.158, 490-507, 2015

DOI: 10.1016/j.apenergy.2015.08.102

Recent studies in the field of renewable energies, and specifically in wind resource prediction, have shown growing interest in proposals for Measure-Correlate-Predict (MCP) methods which simultaneously use data recorded at various reference weather stations. In this context, the use of a high number of reference stations may result in overspecification with its associated negative effects. These include, amongst others, an increase in the estimation error and/or overfitting which could be detrimental to the generalisation capacity of the model when handling new data (prediction). This paper analyses the benefits of feature selection for use with Artificial Neural Network (ANN) techniques with a multilayer perceptron (MLP) structure when the ANNs are used as MCP methods to predict mean hourly wind speeds at a target site. The features considered in this study were the mean hourly wind speeds and directions recorded in 2003 and 2004 at five weather stations in the Canary Archipelago (Spain). The two feature selection techniques considered in the analysis were the Correlation Feature Selection (CFS), which is a correlation-based filter approach (FA), and an MLP-based wrapper approach (WA). The metrics used to compare the results were the mean absolute error (MAE), the mean absolute percentage error (MAPE) and the index of agreement (IoA). Evaluation of the mean errors obtained in the 10-fold cross-validation tests for the year used to represent the short-term wind data period resulted in several conclusions. These included, notably, that the WA gave lower mean errors than the FA in 100% of the cases analysed independently of the metric employed. However, the FA resulted in a significant reduction in computational load and considerable enhancement of model interpretability. 
When very good correlation coefficients were obtained between the target and reference stations, no statistically significant difference was observed at the 5% level between the three models (FA, WA and the models constructed with all the variables) in most of the cases analysed. © 2015 Elsevier Ltd. All rights reserved.
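
The three comparison metrics named in the abstract can be illustrated with a minimal sketch. The abstract does not give the formulas, so the sketch assumes the standard definitions of MAE and MAPE and assumes Willmott's original form of the index of agreement; the function names and the sample wind speeds are hypothetical and are not taken from the paper.

```python
def mae(observed, predicted):
    """Mean absolute error, in the units of the data (e.g. m/s)."""
    return sum(abs(p - o) for o, p in zip(observed, predicted)) / len(observed)


def mape(observed, predicted):
    """Mean absolute percentage error, in percent.

    Undefined when an observed value is zero (e.g. calm hours),
    so such records must be filtered out by the caller.
    """
    return 100.0 * sum(abs((p - o) / o) for o, p in zip(observed, predicted)) / len(observed)


def index_of_agreement(observed, predicted):
    """Index of agreement, assuming Willmott's original form:

    IoA = 1 - sum((P - O)^2) / sum((|P - mean(O)| + |O - mean(O)|)^2)

    A value of 1 indicates perfect agreement.
    """
    o_mean = sum(observed) / len(observed)
    numerator = sum((p - o) ** 2 for o, p in zip(observed, predicted))
    denominator = sum(
        (abs(p - o_mean) + abs(o - o_mean)) ** 2
        for o, p in zip(observed, predicted)
    )
    return 1.0 - numerator / denominator


# Hypothetical mean hourly wind speeds (m/s) at a target site.
observed = [4.0, 5.0, 10.0]
predicted = [5.0, 5.0, 8.0]

print(mae(observed, predicted))
print(mape(observed, predicted))
print(index_of_agreement(observed, predicted))
```

Lower MAE and MAPE indicate a better model, whereas a higher IoA does, so the three metrics have to be read in opposite directions when comparing the FA, WA and all-variable models.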

Keywords: Measure-correlate-predict method; Artificial neural networks; Wind speed; Wind direction; Feature selection; Cross-validation technique