Feature selection with annealing for forecasting financial time series
- Pabuccu, H; Barbu, A
- 2024
- 点赞
- 收藏
【Author】 Pabuccu, Hakan; Barbu, Adrian
【Source】FINANCIAL INNOVATION
【影响因子】6.793
【Abstract】Stock market and cryptocurrency forecasting is very important to investors as they aspire to achieve even the slightest improvement to their buy-or-hold strategies so that they may increase profitability. However, obtaining accurate and reliable predictions is challenging, noting that accuracy does not equate to reliability, especially when financial time-series forecasting is applied owing to its complex and chaotic tendencies. To mitigate this complexity, this study provides a comprehensive method for forecasting financial time series based on tactical input-output feature mapping techniques using machine learning (ML) models. During the prediction process, selecting the relevant indicators is vital to obtaining the desired results. In the financial field, limited attention has been paid to this problem with ML solutions. We investigate the use of feature selection with annealing (FSA) for the first time in this field, and we apply the least absolute shrinkage and selection operator (Lasso) method to select the features from more than 1000 candidates obtained from 26 technical classifiers with different periods and lags. Boruta (BOR) feature selection, a wrapper method, is used as a baseline for comparison. Logistic regression (LR), extreme gradient boosting (XGBoost), and long short-term memory are then applied to the selected features for forecasting purposes using 10 different financial datasets containing cryptocurrencies and stocks. The dependent variables consisted of daily logarithmic returns and trends. The mean-squared error for regression, area under the receiver operating characteristic curve, and classification accuracy were used to evaluate model performance, and the statistical significance of the forecasting results was tested using paired t-tests. Experiments indicate that the FSA algorithm increased the performance of ML models, regardless of problem type. The FSA hybrid models showed better performance and outperformed the other BOR models on seven of the 10 datasets for regression and classification. FSA-based models also outperformed Lasso-based models on six of the 10 datasets for regression and four of the 10 datasets for classification. None of the hybrid BOR models outperformed the hybrid FSA models. Lasso-based models, excluding the LR type, were comparable to the best models for six of the 10 datasets for classification. Detailed experimental analysis indicates that the proposed methodology can forecast returns and their movements efficiently and accurately, providing the field with a useful tool for investors.
【Keywords】Financial time-series forecasting; Feature selection; Machine learning; Cryptocurrency; Stock market; Return forecasting
【发表时间】2024 OCT 8
【收录时间】2024-10-13
【文献类型】实证数据
【主题类别】
区块链治理-技术治理-交易预测
Zach
这篇论文主要的研究内容是使用机器学习模型对金融时间序列数据进行预测,并通过特征选择技术来提高预测的准确性和可靠性。研究人员提出了一种综合方法,通过战术输入-输出特征映射技术来预测金融时间序列,并特别关注了特征选择在提高模型性能中的作用。论文中比较了不同的特征选择方法,包括特征选择与退火(FSA)和最小绝对收缩和选择算子(Lasso),以及使用逻辑回归(LR)、极端梯度提升(XGBoost)和长短期记忆网络(LSTM)等机器学习模型进行预测的性能。实验结果表明,FSA算法在提高机器学习模型的性能方面表现出色,并且在大多数情况下优于其他特征选择方法。此外,论文还对10个不同的金融数据集进行了详细的实验分析,包括股票和加密货币数据,并使用均方误差、AUC和分类准确率等指标来评估模型性能。通过配对t检验,研究人员还测试了预测结果的统计显著性。总的来说,该研究为金融时间序列预测提供了一种有效的工具,并为投资者在股票和加密货币市场中的决策提供了支持。
回复