Cryptocurrency Price Forecasting
Cryptocurrency Price Forecasting
A R T I C L E I N F O A B S T R A C T
Keywords: Cryptocurrency price forecasting is attracting considerable interest due to its crucial decision support role in
Cryptocurrency investment strategies. Large fluctuations in non-stationary cryptocurrency prices motivate the urgent need for
Bitcoin accurate forecasting models. The lack of seasonal effects and the need to meet a number of unrealistic re
Forecasting
quirements make it difficult to make accurate forecasts using traditional statistical methods, leaving machine
Ensemble learning
Deep learning
learning, particularly ensemble and deep learning, as the best technology in the area of cryptocurrency price
Neural networks forecasting. This is the first work to provide a comprehensive comparative analysis of ensemble learning and
deep learning forecasting models, examining their relative performance on various cryptocurrencies (Bitcoin,
Ethereum, Ripple, and Litecoin) and exploring their potential trading applications. The results of this study
reveal that gated recurrent unit, simple recurrent neural network, and LightGBM methods outperform other
machine learning methods, as well as the naive buy-and-hold and random walk strategies. This can effectively
guide investors in the cryptocurrency markets.
* Corresponding author.
E-mail address: m.z.abedin@swansea.ac.uk (M.Z. Abedin).
https://doi.org/10.1016/j.irfa.2023.103055
Received 17 December 2022; Received in revised form 28 November 2023; Accepted 18 December 2023
Available online 27 December 2023
1057-5219/© 2023 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
A. Bouteska et al. International Review of Financial Analysis 92 (2024) 103055
Early work in this area focused primarily on traditional statistical performance of the hybrid two-step forecasting method (Efat et al.,
methods, such as ARIMA (autoregressive integrated moving average) 2022) that combines ARIMA with deep learning methods, thus capturing
(Ibrahim et al., 2021) and GARCH (Baur et al., 2018; Fakhfekh and linear and nonlinear patterns in the cryptocurrency time-series data.
Jeribi, 2020). However, these approaches only capture linear patterns in More importantly, the comparative analysis includes the financial per
the time series of cryptocurrencies and furthermore assume a normal formance of trading strategies based on the machine learning methods
distribution of variables, which is unrealistic in the case of crypto utilized. Thus, this work provides valuable insights into the performance
currencies (Chen et al., 2021; Khedr et al., 2021). of different machine learning models for predicting cryptocurrency
Machine learning approaches can extract nonlinear patterns and also prices, and their potential applications in trading strategies. The results
benefit from large datasets without assuming any prior understanding of suggest that these models can enable investors to make more informed
the data. However, even traditional machine learning methods, such as decisions in the cryptocurrency markets, ultimately leading to better
multilayer perceptron (MLP) neural networks (Kristjanpoller and Min investment outcomes.
utolo, 2018) or support vector machines (SVM) (Hajek et al., 2023; The remainder of the article is organized as follows. Section 2 pro
Moula et al., 2017), suffer from some problems such as susceptibility to vides an overview of previous literature on cryptocurrency price fore
overfitting and do not fully exploit the potential of extracting high-level casting. Section 3 presents the research methodology employed and
hidden patterns from cryptocurrency sequential data. To overcome Section 4 shows the results. This is followed by Section 5, which dis
these problems, deep learning-based forecasting models have been used, cusses the results. Section 6 concludes the study with some future
having the capacity to outperform traditional machine learning methods research directions.
(Chen et al., 2021; Cui et al., 2022; Liu et al., 2021; Ortu et al., 2022).
The recent work of Murray et al. (2023) substantiates this finding, 2. Literature review
demonstrating that long short-term memory (LSTM) and gated recurrent
unit (GRU) neural networks outperform various other statistical and In theory, the value of cryptocurrencies is a reflection of their utility
machine learning methods in terms of forecasting error. This includes as a medium of exchange, which considerably increased over the last ten
not only traditional models such as ARIMA and SVM but also the more years. Given the increasing importance of cryptocurrencies for financial
contemporary temporal fusion transformer (TFT). Another stream of systems, early work in this area focused primarily on cryptocurrency
research has focused on the capacity of ensemble learning approaches to volatilities, which have proved to be large (Klein et al., 2018) and
reduce variance and bias by combining a set of diverse weak learning difficult to predict so far (Fang et al., 2020; Walther et al., 2019).
models (Aggarwal et al., 2020). In their widely acclaimed work, Sun Moreover, empirical evidence also suggests that (Zhang et al., 2018): (1)
et al. (2020) showed that ensemble learning forecasting models cryptocurrency returns have heavily tailed distributions, (2) autocor
outperform individual machine learning models and that gradient relations for relative and absolute returns decay at different rates; (3)
boosting demonstrates better accuracy and robustness compared with cryptocurrencies exhibit a strong leverage effect and volatility clus
the well-known random forest approach. tering; (4) volatility and returns show the long-range dependence; and
Not only does the existing literature fail to provide a comprehensive (5) volatility and price are power-law correlated. These characteristics
comparison of the latest machine learning methods, but previous studies make cryptocurrency price forecasting challenging and investments in
also suggest that different methods may perform differently for different cryptocurrencies much riskier than investments in traditional financial
cryptocurrencies (Yang et al., 2020; Zhang et al., 2021). Moreover, no assets. Fluctuations in the value of cryptocurrency assets have been
comparative study has been found that examines the financial perfor difficult to predict because they are not related to any fundamentals,
mance of cryptocurrency investors from the perspective of different which leads to the hypothesis that the value is mainly influenced by the
trading strategies over different time periods. To bridge this gap, this sentiment of the cryptocurrency market. As shown in the literature, the
work aims to assess the performance of state-of-the-art deep learning price of Bitcoins and many other cryptocurrencies has displayed cycli
and ensemble learning approaches in forecasting the prices of four major cality patterns (also referred to as bubbles) in recent years (Dong et al.,
cryptocurrencies, namely Bitcoin, Ethereum, Ripple, and Litecoin. The 2022; Kyriazis et al., 2020).
selection of these four cryptocurrencies is not only consistent with those Most of the research on cryptocurrency price forecasting has focused
in previous related studies (Altan et al., 2019; Cheng, 2023) but it also on conventional statistical methods. Catania et al. (2019) used a battery
covers a wide range of technologies, applications and market positions, of univariate and multivariate vector autoregression (VAR) models for
making them ideal subjects for comprehensive analysis and forecasting. predicting four major cryptocurrencies: Bitcoin, Ripple, Litecoin, and
In particular, predicting the price of Ethereum can provide insights into Ethereum. Notably, significant improvements in forecasting accuracy
a wider range of blockchain applications, predicting the price of Ripple were reported for the combinations of various univariate forecasting
can provide valuable insights into the integration of cryptocurrency models. Conrad et al. (2018) analyzed the volatility of cryptocurrencies
technologies into traditional banking systems, and predicting the price through the lens of GARCH-MIDAS model to extract the long and short-
of Litecoin alongside Bitcoin can reveal how changes in blockchain term volatility components, finding that S&P 500 volatility significantly
technology affect cryptocurrency market performance. This diversity affected long-term Bitcoin volatility. Likewise, Walther et al. (2019)
ensures that the study's findings will be broadly relevant and provide applied the GARCH-MIDAS framework to forecast the volatilities of five
valuable insights into the dynamics of the cryptocurrency sector. highly capitalized cryptocurrencies as well as the CRIX cryptocurrency
Furthermore, in contrast to earlier research that has tended to evaluate index, investigating the effect of Global Real Economic Activity as a
the forecasting performance in terms of forecasting errors (Chen et al., major driver of long-term cryptocurrency volatility. Results reported by
2021; Murray et al., 2023), here we focus on investor performance by Walther et al. (2019) also suggest that the traditional GARCH model
simulating the buy & sell, long and short trading strategies. To this end, performs poorly in predicting cryptocurrency volatility during bear
two distinct sub-periods are considered in this study, before and after markets, being surpassed even by models based on individual exogenous
Covid-19. Indeed, the Covid-19 pandemic has had a significant impact variables.
on the cryptocurrency market, including changes in market efficiency, Over the last five years, the focus of cryptocurrency price forecasting
peak performance of some cryptocurrencies, and increases in market has shifted to machine learning methods. The work of Kristjanpoller and
capitalization (El Montasser et al., 2022; Jalan et al., 2021). To this line Minutolo (2018) has made a significant contribution to the field by
of research, this study adds an analysis of the predictability of crypto proposing a hybrid MLP neural network-GARCH model to forecast the
currencies in the pre- and post-pandemic period. Unlike previous price volatility of Bitcoin. The results of a thorough analysis of different
comparative studies that have focused only on the combinations of deep GARCH models revealed the benefits of combining linear and nonlinear
learning models (Murray et al., 2023), this paper also examines the models for predicting Bitcoin price volatility. MLP neural network was
2
A. Bouteska et al. International Review of Financial Analysis 92 (2024) 103055
also employed by Nakano et al. (2018) for predicting Bitcoin returns the relevance of ensemble learning and deep learning for automatic
based on a set of technical indicators. Experimental evidence showed cryptocurrency trading.
that the MLP forecasting model outperforms the baseline buy-and-hold
strategy. MLP also performed well when comparing its movement di 3. Research methodology
rection performance against ARIMA, Prophet, and random forest
(Ibrahim et al., 2021). More recently, recurrent neural networks have In this section, the methods used for the construction of forecasting
been utilized, such LSTM and GRU, to automatically extract high-level models are introduced, together with their specifications. The machine
temporal patterns from cryptocurrency time series. These advanced learning methods employed in this study include boosting-based
neural networks with deep learning were specifically developed to ensemble methods, recurrent deep neural networks, and hybrid two-
handle complex sequential data and, therefore, it was not surprising that stage methods integrating ARIMA with recurrent deep neural networks.
MLP and other conventional machine learning methods were out
performed by LSTM in several studies (Chen et al., 2021; Lahmiri and 3.1. Boosting-based ensemble methods
Bekiros, 2019; Li and Dai, 2020).
GRU also produced excellent forecasting performance for four major Given that bagging-based ensemble methods, including random
cryptocurrency prices (Zhang et al., 2021), outperforming not only forest, have not performed well in earlier research (Ibrahim et al., 2021;
traditional machine learning methods but also LSTM-based models. Sun et al., 2020), we decided to examine the performance of boosting-
However, these deep learning-based models have been shown to work based ensemble methods in the current study. The ultimate aim of
effectively, especially in univariate settings (Uras et al., 2020) because boosting is to enhance the accuracy of a sequence of weak prediction
they are not equipped with a feature selection component and therefore models, where each model in the sequence compensates for the errors of
can easily become too complex to learn more challenging temporal its predecessors. As a result, a strong model is produced representing a
patterns (Fu et al., 2022). highly accurate combination of weak models. This approach not only
Ensemble learning methods represent a viable alternative to deep proved to be effective compared with individual and other ensemble
learning models due to their capacity to reduce the bias (boosting learning methods, but also outperformed deep learning models in recent
methods) or variance (bagging methods such as random forest) of in investigations (Manchanda and Aggarwal, 2021). Noteworthy, Ada
dividual machine learning methods (Derbentsev et al., 2020). The model Boost, a traditional boosting approach, exceeded the forecasting per
based on LightGBM (light gradient boosting machine) demonstrated the formance of LSTM and other machine learning methods, including MLP
capacity to outperform the random forest model in forecasting the price and ELM (extreme learning machines) (Manchanda and Aggarwal,
direction of the cryptocurrency market (Sun et al., 2020), thus sug 2021).
gesting that bias reduction is more relevant in the case of cryptocurrency The idea of AdaBoost is that the weights of the data instances that are
prices than variance reduction. Overall, the above studies indicate that accurately predicted by the preceding weak regressor are decreased
the machine learning-based forecasting models outperform those using while the weights of the instances where forecasts deviated from the
conventional statistical methods. This is attributed to the capacity of actual cryptocurrency prices are increased. Thus, successive forecasting
machine learning models to construct generic models easily capturing models increasingly focus on poorly forecasted data instances, and the
nonlinear complex patterns in cryptocurrency data. Recently, there have performance of the overall model is iteratively improved. In other
been two attempts to systematically review the performance of machine words, AdaBoost generates an additive model while the value of loss
learning methods for cryptocurrency price forecasting (Khedr et al., function (bias) is reduced in each iteration.
2021; Ren et al., 2022). Khedr et al. (2021) concluded that LSTM is LightGBM is an enhanced version of AdaBoost, allowing for the
considered to be the best method for predicting cryptocurrency price computationally efficient minimization of an arbitrary differentiable
time series due to its ability to recognize long-term time-series associ loss function. Similarly, as AdaBoost, regression trees are employed as
ations. Ren et al. (2022) also valued the predictive performance of LSTM weak learners in LightGBM. In contrast, the fast and highly efficient
while highlighting that combining different machine learning methods training capacity of LightGBM allows for dealing with large datasets.
has now become a hot research area. While these survey studies focus on This is enabled by exploiting the exclusive feature bundling (into a
providing an overview of existing machine learning methods used for single feature and thus reducing data dimensionality) and gradient-
cryptocurrency price forecasting, this study seeks to conduct a based one-side sampling (by randomly dropping instances with small
comparative empirical analysis of state-of-the-art deep learning and gradients). At the same time, the advantages of the well-known XGBoost
ensemble learning methods to provide support for profitable algorithmic (extreme gradient boosting) are retained, including parallel training,
trading. sparse optimization, multiple loss functions, early stopping, and regu
Algorithmic trading has been actively developing in recent decades larization. The main difference is that LightGBM grows regression trees
due to a combination of factors: the rapid development of machine leafwise, and not level-wise like traditional boosting methods. The
learning methods, the development of technologies for working with objective function of LightGBM is defined as follows:
data and its analysis, the growth of storage and processing capabilities ((∑ )2 (∑ )2 (∑ )2 )
for large amounts of data. In addition, the complexity of trading system gi gi gi
G = 1 2 ∑ i∈IL
/
+ ∑ i∈IR − ∑ i∈I (1)
algorithms used by market participants is growing, since they compete i∈IL hi + λ i∈IR hi + λ i∈I hi + λ
not only with those who do not use automated systems, but also with
each other. In connection with these trends, the study of the applica where IR and IL are the sets of instances of the right and left branches,
bility of various machine learning algorithms to algorithmic trading respectively; gi and hi represent the loss gradient statistics of the first and
problems is an urgent task. This is important not only for companies second order, respectively; and λ is a regularization parameter.
engaged in algorithmic trading, such as hedge funds, but also from the
scientific community because the application of state-of-the-art machine 3.2. Recurrent deep neural networks
learning algorithms to the area under consideration can bring new
knowledge to the development of automated trading systems for cryp Recurrent neural networks (RNNs) are types of neural networks in
tocurrency markets. This paper is devoted to the application of ensemble which links between units generate a controlled sequence, which allows
learning and deep learning to forecast cryptocurrency prices. In cryp for processing sequential data. In contrast to MLP, RNN can process
tocurrency market trading, both the base predictors in ensembles and arbitrary length sequences with its internal memory. Various RNN ar
neural networks with deep learning mimic the actions of trading agents chitectures, ranging from simple to complex, have been introduced. For
on the cryptocurrency market. This study was carried out to investigate cryptocurrency price forecasting, the LSTM and GRU neural networks
3
A. Bouteska et al. International Review of Financial Analysis 92 (2024) 103055
are the most widely used. RNNs, equipped with a self-feedback mech Table 1
anism, have the capacity to handle long-term dependencies in crypto Values and ranges of model hyper-parameters.
currency time-series data. The vanishing gradient represents a major Models Hyper-parameters
limitation of RNNs. To overcome this problem, LSTM neural networks
MLP Hidden layers: [1,2]; the number of hidden units: [10, 20, 30, 40, 50,
were introduced (Yu et al., 2019). Each unit of LSTM is composed of 100]; activation function: [‘relu’, ‘tanh’]; optimizer solver: [‘sgd’,
memory cells that store information updated through the input, forget, ‘adam’]; regularization alpha: [0.0001], learning rate for sgd optimizer:
and output gate. At day t, xt represents the input cryptocurrency price [‘constant’,’invscaling’, ‘adaptive’].
data of the LSTM cell whose output at the previous day is denoted as LSTM Hidden layers: [1, 2]; the number of epochs: [3, 5, 10, 50, 100, 300]; the
number of hidden units [4, 8, 16, 32, 64, 128, 256]; learning rate
ℎt− 1, ct stands for the memory cell value. The calculation process of the [0.001, 0.01, 0.1]; lag days [3, 5, 8, 10]
LSTM unit is conducted as follows: AdaBoost The number of estimators: [10, 20, 30, 40, …, 100]; learning rate:
[0.001, 0.01, 0.1, 1.0]; loss functions: [linear, square, exponential]
it = (Wxi xt + Wℎi ℎt− 1 + Wci ct− 1 + bi ) (2) LightGBM The number of estimators: [10, 20, 30, 40, …, 100]; learning rate:
( ) [0.001, 0.01, 0.1, 1.0].
ft = Wxfxt + Wℎf ℎt− 1 + Wc fct− 1 + bf (3) RNN The number of units: [4, 16, 32, 64, 128].
GRU Hidden layers: [1, 2]; the number of epochs: [3, 5, 10, 50, 100, 300]; the
number of hidden units [4, 8, 16, 32, 64, 128, 256]; learning rate
ct = ft ct− 1 + it tanℎ (Wxc xt + Wℎc ℎt− 1 + bc ) (4)
[0.001, 0.01, 0.1].
4
A. Bouteska et al. International Review of Financial Analysis 92 (2024) 103055
Table 2
Data description.
Cryptocurrency Starting Midpoint date (Covid-19 Ending date No. of observations before midpoint date No. of observations after midpoint date
date timepoint) (before Covid-19) (after Covid-19)
most relevant cryptocurrencies in terms of volume, four most popular partitioned into a training set and testing set following a rolling window
cryptocurrencies were selected for our comparative analysis, namely to match the structure of the time series data, the number of rolling test
BTC/USD, ETH/USD, LTC/USD, and XRP/USD. The trading data was samples is shown in Table 2. Consistent with earlier studies (Corbet
obtained up to August 31, 2023. Table 2 presents the basic character et al., 2022; Livieris et al., 2021), this paper applied the first-order dif
istics of the cryptocurrency data, and Fig. 1 shows the fluctuation of the ferences of daily cryptocurrency logarithmic prices (Fig. 2). It should be
logarithmic cryptocurrency prices. Note that each time series was split noted that while differencing can be a useful approach to dealing with
into two different sub-periods (denoted as before Covid-19 and after non-stationarity in time series data, it does not necessarily eliminate the
Covid-19), given the considerable effect of Covid-19 on cryptocurrency need for complex machine learning models. Complex machine learning
markets. models are often able to capture more complex patterns, handle larger
entire available period was considered as presented in Table 2. datasets, and automatically extract relevant features from the data. In
In the experimental setting, the rolling window cross-validation addition, they can be effective in dealing with complex relationships and
approach (Bhattacharjee et al., 2022; Fuss and Koller, 2016) was used non-linear dynamics that may not be adequately captured by traditional
to split the time series into the training set immediately followed by the quantitative techniques (Shajalal et al., 2023).
testing set. Specifically, the cryptocurrency time-series data were The summary descriptive statistics for the cryptocurrency time series
5
A. Bouteska et al. International Review of Financial Analysis 92 (2024) 103055
Table 3
Summary descriptive statistics for cryptocurrency data.
Cryptocurrency Mean Minimum Maximum St.Dev. Skewness Kurtosis
are presented in Table 3. As can be seen in Table 3, the average daily ensemble learning and deep learning methods, two sets of metrics were
price differences were positive for all the four cryptocurrencies, with used. First, commonly used regression metrics were employed as fol
Ethereum showing the highest returns, while Ripple showed the highest lows: MAPE (mean absolute percentage error), ME (mean error), MAE
standard deviation. In addition, the high kurtosis of all cryptocurrency (mean absolute error), MPE (mean percentage error), RMSE (root mean
data indicates leptokurtic time series, with Ethereum showing the square error), R (correlation coefficient), and MIN-MAX error. Second, a
highest excess kurtosis. Finally, while the price differences of Bitcoin set of metrics useful for evaluating investor performance was used,
and Ethereum were negatively skewed (with a longer left tail), the namely scalar product (SP), return score (Return), long return
opposite result can be observed for the price differences of Litecoin and (Return_long), short return (Return_short), mean directional accuracy
Ripple. (MDA), mean directional accuracy positive (MDA+), and mean direc
To consider the stochastic nature of neural networks, hereinafter the tional accuracy negative (MDA-). The SP of the actual and forecast
results for the used neural networks are reported as an average of 50 values was used to simulate the buy & sell trading strategy, where the
simulation runs. As the datasets varied in terms of sample sizes amount of investment is proportional to the forecast signal. The return
(Table 2), the comparison of the forecasting performance of the used score was used to simulate the trading strategy based on the signals of
methods between different cryptocurrencies is difficult. To consider this the used ensemble learning and deep learning methods. The return score
limitation into account, three naive algorithms were employed to was calculated as the sum of the returns of a particular trading strategy.
represent benchmarks. To this end, in agreement with previous studies The long (short) return simulated the return obtained using the long
(Akyildirim et al., 2021; Caporale et al., 2018; Oyedele et al., 2023), the (short) trading strategy. MDA compares the predicted price direction
following methods were used: random walk (RW), white noise (WN), (upward or downward) to the actual cryptocurrency price direction,
and buy & sell (B&S). The random walk method is based on the while MDA+ and MDA- evaluate the upward and downward directional
Martingale assumption, hence using the last value of cryptocurrency accuracy, respectively.
price as the forecast for the next day value. The white noise method In addition to the three naive methods, several other baseline
relies on a randomly generated cryptocurrency time series with normal methods were used to demonstrate the efficiency of the deep learning
distribution; that is, with the mean calculated as the mean of the training methods, including the traditional ARIMA, MLP, and hybrid two-stage
sample and the variance being equal to the variance of the training ARIMA+MLP methods. Five and ten previous cryptocurrency prices
sample. The buy & sell method replicates a simple strategy of buying a were examined in the experiments and the best results are presented
cryptocurrency for a fixed amount of money every day and selling it at hereinafter.
the end of the day. The results of the experiments are presented in Tables 4–7. From
To comprehensively evaluate the forecasting performance of the Tables 4–7, it can be noted that the compared methods performed
6
A. Bouteska et al. International Review of Financial Analysis 92 (2024) 103055
Table 4
Results of Bitcoin forecasting performance – regression and investor metrics.
Before Covid-19 (2016-04-01 to 2019-12-31)
differently not only on different cryptocurrencies, but also in terms of performed best for Ripple and LightGBM showed superior performance
regression and investor statistics. As for the regression metrics, for Litecoin. For the pre-Covid-19 sub-period, the returns of the best
LightGBM and AdaBoost performed best for Bitcoin and Ethereum across performing methods ranged from 2.68 for Litecoin using AdaBoost to
the sub-periods studied. Different results were obtained for the two 4.93 for Ripple (Simple RNN). For the period following the emergence of
remaining cryptocurrencies with lower market capitalization. For Rip Covid-19, the returns ranged from 2.61 (for Bitcoin using AdaBoost) to
ple, LightGBM surpassed the remaining methods during the pre-Covid- 3.39 (for Litecoin using LightGBM). Exceptional MDA was obtained for
19 sub-period, while MLP and GRU outperformed the other methods all cryptocurrencies, ranging from 67.5% for Ethereum to 75.1% for
in the period following the emergence of Covid-19. Similarly, different Bitcoin. Generally, there was a greater MDA across cryptocurrencies
results are apparent for Litecoin, with the MLP and Simple RNN during the post-Covid-19 pandemic, resulting in improved predictability
demonstrating superior performance prior to and following the of cryptocurrency price trends. This is a rather remarkable result when
appearance of Covid-19, respectively. What is striking here is that, considering balanced performance achieved in all cases in terms of up
except Bitcoin and Ethereum, these methods did not perform similarly ward and downward trend prediction. To compare the investor perfor
well in terms of investor statistics, suggesting that although achieving mance statistically, we conducted a nonparametric Friedman test across
low forecast deviations, these methods failed to capture the direction of the investor metrics. The test uses the Friedman statistics to rank the
the next day's price change. Regarding the investor metrics, Simple RNN forecasting models across the two sub-periods. The Friedman p-value
7
A. Bouteska et al. International Review of Financial Analysis 92 (2024) 103055
Table 5
Results of Ethereum forecasting performance – regression and investor metrics.
Before Covid-19 (2016-04-01 to 2019-12-31)
<0.01 (the Friedman statistics ranged from 35.3 to 56.2) indicates sig effective for some cryptocurrency time series. This finding is in contrast
nificant differences between the compared forecasting models for all the to recent review studies (Khedr et al., 2021; Ren et al., 2022), which
four cryptocurrencies. Among the forecasting models, the Simple RNN highlighted the dominance of LSTM models. This may be because con
ranked first for Ripple, while LightGBM ranked first for Ethereum, Bit ventional models are just as effective as deep learning models, particu
coin and Litecoin. larly when the data are univariate and there is no need to deal with
additional variables or complex relationships. Traditional computa
5. Discussion tional models are based on statistical principles and assumptions that are
appropriate for univariate data, as these models take into account fac
Overall, three different patterns were observed in our forecasting tors such as autocorrelation, seasonality and trend. Therefore, in the
results, with Bitcoin/Ethereum and Ripple/Litecoin representing these context of univariate time series analysis, where there are no additional
patterns. This finding is not surprising given the descriptive statistics of variables or complex relationships to consider, traditional quantitative
their time series. While the expected finding was that ensemble learning techniques can often be sufficient (Castán-Lascorz et al., 2022).
and deep learning methods outperform the conventional statistical We have shown that LSTM models can be overcome by GRU models,
methods and shallow neural networks, this study showed that, at least in even when LSTM is combined with ARIMA. One reasonable explanation
terms of point forecasts, less complex conventional models can be more for this decrease is that the ensemble learning and deep learning
8
A. Bouteska et al. International Review of Financial Analysis 92 (2024) 103055
Table 6
Results of Ripple forecasting performance – regression and investor metrics.
Before Covid-19 (2016-04-01 to 2019-12-31)
methods were overfitted for the less complex time series. The poor inefficient cryptocurrency markets, whereas Bitcoin appears to be the
performance of the hybrid models might be due to the fact that there is a least inefficient market. This complexity effect might also be related to
lot of noise in the residuals, so the hybrid models get overfitted by the greater liquidity in the Bitcoin market (Al-Yahyaee et al., 2020).
noise. Our results indicate that the trading strategies based on deep
Different results were observed for the investor metrics, suggesting learning (for Ripple) or ensemble learning (for Bitcoin, Ethereum, and
that more complex machine learning methods are needed to adequately Litecoin) could allow cryptocurrency investors to effectively predict
perform in terms of forecasting cryptocurrency market direction. We market development, particularly in less complex cryptocurrency mar
have shown that remarkable returns can be achieved by following the kets. The study's findings provide cryptocurrency investors with valu
trading strategy based on the forecasts produced by the LightGBM able insights into effective trading strategies, adjusting their investment
models. This remarkable performance can be attributed to effectively strategy to either take a long position or a short position. The demon
managing large datasets while exploiting its regularization mechanism strated financial effectiveness of deep and ensemble learning techniques
that helps prevent overfitting, making it more robust for cryptocurrency in cryptocurrency trading also offers new tools for financial analysts,
price forecasting (Sun et al., 2020). The highest returns could be ob enhancing their ability to predict market movements. Nonetheless,
tained for Ripple in the pre-Covid-19 period and for Litecoin in the post- policymakers ought to implement financial market interventions with a
Covid-19 period, indicating that these cryptocurrencies are the most view to enhancing the level of transparency and efficiency in these
9
A. Bouteska et al. International Review of Financial Analysis 92 (2024) 103055
Table 7
Results of Litecoin forecasting performance – regression and investor metrics.
Before Covid-19 (2016-04-01 to 2019-12-31)
10
A. Bouteska et al. International Review of Financial Analysis 92 (2024) 103055
guide investors in their trading decisions. In the future, we will examine Derbentsev, V., Matviychuk, A., & Soloviev, V. N. (2020). Forecasting of cryptocurrency
prices using machine learning. In Advanced Studies of Financial Technologies and
more frequent real-time data to better exploit the advantages of deep
Cryptocurrency Markets (pp. 211–231).
learning models. In a similar manner, we seek to utilize multivariate Dong, F., Xu, Z., & Zhang, Y. (2022). Bubbly bitcoin. Economic Theory, 74(3), 9731015.
data, including the determinants of cryptocurrency supply and demand, Efat, M. I. A., Petr, H., Abedin, M. Z., Azad, R. U., Jaber, M. A., Aditya, S., & Hassan, M. K.
in order to compare the performance of univariate and multivariate (2022). Deep learning model using hybrid adaptive trend estimated series for
modelling and forecasting sales. Annals of Operations Research. https://doi.org/
models. In this way, we would also be able to interpret the models more 10.1007/s10479-022-04838-6
effectively in terms of the contribution of the determinants. El Montasser, G., Charfeddine, L., & Benhamed, A. (2022). COVID-19, cryptocurrencies
bubbles and digital market efficiency: Sensitivity and similarity analysis. Finance
Research Letters, 46, Article 102362.
Data availability Fakhfekh, M., & Jeribi, A. (2020). Volatility dynamics of crypto-currencies’ returns:
Evidence from asymmetric and long memory GARCH models. Research in
Data will be made available on request. International Business and Finance, 51, Article 101075.
Fang, T., Su, Z., & Yin, L. (2020). Economic fundamentals or investor perceptions? The
role of uncertainty in predicting long-term cryptocurrency volatility. International
Acknowledgements Review of Financial Analysis, 71, Article 101566.
Fu, E., Zhang, Y., Yang, F., & Wang, S. (2022). Temporal self-attention-based ConvLSTM
network for multivariate time series prediction. Neurocomputing, 501, 162–173.
This research paper was made possible by a grant from the Czech Fuss, R., & Koller, J. (2016). The role of spatial and temporal structure for residential rent
Sciences Foundation (No. 22-22586S). predictions. International Journal of Forecasting, 32(4), 1352–1368.
Guesmi, K., Saadi, S., Abid, I., & Ftiti, Z. (2019). Portfolio diversification with virtual
currency: Evidence from bitcoin. International Review of Financial Analysis, 63, Article
Appendix A. Supplementary data
431437.
Hajek, P., Hikkerova, L., & Sahut, J. M. (2023). How well do investor sentiment and
Supplementary data to this article can be found online at https://doi. ensemble learning predict bitcoin prices? Research in International Business and
org/10.1016/j.irfa.2023.103055. Finance, 64, Article 101836.
Hansun, S., Wicaksana, A., & Khaliq, A. Q. (2022). Multivariate cryptocurrency
prediction: Comparative analysis of three recurrent neural networks approaches.
References Journal of Big Data, 9(1), 1–15.
Ibrahim, A., Kashef, R., & Corrigan, L. (2021). Predicting market movement direction for
Aggarwal, D., Chandrasekaran, S., & Annamalai, B. (2020). A complete empirical bitcoin: A comparison of time series modeling methods. Computers and Electrical
ensemble mode decomposition and support vector machine-based approach to Engineering, 89, Article 106905.
predict bitcoin prices. Journal of Behavioral and Experimental Finance, 27, Article Jalan, A., Matkovskyy, R., & Yarovaya, L. (2021). “Shiny” crypto assets: A systemic look
100335. at gold-backed cryptocurrencies during the COVID-19 pandemic. International Review
Akyildirim, E., Goncu, A., & Sensoy, A. (2021). Prediction of cryptocurrency returns of Financial Analysis, 78, Article 101958.
using machine learning. Annals of Operations Research, 297(1), 3–36. Ji, Q., Bouri, E., Gupta, R., & Roubaud, D. (2018). Network causality structures among
Altan, A., Karasu, S., & Bekiros, S. (2019). Digital currency forecasting with chaotic meta- bitcoin and other financial assets: A directed acyclic graph approach. The Quarterly
heuristic bio-inspired signal processing techniques. Chaos, Solitons & Fractals, 126, Review of Economics and Finance, 70, 203–213.
325–336. Jiang, Y., Wu, L., Tian, G., & Nie, H. (2021). Do cryptocurrencies hedge against EPU and
Al-Yahyaee, K. H., Mensi, W., Ko, H. U., Yoon, S. M., & Kang, S. H. (2020). Why the equity market volatility during COVID-19?–new evidence from quantile
cryptocurrency markets are inefficient: The impact of liquidity and volatility. The coherency analysis. Journal of International Financial Markets Institutions and Money,
North American Journal of Economics and Finance, 52, Article 101168. 72, Article 101324.
Baur, D. G., Dimpfl, T., & Kuck, K. (2018). Bitcoin, gold and the US dollar–a replication Khedr, A. M., Arif, I., El Bannany, M., Alhashmi, S. M., & Sreedharan, M. (2021).
and extension. Finance Research Letters, 25, 103–110. Cryptocurrency price prediction using traditional statistical and machine learning
Bhattacharjee, B., Kumar, R., & Senthilkumar, A. (2022). Unidirectional and techniques: A survey. Intelligent Systems in Accounting, Finance and Management, 28
bidirectional LSTM models for edge weight predictions in dynamic cross-market (1), 3–34.
equity networks. International Review of Financial Analysis, 84, Article 102384. Klein, T., Thu, H. P., & Walther, T. (2018). Bitcoin is not the new gold–a comparison of
Bouri, E., Gupta, R., Tiwari, A. K., & Roubaud, D. (2017). Does bitcoin hedge global volatility, correlation, and portfolio performance. International Review of Financial
uncertainty? Evidence from wavelet-based quantile-in-quantile regressions. Finance Analysis, 59, 105–116.
Research Letters, 23, 87–95. Kristjanpoller, W., & Minutolo, M. C. (2018). A hybrid volatility forecasting framework
Caporale, G. M., Gil-Alana, L., & Plastun, A. (2018). Persistence in the cryptocurrency integrating GARCH, artificial neural network, technical analysis and principal
market. Research in International Business and Finance, 46, 141–148. components analysis. Expert Systems with Applications, 109, 1–11.
Castán-Lascorz, M. A., Jiménez-Herrera, P., Troncoso, A., & Asencio-Cortés, G. (2022). Kyriazis, N., Papadamou, S., & Corbet, S. (2020). A systematic review of the bubble
A new hybrid method for predicting univariate and multivariate time series based on dynamics of cryptocurrency prices. Research in International Business and Finance, 54,
pattern forecasting. Information Sciences, 586, 611–627. Article 101254.
Catania, L., Grassi, S., & Ravazzolo, F. (2019). Forecasting cryptocurrencies under model Lahmiri, S., & Bekiros, S. (2019). Cryptocurrency forecasting with deep learning chaotic
and parameter instability. International Journal of Forecasting, 35(2), 485–501. neural networks. Chaos, Solitons & Fractals, 118, 35–40.
Chen, W., Xu, H., Jia, L., & Gao, Y. (2021). Machine learning model for bitcoin exchange Li, Y., & Dai, W. (2020). Bitcoin price forecasting method based on cnn lstm hybrid
rate prediction using economic and technology determinants. International Journal of neural network model. The Journal of Engineering, 2020(13), 344–347.
Forecasting, 37(1), 28–43. Liu, M., Li, G., Li, J., Zhu, X., & Yao, Y. (2021). Forecasting the price of bitcoin using deep
Cheng, J. (2023). Modelling and forecasting risk dependence and portfolio VaR for learning. Finance Research Letters, 40, Article 101755.
cryptocurrencies. Empirical Economics, 65, 899–924. Livieris, I. E., Stavroyiannis, S., Iliadis, L., & Pintelas, P. (2021). Smoothing and
Chu, J., Zhang, Y., & Chan, S. (2019). The adaptive market hypothesis in the high stationarity enforcement framework for deep learning time-series forecasting. Neural
frequency cryptocurrency market. International Review of Financial Analysis, 64, Computing and Applications, 33(20), 14021–14035.
Article 221231. Manchanda, H., & Aggarwal, S. (2021). Forecasting cryptocurrency time series using
Chu, J., Chan, S., & Zhang, Y. (2020). High frequency momentum trading with Adaboost-based ensemble learning techniques. In Innovations in cyber physical systems
cryptocurrencies. Research in International Business and Finance, 52, Article 101176. (pp. 207–219). Singapore: Springer.
Conrad, C., Custovic, A., & Ghysels, E. (2018). Long-and short-term cryptocurrency Mnif, E., Jarboui, A., & Mouakhar, K. (2020). How the cryptocurrency market has
volatility components: A GARCH-MIDAS analysis. Journal of Risk and Financial performed during COVID 19? A multifractal analysis. Finance Research Letters, 36,
Management, 11(2), 23. Article 101647.
Corbet, S., Lucey, B., Urquhart, A., & Yarovaya, L. (2019). Cryptocurrencies as a financial Moula, F. E., Guotai, C., & Abedin, M. Z. (2017). Credit default prediction modeling: An
asset: A systematic analysis. International Review of Financial Analysis, 62, 182–199. application of support vector machine. Risk Management, 19(2), 158–187.
Corbet, S., Hou, Y. G., Hu, Y., Larkin, C., & Oxley, L. (2020). Any port in a storm: Murray, K., Rossi, A., Carraro, D., & Visentin, A. (2023). On forecasting cryptocurrency
Cryptocurrency safe-havens during the COVID-19 pandemic. Economics Letters, 194, prices: A comparison of machine learning, deep learning, and ensembles. Forecasting,
Article 109377. 5(1), 196–209.
Corbet, S., Hou, Y. G., Hu, Y., Larkin, C., Lucey, B., & Oxley, L. (2022). Cryptocurrency Nakano, M., Takahashi, A., & Takahashi, S. (2018). Bitcoin technical trading with
liquidity and volatility interrelationships during the COVID-19 pandemic. Finance artificial neural network. Physica A: Statistical Mechanics and its Applications, 510,
Research Letters, 45, Article 102137. Article 587609.
Cui, T., Ding, S., Jin, H., & Zhang, Y. (2022). Portfolio constructions in cryptocurrency Ortu, M., Uras, N., Conversano, C., Bartolucci, S., & Destefanis, G. (2022). On technical
market: A CVaR-based deep reinforcement learning approach. Economic Modelling, trading and social media indicators for cryptocurrency price classification through
106078. deep learning. Expert Systems with Applications, 198, Article 116804.
Das, D., Le Roux, C. L., Jana, R. K., & Dutta, A. (2020). Does bitcoin hedge crude oil Oyedele, A. A., Ajayi, A. O., Oyedele, L. O., Bello, S. A., & Jimoh, K. O. (2023).
implied volatility and structural shocks? A comparison with gold, commodity and Performance evaluation of deep learning and boosted trees for cryptocurrency
the US Dollar. Finance Research Letters, 36, Article 101335. closing price prediction. Expert Systems with Applications, 213, Article 119233.
11
A. Bouteska et al. International Review of Financial Analysis 92 (2024) 103055
Patel, J., Shah, S., Thakkar, P., & Kotecha, K. (2015). Predicting stock market index using Uras, N., Marchesi, L., Marchesi, M., & Tonelli, R. (2020). Forecasting bitcoin closing
fusion of machine learning techniques. Expert Systems with Applications, 42(4), price series using linear regression and neural networks models. PeerJ Computer
2162–2172. Science, 6, Article e279.
Rahmani Cherati, M., Haeri, A., & Ghannadpour, S. F. (2021). Cryptocurrency direction Walther, T., Klein, T., & Bouri, E. (2019). Exogenous drivers of bitcoin and
forecasting using deep learning algorithms. Journal of Statistical Computation and cryptocurrency volatility–a mixed data sampling approach to forecasting. Journal of
Simulation, 91(12), 2475–2489. International Financial Markets Institutions and Money, 63, Article 101133.
Ren, Y. S., Ma, C. Q., Kong, X. L., Baltas, K., & Zureigat, Q. (2022). Past, present, and Yang, B., Sun, Y., & Wang, S. (2020). A novel two-stage approach for cryptocurrency
future of the application of machine learning in cryptocurrency research. Research in analysis. International Review of Financial Analysis, 72, Article 101567.
International Business and Finance, 63, Article 101799. Yu, Y., Si, X., Hu, C., & Zhang, J. (2019). A review of recurrent neural networks: LSTM
Shahzad, S. J. H., Balli, F., Naeem, M. A., Hasan, M., & Arif, M. (2022). Do conventional cells and network architectures. Neural Computation, 31(7), 1235–1270.
currencies hedge cryptocurrencies? The Quarterly Review of Economics and Finance, Zhang, G. P. (2003). Time series forecasting using a hybrid ARIMA and neural network
85, 223–228. model. Neurocomputing, 50, 159–175. https://doi.org/10.1016/S0925-2312(01)
Shajalal, M., Petr, H., & Abedin, M. Z. (2023). Product backorder prediction with deep 00702-0
neural network on imbalance data. International Journal of Production Research, 61 Zhang, W., Wang, P., Li, X., & Shen, D. (2018). Some stylized facts of the cryptocurrency
(1), 302–319. market. Applied Economics, 50(55), 5950–5965.
Sun, X., Liu, M., & Sima, Z. (2020). A novel cryptocurrency price trend forecasting model Zhang, Z., Dai, H. N., Zhou, J., Mondal, S. K., García, M. M., & Wang, H. (2021).
based on LightGBM. Finance Research Letters, 32, Article 101084. Forecasting cryptocurrency price using convolutional neural networks with
Tuwiner, J. (2022). 63+ cryptocurrency statistics, facts & trends. Available at https://b weighted and attentive memory channels. Expert Systems with Applications, 183,
uybitcoinworldwide.com/cryptocurrency-statistics/. Article 115378.
12