Demand Forecasting - ML
Demand Forecasting - ML
A. Introduction .............................................................................................................................1
What is demand forecasting, and why is it important across various industries? .........................1
What are the main challenges of traditional demand forecasting methods? ................................3
How can machine learning offer innovative solutions for these challenges? ................................8
B. Applications of Machine Learning in Demand Forecasting ....................................................... 11
How does machine learning utilize historical sales data, customer behavior, seasonality, and
other influencing factors? ....................................................................................................... 11
What are the main machine learning approaches used for demand forecasting (e.g., regression,
classification, neural networks)? ............................................................................................. 15
How do modern machine learning methods compare to traditional forecasting approaches? .... 20
C. Benefits and Challenges ........................................................................................................ 23
What are the main advantages of using machine learning for demand forecasting? ................... 23
How does machine learning improve accuracy, reduce costs, and identify complex patterns? .. 28
What are the primary challenges faced when implementing machine learning in this domain? .. 32
Why is high-quality data crucial for machine learning models? ................................................. 35
D. Case Studies ......................................................................................................................... 39
What are some real-world examples of companies successfully using machine learning for
demand forecasting? .............................................................................................................. 39
E. Conclusion ............................................................................................................................ 42
What are the key takeaways from this discussion? ................................................................... 42
What suggestions can be made for implementing machine learning in similar projects? ........... 42
What does the future hold for machine learning in demand forecasting? .................................. 43
A. Introduction
What is demand forecasting, and why is it important across various
industries?
Definition: Demand forecasting is the process of estimating the future demand for a product or
service over a specific period. This involves analyzing historical data, market trends, and
various influencing factors to predict future demand accurately 1 2 3.
Importance Across Industries:
2. Transportation:
3. Automotive Industry:
4. Fashion Industry:
o Production Planning: Yearly and real-time demand forecasts are essential due to
the difficulty in adjusting production rates quickly. Accurate forecasts increase
system responsiveness and improve transshipment policies 8.
5. Electricity Markets:
• Advanced Methods: Incorporate AI, machine learning, and predictive analytics to enhance
accuracy and efficiency 7 14 15 16.
1. https://www.scopus.com/record/display.uri?eid=2-s2.0-85173550147&origin=scopusAI
2. https://www.scopus.com/record/display.uri?eid=2-s2.0-85207581439&origin=scopusAI
3. https://www.scopus.com/record/display.uri?eid=2-s2.0-85056654236&origin=scopusAI
4. https://www.scopus.com/record/display.uri?eid=2-s2.0-85128210808&origin=scopusAI
5. https://www.scopus.com/record/display.uri?eid=2-s2.0-85132528527&origin=scopusAI
6. https://www.scopus.com/record/display.uri?eid=2-s2.0-85164384292&origin=scopusAI
7. https://www.scopus.com/record/display.uri?eid=2-s2.0-85166318714&origin=scopusAI
8. https://www.scopus.com/record/display.uri?eid=2-s2.0-84929598025&origin=scopusAI
9. https://www.scopus.com/record/display.uri?eid=2-s2.0-85126796786&origin=scopusAI
10. https://www.scopus.com/record/display.uri?eid=2-s2.0-72249094153&origin=scopusAI
11. https://www.scopus.com/record/display.uri?eid=2-s2.0-85132424053&origin=scopusAI
12. https://www.scopus.com/record/display.uri?eid=2-s2.0-85132424053&origin=scopusAI
13. https://www.scopus.com/record/display.uri?eid=2-s2.0-85175437341&origin=scopusAI
14. https://www.scopus.com/record/display.uri?eid=2-s2.0-85083453913&origin=scopusAI
15. https://www.scopus.com/record/display.uri?eid=2-s2.0-85111639899&origin=scopusAI
16. https://www.scopus.com/record/display.uri?eid=2-s2.0-85196756650&origin=scopusAI
• Synthetic Data Generation: Traditional methods struggle with limited data, especially for
new businesses. ML techniques like Conditional Wasserstein Generative Adversarial
Networks (CWGAN-GP) can generate synthetic data to augment small datasets, improving
the accuracy of demand forecasts 1.
2. Incorporating External Factors
• Leveraging Historical and Expert Data: For products with no historical data or frequent
innovations, ML can combine historical data of similar products with expert forecasts,
significantly reducing forecast errors 3.
• Graph Convolution Networks: For intricate supply chains, ML methods like graph
convolution networks can handle non-Euclidean data, refining demand forecasting
precision by considering the connectivity and relationships within the supply chain 12.
• Dynamic Forecasting: ML models can adapt to rapid market changes and economic
instability, providing more reliable forecasts in uncertain environments 13.
Summary Table
Abstract
Challenge ML Solution
Reference
Data Scarcity Synthetic Data Generation 1
Incorporating
Inclusion of Leading Indicators 2
External Factors
Seasonal/Rapidly
Changing Combining Historical and Expert Data 3
Products
Enhancing
Advanced Algorithms, Hybrid
Forecast 4567
Approaches
Accuracy
Reducing
Data-Driven Methods 8
Forecasting Bias
Improving Supply
Operational Enhancements 9 10
Chain Efficiency
Customizing
Demand Customer-Centric Forecasting 11
Forecasts
Complex Supply
Graph Convolution Networks 12
Chains
Adapting to
Dynamic Forecasting 13
Market Changes
Integrating
Multiple Data Data Consolidation 14
Sources
Machine learning provides robust and versatile solutions to overcome the limitations of
traditional demand forecasting methods, enhancing accuracy, efficiency, and adaptability
in various business contexts.
Traditional demand forecasting methods face several significant challenges that impact
their accuracy and reliability. These challenges can be broadly categorized into data-related
issues, model limitations, and external factors.
Data-Related Issues
• Large Volume of Data: Traditional methods often struggle to handle the vast amounts of
data generated, which can lead to inefficiencies and inaccuracies in forecasts 1.
• Limited Historical Data: Forecasting demand for new products or in scenarios with limited
historical data is particularly challenging. Traditional models typically require extensive and
consistent historical data to produce accurate forecasts 2 3.
• Missing Data: Traditional methods often fail to accurately predict demand when there is
missing data, leading to unreliable forecasts 4.
Model Limitations
• Inflexibility: Traditional models, such as time series analysis and regression analysis, may
not adapt well to the varying demand patterns of different products. This inflexibility can
result in poor performance for products with sporadic or highly variable demand 5 6.
• Accuracy Metrics: Traditional accuracy metrics used to evaluate forecasts do not always
account for the complexities of demand patterns, such as horizontal and vertical shifts over
the forecasting horizon. This can lead to suboptimal method selection and inaccurate
forecasts 5.
External Factors
• Supply Chain Complexity: The complexity of modern supply chains, with their long lead
times and global sourcing, increases the uncertainty of demand. Traditional methods often
struggle to provide accurate forecasts in such complex environments 10.
Summary Table
These challenges highlight the need for more advanced and flexible forecasting methods
that can better handle the complexities and uncertainties of modern demand forecasting.
1. https://www.scopus.com/record/display.uri?eid=2-s2.0-
84939566884&origin=scopusAI
2. https://www.scopus.com/record/display.uri?eid=2-s2.0-
85087087995&origin=scopusAI
3. https://www.scopus.com/record/display.uri?eid=2-s2.0-
85149760540&origin=scopusAI
4. https://www.scopus.com/record/display.uri?eid=2-s2.0-
85191653473&origin=scopusAI
5. https://www.scopus.com/record/display.uri?eid=2-s2.0-
85104188541&origin=scopusAI
6. https://www.scopus.com/record/display.uri?eid=2-s2.0-
85111309932&origin=scopusAI
7. https://www.scopus.com/record/display.uri?eid=2-s2.0-
85129094942&origin=scopusAI
8. https://www.scopus.com/record/display.uri?eid=2-s2.0-
85179547832&origin=scopusAI
9. https://www.scopus.com/record/display.uri?eid=2-s2.0-
85183539983&origin=scopusAI
10. https://www.scopus.com/record/display.uri?eid=2-s2.0-
85115150587&origin=scopusAI
• Synthetic Data Generation: Traditional methods struggle with limited data, especially for
new businesses. ML techniques like Conditional Wasserstein Generative Adversarial
Networks (CWGAN-GP) can generate synthetic data to augment small datasets, improving
the accuracy of demand forecasts 1.
• Leveraging Historical and Expert Data: For products with no historical data or frequent
innovations, ML can combine historical data of similar products with expert forecasts,
significantly reducing forecast errors 3.
• Graph Convolution Networks: For intricate supply chains, ML methods like graph
convolution networks can handle non-Euclidean data, refining demand forecasting
precision by considering the connectivity and relationships within the supply chain 12.
• Dynamic Forecasting: ML models can adapt to rapid market changes and economic
instability, providing more reliable forecasts in uncertain environments 13.
• Data Consolidation: ML can utilize data from various sources, including data
consolidators, to improve forecast accuracy by identifying useful data points that traditional
methods might miss 14.
Summary Table
https://www.scopus.com/record/display.uri?eid=2-s2.0-85210119605&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85159788263&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85200510353&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85171882818&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85115150587&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85197424943&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85089012778&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85124668866&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85205114375&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85175116322&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85160494340&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85208599063&origin=scopusAI
https://www.scopus.com/sourceid/21100867241?origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85113219108&origin=scopusAI
B. Applications of Machine Learning in Demand
Forecasting
How does machine learning utilize historical sales data, customer
behavior, seasonality, and other influencing factors?
Historical Sales Data
• Predictive Modeling: Machine learning algorithms analyze historical sales data to identify
patterns and trends. This data serves as the foundation for training models to predict future
sales 1 2 3.
• Feature Engineering: Historical data is used to extract relevant features that can improve
the accuracy of predictions. This includes cleaning the data and handling outliers 4 5.
Customer Behavior
• Behavioral Analysis: Machine learning models incorporate customer behavior data, such
as purchase history, browsing patterns, and demographic information, to predict future
sales and customer spending 6 7.
Seasonality
• Seasonal Trends: Machine learning models account for seasonality by identifying recurring
patterns in sales data, such as increased sales during holidays or specific times of the
year 1 9 10.
• Time Series Analysis: Techniques like Seasonal Trend Decomposition with Loess (STL) and
ARIMA models are used to decompose time series data into seasonal, trend, and residual
components, improving forecast accuracy 5 11.
• External Factors: Machine learning models consider external factors such as economic
indicators, public holidays, promotional periods, and macroeconomic conditions to refine
predictions 1 9 12.
• Real-Time Data Integration: Advanced models continuously learn from new data,
adjusting predictions based on real-time market conditions and external events 3 6.
Performance Metrics
• Evaluation: Models are evaluated using metrics such as Root Mean Squared Error (RMSE),
Mean Absolute Error (MAE), and R squared to ensure accuracy and reliability 2 9 16.
Applications
• Marketing Strategies: Insights from predictive models guide marketing strategies, enabling
targeted promotions and efficient resource allocation 4 8.
References
Proceedings - 2024 8th International Conference on Inventive Systems and Control, ICISC
2024, 2024
https://www.scopus.com/record/display.uri?eid=2-s2.0-85206113506&origin=scopusAI
2023 International Conference on Data Science, Agents and Artificial Intelligence, ICDSAAI
2023, 2023
https://www.scopus.com/record/display.uri?eid=2-s2.0-85187794015&origin=scopusAI
2nd IEEE International Conference on Data Science and Information System, ICDSIS 2024,
2024
https://www.scopus.com/record/display.uri?eid=2-s2.0-85200154631&origin=scopusAI
4. Big Mart Sales using Hybrid Learning Framework with Data Analysis
https://www.scopus.com/record/display.uri?eid=2-s2.0-85185386385&origin=scopusAI
5. Sales Prediction Using ARIMA, Facebook’s Prophet and XGBoost Model of Machine
Learning
https://www.scopus.com/record/display.uri?eid=2-s2.0-85148042329&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85209124118&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85100897840&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85179962490&origin=scopusAI
9. A comparative study of traditional forecasting methodologies vs machine learning
algorithms
https://www.scopus.com/record/display.uri?eid=2-s2.0-85095449372&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85079650570&origin=scopusAI
11. Sales prediction of Four Wheelers Unit (4W) with seasonal algorithm Trend
Decomposition with Loess (STL) in PT. Astra International, Tbk
https://www.scopus.com/record/display.uri?eid=2-s2.0-85075986080&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85172235181&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85191179251&origin=scopusAI
14. Enhancing Sales Forecasting and Prediction with Cutting-Edge Machine Learning
Methods
2nd IEEE International Conference on Data Science and Information System, ICDSIS 2024,
2024
https://www.scopus.com/record/display.uri?eid=2-s2.0-85200146927&origin=scopusAI
Pavlyshenko, B.M.
Data, 2019
https://www.scopus.com/record/display.uri?eid=2-s2.0-85070878115&origin=scopusAI
16. Using “Machine Learning” Techniques in Increasing the Efficiency of Sales Forecasting
in Albania
https://www.scopus.com/record/display.uri?eid=2-s2.0-85183374111&origin=scopusAI
What are the main machine learning approaches used for demand
forecasting (e.g., regression, classification, neural networks)?
To address the query on the main machine learning approaches used for demand
forecasting, the provided abstracts offer a comprehensive overview of various techniques
and their applications. Here are the key approaches:
1. Regression Models
• Support Vector Regression (SVR): Effective for both univariate and multivariate time series
forecasting, providing accurate predictions 3.
2. Classification Models
• Decision Trees: Useful for their simplicity and ability to handle non-linear relationships 1 4.
3. Neural Networks
• Artificial Neural Networks (ANN): Demonstrated to outperform traditional methods like
Moving Average (MA) and Seasonal Autoregressive Integrated Moving Average (SARIMA) in
various studies 4 5.
• Long Short-Term Memory (LSTM): A type of recurrent neural network (RNN) particularly
effective for time series forecasting due to its ability to capture long-term dependencies 3 6.
• Multi-Layer Perceptron (MLP): Used for its flexibility and capability to model complex
patterns 4.
4. Ensemble Learning
• Boosting Methods: Such as gradient boosting, which have shown strong performance in
demand forecasting 4 7.
• Seasonal ARIMA (SARIMA): Incorporates seasonality in time series data but is often
outperformed by neural networks 5.
6. Advanced Techniques
• Extreme Learning Machine (ELM): Not widely used but shown to be effective for
intermittent demand prediction 4.
• Automated Machine Learning (AutoML): Tools like AutoTS for time series forecasting,
which streamline the model selection and tuning process 7.
Evaluation Metrics
Summary Table
Captures complex
patterns, long-term
Neural
ANN, LSTM, MLP dependencies,
Networks outperforms traditional
methods
Combines strengths of
Ensemble Boosting, Stacking, SBFA- multiple models,
Learning NN improves prediction
accuracy
References
https://www.scopus.com/record/display.uri?eid=2-s2.0-85183555810&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85191179251&origin=scopusAI
3. E-commerce time series forecasting using LSTM neural network and support vector
regression
https://www.scopus.com/record/display.uri?eid=2-s2.0-85046629234&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85124668866&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85177847499&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85171882818&origin=scopusAI
Pavlyshenko, B.M.
Data, 2019
https://www.scopus.com/record/display.uri?eid=2-s2.0-85070878115&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85203215309&origin=scopusAI
10. Ensemble Learning Models for Large-Scale Time Series Forecasting in Supply Chain
Proceedings - 2023 IEEE 22nd International Conference on Trust, Security and Privacy in
Computing and Communications, TrustCom/BigDataSE/CSE/EUC/iSCI 2023, 2023
https://www.scopus.com/record/display.uri?eid=2-s2.0-85195465190&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85135034592&origin=scopusAI
12. Revealing the dynamics of demand forecasting in supply chain management: a holistic
investigation
https://www.scopus.com/record/display.uri?eid=2-s2.0-85197424943&origin=scopusAI
13. Machine Learning and Deep Learning Models for Demand Forecasting in Supply Chain
Management: A Critical Review
https://www.scopus.com/record/display.uri?eid=2-s2.0-85207712011&origin=scopusAI
14. Machine Learning techniques for Supply Chain Management: A Systematic Literature
Review
https://www.scopus.com/record/display.uri?eid=2-s2.0-85132578870&origin=scopusAI
Performance Comparison
• Forecast Horizon: ML methods tend to perform better for medium to long-term forecasts,
while traditional methods like ARIMA are more suitable for short-term predictions 4 5. This
is because ML models can capture complex patterns and dependencies over longer
periods.
• Handling Non-linear Patterns: Traditional methods often assume linearity and may not
handle non-linear patterns effectively. In contrast, ML models, especially deep learning
techniques like LSTM and Convolutional Neural Networks (CNN), are adept at capturing
non-linear relationships in the data 7 8.
Computational Demands
• Execution Time: The execution time for ML models can be higher due to the complexity of
the algorithms and the need for extensive data preprocessing and model tuning 6 8.
Hybrid Approaches
• Combining Methods: Hybrid models that integrate both traditional and ML methods can
sometimes offer the best of both worlds. However, the effectiveness of hybrid models can
vary depending on the specific application and data characteristics 2 7.
Summary Table
In conclusion, modern machine learning methods generally offer improved accuracy and
flexibility over traditional forecasting approaches, particularly for medium to long-term
forecasts and when external factors are involved. However, they come with higher
computational demands, which can be a consideration depending on the
application 1 2 4 8.
References
https://www.scopus.com/record/display.uri?eid=2-s2.0-85095449372&origin=scopusAI
2. Forecasting performance of machine learning, time series, and hybrid methods for low-
and high-frequency time series
https://www.scopus.com/record/display.uri?eid=2-s2.0-85175736067&origin=scopusAI
3. Time series forecasting of total daily solar energy generation: A comparative analysis
between ARIMA and machine learning techniques
https://www.scopus.com/record/display.uri?eid=2-s2.0-85099344259&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85164589664&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-84955176870&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85207235622&origin=scopusAI
7. Adaptive Solar Power Generation Forecasting using Enhanced Hybrid Function Networks
with Weather Modulation
https://www.scopus.com/record/display.uri?eid=2-s2.0-85196670587&origin=scopusAI
8. Comparison of Machine Learning and Statistical Predictive Models for Production Time
Series Forecasting in Tight Oil Reservoirs
https://www.scopus.com/record/display.uri?eid=2-s2.0-85152196404&origin=scopusAI
1. Improved Accuracy
o ML techniques can capture complex patterns in data that traditional methods might
miss. This includes the ability to incorporate and analyze various internal and
external factors, such as economic indicators, weather data, and market trends,
leading to more precise forecasts 7 8 9.
o ML models can be adapted and scaled to different forecasting horizons, from short-
term to long-term predictions. Techniques like Long Short-Term Memory (LSTM)
networks are particularly effective for long-horizon forecasting 5 10. Additionally, ML
models can be continuously updated with new data, improving their accuracy over
time 8.
4. Operational Efficiency
o Automated ML (AutoML) tools can streamline the forecasting process, reducing the
need for extensive human intervention and expertise in data science. This makes ML
more accessible and economically attractive for various industries 9.
7. Enhanced Decision-Making
Summary Table
Optimizes inventory
Operational management, reduces
2 5 11
Efficiency stockouts, and balances
inventory levels.
Automation
AutoML tools reduce the need
and
for human intervention, making 9
Reduced ML more accessible.
Labor
Enhanced Enables better production
Decision- planning, inventory control, and 1 11 13
Making logistics.
In conclusion, the use of machine learning for demand forecasting offers significant
advantages in terms of accuracy, efficiency, and adaptability, making it a valuable tool for
modern businesses.
References
https://www.scopus.com/record/display.uri?eid=2-s2.0-85171882818&origin=scopusAI
IFAC-PapersOnLine, 2019
https://www.scopus.com/record/display.uri?eid=2-s2.0-85078945415&origin=scopusAI
3. Applying Machine Learning Techniques to Forecast Demand in a South African Fast-
Moving Consumer Goods Company
https://www.scopus.com/record/display.uri?eid=2-s2.0-85177847499&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85135034592&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85197424943&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85149152578&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85159788263&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85185728600&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85164413933&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85092065037&origin=scopusAI
11. The Impact of Machine Learning (ML) Optimization of supply chain management
domain
https://www.scopus.com/record/display.uri?eid=2-s2.0-85178298993&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85175665323&origin=scopusAI
13. Customer Sale Analysis and Classification Using Machine Learning Algorithm
https://www.scopus.com/record/display.uri?eid=2-s2.0-85187794015&origin=scopusAI
How does machine learning improve accuracy, reduce costs, and identify
complex patterns?
To address how machine learning (ML) improves accuracy, reduces costs, and identifies
complex patterns, we can draw insights from several abstracts provided.
Improving Accuracy
• Model Training and Data Quality: The accuracy of ML models can be significantly
enhanced by building relevant models and using high-quality datasets. The quality of the
data used for training is crucial, as poor data can lead to inaccurate models 1. Additionally,
ensemble methods like boosting, stacking, and bagging can further improve the accuracy of
ML algorithms by combining multiple models to produce better results 2.
• Handling Large Data Volumes: ML algorithms can process and analyze large volumes of
data more accurately and faster than humans, which is particularly beneficial in fields like
healthcare for disease diagnosis and treatment planning 3 4.
Reducing Costs
• Operational Efficiency: ML can optimize various business processes, reducing the need
for human intervention and thus lowering labor costs. For example, in supply chain
management, ML can enhance efficiency by optimizing logistics and inventory
management, leading to cost savings 5 6.
• Automation: By automating repetitive tasks, ML reduces the time and resources required
for these tasks. This is evident in applications like customer service, where natural language
processing (NLP) and sentiment analysis can automate responses to customer queries,
reducing the need for human customer service representatives 7 8.
• Pattern Recognition: ML excels at identifying complex patterns in data that are not easily
discernible by humans. This capability is used in various domains, such as detecting fraud
in financial transactions, recognizing speech and images, and diagnosing diseases from
medical images 3 9 10.
• Sequential Pattern Mining: Advanced ML techniques like sequential pattern mining can
discover interesting patterns in complex datasets, which can be applied to large-scale data
analysis tasks 11.
• Big Data Analysis: ML algorithms can analyze big data to identify patterns and interactions
among features, providing insights that can be used for personalized predictions and
stratification of outcomes in fields like healthcare 12.
Summary Table
- Disease diagnosis,
- Handling large data volumes 3 4
treatment planning 3 4
- Industrial equipment
- Predictive maintenance 6
maintenance 6
- Fraud detection,
Identifying speech/image
- Pattern recognition 3 9 10
Patterns recognition, disease
diagnosis 3 9 10
- Large-scale data
- Sequential pattern mining 11
analysis 11
- Personalized healthcare
- Big data analysis 12
predictions 12
References
Proceedings - 2018 IEEE International Conference on Big Data, Big Data 2018, 2018
https://www.scopus.com/record/display.uri?eid=2-s2.0-85062625299&origin=scopusAI
2. Systematic look at machine learning algorithms - Advantages, disadvantages and
practical applications
https://www.scopus.com/record/display.uri?eid=2-s2.0-85099729222&origin=scopusAI
Kamogashira, T.
https://www.scopus.com/record/display.uri?eid=2-s2.0-85142606568&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85208280813&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85100171972&origin=scopusAI
6. Industrial Organization in the Industry 4.0 Era: Leveraging Machine Learning for
Competitive Advantage
https://www.scopus.com/record/display.uri?eid=2-s2.0-85191247734&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85163538869&origin=scopusAI
8. Machine learning and role of artificial intelligence in optimizing work performance and
employee behavior
Ramachandran, K.K., Apsara Saleth Mary, A., Hawladar, S., (...), Pitroda, J.R.
https://www.scopus.com/record/display.uri?eid=2-s2.0-85127463614&origin=scopusAI
Handbook of Artificial Intelligence and Wearables: Applications and Case Studies, 2024
https://www.scopus.com/record/display.uri?eid=2-s2.0-85190912610&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85127402449&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85099260417&origin=scopusAI
12. Big data and machine learning meet the health sciences
Cavalcante Passos, I., Ballester, P., Vinícius Pinto, J., (...), Kapczinski, F.
1. Data-Related Challenges:
• Data Sensitivity and Specificity: Reusing ML applications across different domains can be
difficult due to the sensitivity and specificity of data 1.
• Data Quality and Availability: Ensuring high-quality data and dealing with limited data
availability are critical issues. Poor data quality can significantly impact the performance of
ML models 2 3.
• Class Imbalance: Imbalanced datasets, where certain classes are underrepresented, can
hinder the efficiency of ML algorithms, affecting overall accuracy and decision-making 4.
• Integration with Existing Systems: Incorporating ML models into existing software systems
can be complex and costly, as traditional software engineering practices may not directly
apply to ML 7 8.
3. Human Factors:
• Human-Centered AI: Bridging the gap between theoretical ML models and practical
applications requires considering human-centered AI principles. This includes ensuring
that ML systems are understandable and usable by non-experts 2 10.
4. Domain-Specific Constraints:
• Bias and Fairness: Designing ML systems that account for systemic biases and ensure
fairness is crucial, especially in human-facing applications like healthcare 3.
• Operational Challenges: Translating prediction algorithms into operational models is time-
consuming and involves addressing issues like data drift detection, infrastructure
standardization, and expectation management 11.
Category Challenges
Data-
Data sensitivity, data quality, data availability, class imbalance
Related
System
Scalability, integration with existing systems, automation
Engineering
Human
Human-centered AI, algorithm aversion
Factors
Domain-
Bias and fairness, operational challenges
Specific
Addressing these challenges requires a multidisciplinary approach, combining insights
from data science, software engineering, and domain expertise to develop robust and
effective ML solutions.
References
https://www.scopus.com/record/display.uri?eid=2-s2.0-85151945184&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85090177992&origin=scopusAI
Singh, H.
AIES 2022 - Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society, 2022
https://www.scopus.com/record/display.uri?eid=2-s2.0-85137155055&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85105020930&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85079186867&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85019653397&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85164698905&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85116627638&origin=scopusAI
Wagstaff, K.L.
https://www.scopus.com/record/display.uri?eid=2-s2.0-84945932793&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85087106418&origin=scopusAI
High-quality data is essential for the performance and reliability of machine learning (ML)
models. Here are several reasons why:
• Data Quality and Model Accuracy: The performance of ML models is directly tied to the
quality of the data used for training. High-quality data ensures that the models can learn
accurate patterns and make reliable predictions. Poor data quality, such as missing values
or mislabeled data, can lead to inaccurate analytics and unreliable decisions 1 2 3 4.
• Impact of Errors: Errors in datasets, such as mislabeled data or missing values, can
significantly impair model performance. For instance, in intrusion detection systems, errors
like mislabeled traffic can reduce the model's ability to generalize and accurately classify
unseen data 3.
• Handling Missing Data: Effective imputation methods for handling missing data are
necessary to maintain the integrity of the dataset and ensure robust model performance.
Various imputation techniques can help mitigate the impact of missing data on
downstream ML tasks 2.
• Data Quality Evaluation: Evaluating and ensuring data quality can streamline the ML
pipeline, reducing the need for iterative debugging and improving overall efficiency. This
involves assessing data across various metrics and addressing quality gaps 1 4 6.
• Data Markets and Acquisition: High-quality data is often costly and time-consuming to
obtain. Data markets can help identify and acquire valuable training data, ensuring that
models are trained on the best possible datasets 7.
• Bias Mitigation: High-quality data helps in reducing biases that can arise from poor data
collection practices. Ensuring data accuracy, completeness, and relevance is crucial to
avoid unfair or unethical outcomes in model predictions 5 8.
• Model Interpretability: The quality of data affects the transparency and interpretability of
ML models. High-quality data allows for clearer insights into the decision-making processes
of models, which is important for gaining trust and understanding in various applications 5.
Summary Table
Bias and
Mitigates biases, ensures ethical outcomes 5 8
Fairness
Interpretability Enhances transparency and understanding of model decisions 5
In conclusion, high-quality data is fundamental to the success of machine learning models,
impacting their accuracy, generalization, efficiency, fairness, and interpretability. Ensuring
data quality through rigorous evaluation and addressing quality gaps is crucial for
developing robust and reliable ML applications.
References
https://www.scopus.com/record/display.uri?eid=2-s2.0-85161509739&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85111070078&origin=scopusAI
2024 IEEE Conference on Communications and Network Security, CNS 2024, 2024
https://www.scopus.com/record/display.uri?eid=2-s2.0-85210569442&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85113837999&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85090410666&origin=scopusAI
7. Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using
an Adaptive Sampling Algorithm
https://www.scopus.com/record/display.uri?eid=2-s2.0-85174394551&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85134432301&origin=scopusAI
2023 IEEE Global Conference on Artificial Intelligence and Internet of Things, GCAIoT 2023,
2023
https://www.scopus.com/record/display.uri?eid=2-s2.0-85184664087&origin=scopusAI
D. Case Studies
What are some real-world examples of companies successfully using
machine learning for demand forecasting?
Real-World Examples of Companies Using Machine Learning for Demand Forecasting
• Multinational Retail Chain: A multinational retail chain in the United States utilized
machine learning models such as XGBoost, random forest, and gradient boosting to
forecast weekly sales across 45 stores over three years. The XGBoost model outperformed
the others, providing the most accurate predictions 1.
• Food Industry: In the food industry, a company used a combination of Empirical Mode
Decomposition (EMD) and Support Vector Machines (SVM) optimized with Particle Swarm
Optimization (PSO) to forecast non-stationary demand. This approach showed superior
performance compared to traditional methods 3.
• Poultry Sector in Peru: A poultry company in Peru implemented machine learning for
demand forecasting, achieving a high forecast accuracy of 97.56% and a low forecast bias
of 2.44% 5.
• Dyeing Company: A dyeing company used machine learning tools such as random forests,
support vector regression, and artificial neural networks to create demand models for the
next five years. These models significantly contributed to the company's strategic decision-
making 6.
• Austrian B2B Hygiene Product Retailer: An Austrian B2B hygiene product retailer used
various machine learning methods, including demand forecasting, to improve business
intelligence and enhance customer acquisition and retention. The implementation of
dashboards integrating these methods enabled better decision-making 9.
These examples illustrate the diverse applications and significant benefits of machine
learning in demand forecasting across various sectors, leading to improved accuracy,
operational efficiency, and strategic planning.
References
https://www.scopus.com/record/display.uri?eid=2-s2.0-85144189469&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85111639899&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85028764401&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85177847499&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85149152578&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85151135827&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85159788263&origin=scopusAI
https://www.scopus.com/record/display.uri?eid=2-s2.0-85195992706&origin=scopusAI
9. From Data to Decisions: Optimizing Supply Chain Management with Machine Learning-
Infused Dashboards
https://www.scopus.com/record/display.uri?eid=2-s2.0-85195365568&origin=scopusAI
E. Conclusion
What are the key takeaways from this discussion?
1. Critical Role of Demand Forecasting: Accurate demand forecasting is crucial for
optimizing supply chain management, production planning, and inventory control across
industries like retail, automotive, food, and manufacturing.
3. Challenges: Data quality, integration with existing systems, scalability, and human factors
like algorithm aversion remain significant obstacles.
4. Industry Applications: ML has been successfully applied in diverse sectors, such as retail,
FMCG, automotive, and even poultry, showcasing its versatility and effectiveness.
o Employ data augmentation or synthetic data generation for scenarios with limited
data.
o Select models based on the data type, volume, and complexity. For example, LSTM
is ideal for time series, while Random Forests excel with categorical variables.
o Ensure the chosen solution can scale with growing data volumes and integrate
seamlessly with existing business workflows.
o Deploy ML models that continuously learn and adapt from new data to maintain
relevance in dynamic environments.
What does the future hold for machine learning in demand forecasting?
1. Increased Adoption of Advanced Algorithms:
o Deep learning models like transformers may become standard for analyzing
complex datasets and producing more accurate forecasts.
o IoT devices and edge computing will enable real-time data collection and
processing, enhancing the precision and timeliness of forecasts.
o Blockchain can add transparency and trust in supply chains, while ML optimizes
forecasting based on this data.
o AutoML tools will democratize the use of ML, making it accessible to businesses
with limited data science expertise.