0% found this document useful (0 votes)

123 views21 pages

Project 6 - Time Series PDF

- The document describes using an ARIMA model to forecast monthly gas production in Australia from 1956 to 1995. - The time series is found to be non-stationary and differencing is used to make it stationary. - Both manual and automatic ARIMA models are developed and their accuracy is evaluated on training and test data. The automatic ARIMA model is found to have better accuracy on the test set based on error metrics.

Uploaded by

Akshita Raut

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

123 views21 pages

Project 6 - Time Series PDF

Uploaded by

Akshita Raut

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

Project 6 – Time Series

Forecasting
Akshita Raut – PG BABI
Project Overview

• The given dataset is for monthly gas production in Australia from 1956 to 1995.
• The Forecast package which contains the data package also contains methods and tools for displaying and analysing
univariate time series forecasts including exponential smoothing via state space models and automatic ARIMA
modelling.

Project Objectives

• To read the data as time series object in R

• To explore components of Time Series present in the dataset
• To check if the time series is stationary
• To develop an ARIMA Model to forecast for a period of next 12 months
1. Overview of the dataset:
• The given data is a time series data (as required) with monthly frequency and hence, there is no need to convert it into any
format.
• From the plot 1, we can observe an upward trend in the gas production 1970 onwards.
• Start year = Jan 1995
• The season plot indicates increase in demand from May to highest in July and then again decline towards the end of the
year.

1. Examining the dataset:

• View(gas)
• summary(gas)
• head(gas)
• tsclean(gas)

• The above commands are used for

basic check and understand if any
outliers or imputing any missing values.
• To stabilize the data, we will use a
logarithm of the series.
•
1.1. Decomposition

2. Decomposition – Components
of Time Series
• Seasonal component –
shows the fluctuations in
data related to historical
data
• Trend component – the
overall pattern – increasing
or decreasing
• Cyclic component –
components that are not
seasonal.
1.1. Decomposition

2. Decomposition
• The plot shows us that
seasonality is constant.
• The trend is upwards
(increasing) from 1970 to 1990,
and after a short decline, the
trend continues upwards.
2. Components of Time Series:

• The yearly plot shows us that the production spiked up from 1970, took a small decline in 1990 and
is again trending upwards.
2. Components of Time Series:

• The month plot and boxplot of the dataset show us the variation that exists within months.
2. Components of Time Series:

• Let us apply log

transformation to stabilize
the variance.
3. Stationarity

• Augmented Dickey-Fuller Test

data: gas Dickey-Fuller = -2.7131, Lag order = 7, p-value = 0.2764
alternative hypothesis: stationary
• The ADF test is a formal test for
stationary.
• Visually, the given time series As the p-value is less than 0.5, we can conform the hypothesis, the
looks non-stationary. time series is non-stationary.
• The hypothesis to check if the
time series is stationary is as
follows:
• H0 (Null Hypothesis) – TS is not
stationary
• H1 (Alternate Hypothesis) –
Time Series is stationary
4. ARIMA Model

1. Autocorrelation
• The correlation is declining from
lag 1 to lag 5.
4. ARIMA Model

1. Autocorrelation
• The seasonal effect
can be seen in the ACF
plot.
• ACF plots are used to
understand the
correlation between a
series and its lags.
2. Differencing:

• Since the time series is

non-stationary, we can
use differencing to
make it stationary.

• Differencing normal
time series, shows
inconsistency, hence
we use log values of
time series.

• Differencing the time

series with a lag of 10,
can help remove trend
and seasonality both.
2. Differencing:

• The ADF test on differenced data does

not accept the null hypothesis of non-
stationary.

Augmented Dickey-Fuller Test

data: gas.diff
Dickey-Fuller = -18.14, Lag order = 7, p-value
= 0.01
alternative hypothesis: stationary
2. Differencing:

• Plotting the ACF & PACF for differenced values, gives us q = 0, p= 2 when d is considered to be 1.
3. ARIMA Model Selection: (Manual / Auto
ARIMA)

• Looking at ACF & PACF charts, we

can find out optimal p, q & d values
• In this case, we can select 5,6,8

AutoArima<-auto.arima(deseason, seasonal = FALSE)

> print(AutoArima)
Series: deseason
ARIMA(1,1,5) with drift

Coefficients:
ar1 ma1 ma2 ma3 ma4 ma5 drift
0.4747 -0.5575 0.1028 -0.2108 -0.0746 -0.1242
107.6904
s.e. 0.0922 0.0939 0.0624 0.0683 0.0650 0.0495
24.1201

sigma^2 estimated as 3966907: log likelihood=-4279.32

AIC=8574.64 AICc=8574.95 BIC=8607.95
The ACF & PACF plots
indicate repeated
residuals at lag 6, so
using a different
specification, p=6 or
q=6
4. Ljung box test
H0: Residuals are independent
Ha: Residuals are not independent

> Box.test(gasAR1$residuals)

Box-Pierce test

data: gasAR1$residuals
X-squared = 0.072517, df = 1, p-value = 0.7877

> Box.test(gasARfit$residuals)

Box-Pierce test

data: gasARfit$residuals
X-squared = 2.55, df = 1, p-value = 0.1103
5. Forecasting on Manual and Auto ARIMA Models for training data
• Forecasting on ARIMA Models
with seasonality
5. Accuracy Calculations
• > accuracy(acc, gasTest)
• ME RMSE MAE MPE MAPE MASE
• Training set 97.55989 3542.529 2660.401 -0.06479871 5.777713 0.809962
• Test set 4026.99119 6762.854 5870.357 6.79731137 11.089278 1.787236
• ACF1 Theil's U
• Training set -0.03173606 NA
• Test set 0.53466285 1.548875

• > a1<-forecast(gasARfit)
• > accuracy(a1, gasTest)
• ME RMSE MAE MPE MAPE MASE
• Training set -35.15758 2766.478 2171.046 -0.4554556 4.705954 0.6609773
• Test set 4699.81125 5586.601 5111.735 8.6922055 9.679309 1.5562732
• ACF1 Theil's U
• Training set 0.1881929 NA
• Test set 0.1598068 1.328607

Time Series Forecasting (Australian Gas) : Akalya KS
No ratings yet
Time Series Forecasting (Australian Gas) : Akalya KS
15 pages
Mini Project Based On Time Series Forecasting Methods: Data Used
No ratings yet
Mini Project Based On Time Series Forecasting Methods: Data Used
14 pages
Gas Prod
100% (3)
Gas Prod
24 pages
Project6 Time Series
No ratings yet
Project6 Time Series
14 pages
Gas Production Time Series Analysis
100% (19)
Gas Production Time Series Analysis
29 pages
End Term Project (BA)
No ratings yet
End Term Project (BA)
19 pages
TS Gas Report
No ratings yet
TS Gas Report
40 pages
Time Series Forcasting
No ratings yet
Time Series Forcasting
19 pages
Australian Gas Production: R Venkataraman
No ratings yet
Australian Gas Production: R Venkataraman
17 pages
TS Gas Report
No ratings yet
TS Gas Report
43 pages
Time Series
67% (3)
Time Series
34 pages
PROJECT - Time Series Forecasting by Akshay Kharote PDF
100% (2)
PROJECT - Time Series Forecasting by Akshay Kharote PDF
85 pages
FM - Resumes
No ratings yet
FM - Resumes
18 pages
Resumos Forecasting
No ratings yet
Resumos Forecasting
17 pages
Time Series Modeling: Shouvik Mani April 5, 2018
No ratings yet
Time Series Modeling: Shouvik Mani April 5, 2018
46 pages
Project Time Series Analysis
100% (2)
Project Time Series Analysis
26 pages
Time Series Analysis in R A Beginner's Guide
No ratings yet
Time Series Analysis in R A Beginner's Guide
13 pages
Expt. 12 Forecasting 214
No ratings yet
Expt. 12 Forecasting 214
12 pages
Time Series EDA for Data Analysts
No ratings yet
Time Series EDA for Data Analysts
20 pages
Time Series: International University - Vnu HCMC
No ratings yet
Time Series: International University - Vnu HCMC
35 pages
Time Series Analysis: Example: Stationary ARIMA
No ratings yet
Time Series Analysis: Example: Stationary ARIMA
25 pages
Time Series Forecasting Guide
No ratings yet
Time Series Forecasting Guide
6 pages
Notes On Programs Tramo and Seats: 11 March 2003
No ratings yet
Notes On Programs Tramo and Seats: 11 March 2003
95 pages
E Monika Sree 10-10-2024
No ratings yet
E Monika Sree 10-10-2024
60 pages
One Whose Properties Do Not Depend On The Time at Which The Series Is Observed
No ratings yet
One Whose Properties Do Not Depend On The Time at Which The Series Is Observed
12 pages
Forecasting Cheatsheet Final
No ratings yet
Forecasting Cheatsheet Final
4 pages
Demgn801 Business Analytics 76 150
No ratings yet
Demgn801 Business Analytics 76 150
75 pages
Activity 5 (Time Series) - Rudinas
No ratings yet
Activity 5 (Time Series) - Rudinas
7 pages
Time Series Forecasting
No ratings yet
Time Series Forecasting
29 pages
Timeseries - Analysis
No ratings yet
Timeseries - Analysis
37 pages
Gas Production
No ratings yet
Gas Production
29 pages
Arima
No ratings yet
Arima
12 pages
DSS16-Time Series
No ratings yet
DSS16-Time Series
65 pages
Econometric Methods
No ratings yet
Econometric Methods
7 pages
Notes On ARIMA: ND RD
No ratings yet
Notes On ARIMA: ND RD
4 pages
DSS13 Time Series
No ratings yet
DSS13 Time Series
65 pages
Arima Notes
No ratings yet
Arima Notes
4 pages
P - 338 - Oil Price Prediction Final
No ratings yet
P - 338 - Oil Price Prediction Final
40 pages
Estimation, Diagnosis, and Identification of Time Series Models
No ratings yet
Estimation, Diagnosis, and Identification of Time Series Models
15 pages
LAB MANUAL 135 Time Series - Knit
No ratings yet
LAB MANUAL 135 Time Series - Knit
16 pages
TimeSeries SARIMA
No ratings yet
TimeSeries SARIMA
15 pages
Predictive Analytics & Time Series
No ratings yet
Predictive Analytics & Time Series
54 pages
A Course in Time Series Analysis 1662068197
No ratings yet
A Course in Time Series Analysis 1662068197
300 pages
Rao (2022) - A Course in Time Series Analysis
No ratings yet
Rao (2022) - A Course in Time Series Analysis
527 pages
Sarima Group 11
No ratings yet
Sarima Group 11
21 pages
Arima Word
No ratings yet
Arima Word
13 pages
Box-Jenkins Method: Time Series Analysis: Forecasting and Control
100% (1)
Box-Jenkins Method: Time Series Analysis: Forecasting and Control
4 pages
Group 10 TS Assignment
0% (1)
Group 10 TS Assignment
21 pages
cheatsheet的副本
No ratings yet
cheatsheet的副本
8 pages
C TSAF Box Jenkins - Method
No ratings yet
C TSAF Box Jenkins - Method
83 pages
Lec 08 - ARIMA Models
No ratings yet
Lec 08 - ARIMA Models
35 pages
LAB9 Report
No ratings yet
LAB9 Report
6 pages
RDataMining Slides Time Series Analysis PDF
No ratings yet
RDataMining Slides Time Series Analysis PDF
41 pages
Time Series and Forecasting Econometrics Assignment: Name: Student
No ratings yet
Time Series and Forecasting Econometrics Assignment: Name: Student
14 pages
Time Series Forecasting Guide
No ratings yet
Time Series Forecasting Guide
12 pages
Notes On Time Series Analysis
No ratings yet
Notes On Time Series Analysis
111 pages
Chapter 12 Part 2 - Arima Model Estimation - 2023
No ratings yet
Chapter 12 Part 2 - Arima Model Estimation - 2023
15 pages
Bill Sendewicz TSA Project
No ratings yet
Bill Sendewicz TSA Project
49 pages
Effects of Penn Resiliency Programme Among Students With Emotional Reactions PDF
No ratings yet
Effects of Penn Resiliency Programme Among Students With Emotional Reactions PDF
5 pages
COMP2157 Module 1 LP Fundamentals and Applications Assignment 1 Basic
No ratings yet
COMP2157 Module 1 LP Fundamentals and Applications Assignment 1 Basic
7 pages
Seismic Methods in Mineral Exploration and Mine Planning - Introduction
No ratings yet
Seismic Methods in Mineral Exploration and Mine Planning - Introduction
2 pages
The Roles of Customer Perception of Innovativeness and Engagement On Loyalty Through Value Co-Creation Behaviors: The Case of Food-Delivery Service
No ratings yet
The Roles of Customer Perception of Innovativeness and Engagement On Loyalty Through Value Co-Creation Behaviors: The Case of Food-Delivery Service
16 pages
Item Analysis
No ratings yet
Item Analysis
47 pages
The CS Detective An Algorithmic Tale of Crime Conspiracy and Computation 1st Edition Jeremy Kubica PDF Download
100% (3)
The CS Detective An Algorithmic Tale of Crime Conspiracy and Computation 1st Edition Jeremy Kubica PDF Download
63 pages
Department of Food & Nutritional Sciences Sri Satya Sai Institute of Higher Learning Anantapur Campus
No ratings yet
Department of Food & Nutritional Sciences Sri Satya Sai Institute of Higher Learning Anantapur Campus
31 pages
Topic 3-SPSS and STATA
100% (1)
Topic 3-SPSS and STATA
73 pages
Non Inferiority & Equivalence Testing
No ratings yet
Non Inferiority & Equivalence Testing
5 pages
Geotechnical Services Proposal
100% (1)
Geotechnical Services Proposal
13 pages
Narrative Report
No ratings yet
Narrative Report
4 pages
Scientific Method Review-1
No ratings yet
Scientific Method Review-1
2 pages
Difference Between Thesis and Final Year Project
100% (3)
Difference Between Thesis and Final Year Project
7 pages
Impact of Packaging on Dairy Buying
No ratings yet
Impact of Packaging on Dairy Buying
8 pages
Evolution of Endpoint Detection and Response EDR I
No ratings yet
Evolution of Endpoint Detection and Response EDR I
7 pages
A Quick Guide To EU Funding: Grants & Incentives
No ratings yet
A Quick Guide To EU Funding: Grants & Incentives
48 pages
IASSC Six Sigma Black Belt Body of Knowledge
No ratings yet
IASSC Six Sigma Black Belt Body of Knowledge
9 pages
Predicting PM2.5 Concentrations Using Stacking-Based Ensemble Model
No ratings yet
Predicting PM2.5 Concentrations Using Stacking-Based Ensemble Model
7 pages
G137-PEE PetroleumEngineering ProgHandbook 201314 - IDL (Final)
No ratings yet
G137-PEE PetroleumEngineering ProgHandbook 201314 - IDL (Final)
63 pages
Functional Form and Dynamic Models
No ratings yet
Functional Form and Dynamic Models
27 pages
Sec 3
No ratings yet
Sec 3
6 pages
Paper - Advanced Bioinformatics Methods For Practical Applications in Proteomics
No ratings yet
Paper - Advanced Bioinformatics Methods For Practical Applications in Proteomics
17 pages
A Literature Review of The Impact of Micro Finance
No ratings yet
A Literature Review of The Impact of Micro Finance
4 pages
Alemu Feyissa SRM Assignment Answer For (1, 2, 3 & 4)
No ratings yet
Alemu Feyissa SRM Assignment Answer For (1, 2, 3 & 4)
31 pages
Olweus Questionnaire
No ratings yet
Olweus Questionnaire
72 pages
Interview Questions
No ratings yet
Interview Questions
30 pages
Gender and Ethnicity Differences in Tax Compliance: Jeyapalan Kasipillai and Hijattulah Abdul Jabbar
No ratings yet
Gender and Ethnicity Differences in Tax Compliance: Jeyapalan Kasipillai and Hijattulah Abdul Jabbar
16 pages
Provincial Planning and Development Office Compressed
No ratings yet
Provincial Planning and Development Office Compressed
14 pages
Questionnaire On Consumer Buying Behaviour of Cosmetics PDF
No ratings yet
Questionnaire On Consumer Buying Behaviour of Cosmetics PDF
15 pages
Pharmacist Role in Infection Control
No ratings yet
Pharmacist Role in Infection Control
3 pages

Project 6 - Time Series PDF

Uploaded by

Project 6 - Time Series PDF

Uploaded by

Project 6 – Time Series

• To read the data as time series object in R

1. Examining the dataset:

• The above commands are used for

• Let us apply log

• Augmented Dickey-Fuller Test

• Since the time series is

• Differencing the time

• The ADF test on differenced data does

Augmented Dickey-Fuller Test

• Looking at ACF & PACF charts, we

AutoArima<-auto.arima(deseason, seasonal = FALSE)

sigma^2 estimated as 3966907: log likelihood=-4279.32

You might also like