0% found this document useful (0 votes)
440 views20 pages

Power Consumption in India (2019-2020) : Data Science Project Report

This document is a data science project report submitted by P Anand Kumar Reddy for their INT217 course. The project analyzed power consumption in India from 2019-2020. It included an introduction on India's power system, the scope of the analysis, source of the dataset, ETL process used, and key analyses on state-wise maximum and minimum power consumption using pivot tables and charts. The results showed that Maharashtra had the highest consumption while Sikkim had the lowest.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
440 views20 pages

Power Consumption in India (2019-2020) : Data Science Project Report

This document is a data science project report submitted by P Anand Kumar Reddy for their INT217 course. The project analyzed power consumption in India from 2019-2020. It included an introduction on India's power system, the scope of the analysis, source of the dataset, ETL process used, and key analyses on state-wise maximum and minimum power consumption using pivot tables and charts. The results showed that Maharashtra had the highest consumption while Sikkim had the lowest.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 20

DATA SCIENCE PROJECT REPORT

(Project Semester August-December 2020)

Power Consumption in India (2019-2020)

Submitted by
P Anand Kumar Reddy
Registration No: 11807873

Computer Science and Engineering


Section: KM074
Course Code: INT217

Under the Guidance of


Ms. Komal Arora

Discipline of CSE/IT
Lovely School of Computer Science and Engineering
Lovely Professional University, Phagwara
CERTIFICATE

This is to certify that P Anand Kumar Reddy bearing Registration no.


11807873 has completed INT217 project titled, “Power Consumption in
India (2019-2020)” under my guidance and supervision. To the best of my
knowledge, the present work is the result of his/her original development,
effort and study.

Signature and Name of the Supervisor


Designation of the Supervisor
School of Computer Science and Engineering
Lovely Professional University
Phagwara, Punjab.

Date: 20-11-2020
DECLARATION

I, P Anand Kumar Reddy, student of Lovely Professional University under


CSE/IT Discipline at, Lovely Professional University, Punjab, hereby declare
that all the information furnished in this project report is based on my own
intensive work and is genuine.

Date: 20-11-2020 Signature


Registration No. 11807873 P Anand Kumar Reddy
ACKNOWLEDGEMENT

A project work is a combination of views, ideas, suggestions and contribution


of many people. Thus, one of the pleasant parts of writing the report is to
thank those who have contributed towards its fulfilment.

I consider it as great privilege to have esteemed Lecturer Ms. Komal Arora as


my project guide. I take this opportunity to express my sincere gratitude to
him through constant advice and constructive criticism nourished my interest
in the subject and provided a free and pleasant atmosphere to work against all
odd situations. I avail this opportunity to extend my heart full thanks and deep
respect to faculty member for their able guidance during this project.

My gratitude to all those, who responded to my questionnaire in a well-


defined manner and helped me acquiring knowledge.

I would like to communicate a deep sense of gratitude to all these people


without whom my project would not have been such a great learning
experience.

P Anand Kumar Reddy


KM074
Reg No. 11807873
Lovely Professional University
Table of Contents

Sr.No. Particulars
1. Introduction
2. Scope of the Analysis
3. Source of the Dataset
4. ETL Process
5. Analysis on Dataset
a. Introduction
b. Specific Requirements/Functions and Formulas
c. Analysis Results
6. List of Analysis with Results
7. References
8. Bibliography
Introduction

India is the world's third-largest producer and third-largest consumer of electricity. The na-
tional electric grid in India has an installed capacity of 370.106 GW as of 31 March 2020.
Renewable power plants, which also include large hydroelectric plants, constitute 35.86% of
India's total installed capacity. During the 2018-19 fiscal year, the gross electricity generated
by utilities in India was 1,372 TWh and the total electricity generation (utilities and non-util-
ities) in the country was 1,547 TWh. The gross electricity consumption in 2018-19 was 1,181
kWh per capita.
In 2015-16, electric energy consumption in agriculture was recorded as being the highest
(17.89%) worldwide. The per capita electricity consumption is low compared to most other
countries despite India having a low electricity tariff.

In light of the recent COVID-19 situation, when everyone has been under lockdown for the
months of April & May the impacts of the lockdown on economic activities have been faced
by every sector in a positive or a negative way.
With the electricity consumption being so crucial to the country, we came up with a plan to
study the impact on energy consumption state and region wise.

The dataset is exhaustive in its demonstration of energy consumption state wise.

 Dataset

Column Definition

States All States in India

Regions Regions in India

Latitude and Longitude The Coordinates of States in India

Dates All dates between 2019 and 2020

Usage Electricity usage by different states


Scope of the Analysis

Data is in the form of a time series for a period of 17 months beginning from 2nd Jan 2019
till 23rd May 2020. Rows are indexed with dates and columns represent states. Rows and
columns put together, each datapoint reflects the power consumed in Mega Units (MU) by
the given state (column) at the given date (row). The data is divided into multiple tables like
maximum power usage states and minimum usage states. Data shows that main electricity
usage by regions and location in maps by using latitude and longitude.

Power System Operation Corporation Limited (POSOCO) is a wholly-owned Government of


India enterprise under the Ministry of Power. It was earlier a wholly-owned subsidiary of
Power Grid Corporation of India Limited. It was formed in March 2009 to handle the power
management functions of PGCIL.

Since such vast field of data present of the Power Consumption in India there is wide range
of scope of the analysis of date. For example:

a) State wise Electricity Usage


b) Date wise Electricity Usage
c) Maximum and Minimum Power Usage States

Source of the Dataset

 The dataset is taken from the Kaggle with the name ‘Power Consumption in India’.

https://www.kaggle.com/twinkle0705/state-wise-power-consumption-in-india

 Author of the Dataset


Twinkle Khanna

 Data last Updated


June 2020
ETL Process

ETL stands for Extraction, Transformation and Loading. It is a process in data warehousing


to extract data, transform data and load data to final source. ETL covers a process of how the
data are loaded from the source system to the data warehouse. Let us briefly describe each
step of the ETL process.

 Extraction: Extraction is the first step of ETL process where data from different
sources like txt file, XML file, Excel file or various sources collected.
 Transformation: Transformation is the second step of ETL process where all col-
lected data is been transformed into same format i.e. format can be anything as per
our requirement before loading it to data-warehouse i.e. it may be data-type format,
data merge format, splitting format, alphabet joining format, currency format etc.
 Loading: Final step of ETL process, the big chunk of data which is collected from
various sources and transformed then finally load to our data warehouse.
Analysis of Dataset
1. Tableau
a. Introduction: In this tableau, I made some changes to the data and also some
cleaning like change date and removing time.
b. Specific Requirements/Functions and Formulas:
 Tableau Prep
2. Dashboard
a. Introduction: It contains all pivot tables, slicers, and hyperlinks. Here we can see
the whole data and how it performs when we click the slicers.

3. Diagram View
a. Introduction: Diagram View to connect all the tables in excel sheet and I almost
connected 4 tables and also extra 2 tables because some of the tables are not con-
necting.
4. Maximum Power Consume States:
a. Introduction: The analysis shows the top 10 maximum power consumption states
in India.
b. Specific Requirements/Functions and Formulas:
 Pivot table of States
 Pivot table of Usage
 Stacked Bar Chart
c. Analysis Results:
 Maharashtra state consume 12.7% power. It is the highest Electricity Con-
sumption State.

Maximum Power Consume States


Maharashtra

Gujarat

UP

Tamil Nadu

NR
Rajasthan SR
WR

MP

Karnataka

Telangana

Andhra Pradesh

Punjab

0 50000 100000 150000 200000 250000


5. Minimum Power Consume States:
a. Introduction: The analysis shows the top 10 minimum power consumption states
in India.
b. Specific Requirements/Functions and Formulas:
 Pivot table of states
 Pivot table of Usage
 Stacked Bar Chart
c. Analysis Results:
 Sikkim state consume 0.0379% power. It is lowest Electricity
Consumption State.

 In this pandemic time over all Regions Electricity Consumption is decreas-


ing comparatively last year.

Minimum Power Consume States


6000

5000

4000

WR
3000 SR
NR
NER
ER
2000

1000

0
a
nd
y
ay
a
ar
h
ur
a ur d
es
h m im
Go al ig ip ip l an ra kk
Po nd r a n ga ad i zo Si
gh T r
e
Ch
a M Na lP M
M ha
ac
un
Ar
6. State wise Consumption:
a. Introduction: The analysis shows the power usage comparison between states in
India.
b. Specific Requirements/Functions and Formulas:
 Pivot table of years
 Pivot table of states
 Line with Markers Chart
c. Analysis Results:
 In June month usage of power is started decreasing again July month star-
ted increasing power usage

 October and November are usually cloudless so that time the usage of
Electricity is low.
 Again, July middle started increasing the power consumption

180000
State-wise Electricity Consumption
160000

140000

120000

100000

2019
80000 2020

60000

40000

20000

0
es
h
am g ar
h l hi a a K
ak
a tra ay
a P a b im na UP ga
l
ss De Go an J&
at sh al
M ish unja kk ga en
rad A di a ry n a h Od P S i n
P an H
Ka
r ar eg la tB
hra Ch ah M Te es
d M W
An
7. Region wise Consumption
a. Introduction: The analysis shows the relation between Regions and Sum of Power
Usage in India.
b. Specific Requirements/Functions and Formulas:
 Pivot table of Regions
 Pivot table of Sum of Usage
 3-D Pie Chart
c. Analysis Results:
 In North region UP consume maximum electricity, Chandigarh Consume
low electricity in north region.
 In West Region Maharashtra State consume maximum electricity
 In South region Tamil Nadu state consume maximum power and Pondy
consume minimum electricity.
 In NER Assam Consume maximum electricity

Region wise Consumption

9.63% 1.27
31.61% %

ER
NER
29.61% NR
SR
WR

27.88%
8. Month wise Consumption:
a. Introduction: The analysis shows the relation between Months and Sum of Power
Usage in different years in India.
b. Specific Requirements/Functions and Formulas:
 Pivot table on Months
 Pivot table on Usage in different years
 Doughnut Chart
c. Analysis Results:
 In June month usage of power is started decreasing again July month star-
ted increasing power usage
 October and November are usually cloudless so that time the usage of
Electricity is low.
 Again, July middle started increasing the power consumption

Month wise Consumption

Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
9. 3D Map
a. Introduction: The analysis shows the relation between Latitude and Longitude in
India.
b. Specific Requirements/Functions and Formulas:
 3D Map
 Longitude Data
 Latitude Data
10. Hyperlinks and Slicers
a. Introduction: Hyperlinks helps to redirect to the respective excel sheet and we
have to select to which sheet it has to redirect.
 Slicers helps to change the data in pivot tables like selecting data according to
our choice. In my data slicers are Year, Regions.

Hyperlinks

Slicers
List of Analysis with Results

 Maharashtra state consume 12.7% power. It is the highest Electricity Consumption


State.
 Sikkim state consume 0.0379% power. It is lowest Electricity Consumption State.
 Over all North Region consumes high Electricity 27.3%, North east region Electricity
Consumption is 21.2 %, South Region and West Region consume almost same
Electricity 18.2%, East Region consume minimum Electricity 15.2%.

 In this pandemic time over all Regions Electricity Consumption is decreasing compar-
atively last year.
 In all Regions, when the summer is starting that time the usage of Electricity is started
increasing middle of February the power usage started increasing.
 In June month usage of power is started decreasing again July month started increas-
ing power usage
 October and November are usually cloudless so that time the usage of Electricity is
low.

 In North region UP consume maximum electricity, Chandigarh Consume low electri-


city in north region.
 In West Region Maharashtra State consume maximum electricity.
 Again, July middle started increasing the power consumption
 In South region Tamil Nadu state consume maximum power and Pondy consume
minimum electricity.
 In NER Assam Consume maximum electricity.
 Other state consumes very low electricity power (within 200 vol only)


References

1. www.kaggle.com
2. www.stackoverflow.com
3. www.google.com
4. www.youtube.com
5. www.github.com

Bibliography

1. Microsoft Excel 2016 Bible: The Comprehensive Tutorial Resource by John


Walkenbach, Wiley.
2. Fundamentals of Business Analytics by R.N. Prasad, Seema Acharya, Wiley.

You might also like