18ECP101L-MASSIVE OPEN ONLINE COURSE-I
SEMESTER V
YEAR: NOV 2022
SQL FOR DATA SCIENCE
Report Submitted by
KATAKAM SAIKUMAR [RA2011053010059]
Faculty in-charge
Mrs S Hannah Pauline
DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING
FACULTY OF ENGINEERING AND TECHNOLOGY
SRM INSTITUTE OF SCIENCE AND TECHNOLOGY
S.R.M. Nagar, Kattankulathur - 603203, Kancheepuram District
TABLE OF CONTENTS
S.NO TITLE
1. COURSE DETAILS
2. INSTRUCTOR PROFILE
3. INTRODUCTION OF THE COURSE
4. COURSE TIMELINE
5. SYLLABUS
6. WEEK-1 CONTENTS
7. WEEK-2 CONTENTS
8. WEEK-3 CONTENTS
9. WEEK-4 CONTENTS
10. COURSE GRADE DETAILS
11. COURSE OUTCOME
12. PROOF OF COURSE COMPLETION
THE END
1. COURSE DETAILS
COURSE PLATFORM- COURSERA
COURSE TITLE- SQL FOR DATA SCIENCE
OFFERED BY- University of California, Davis
COURSE DURATION- 4 weeks
DASHBOARD:
This Specialization is intended for a learner with no previous coding
experience who wants to develop SQL query fluency. Through four
progressively more difficult SQL projects with data science
applications, you will cover topics such as SQL basics, data
wrangling, SQL analysis, A/B testing, distributed computing using
Apache Spark, Delta Lake, and more. These topics will prepare you
to apply SQL creatively to analyse and explore data; demonstrate
efficiency in writing queries; create data analysis datasets; conduct
feature engineering; use SQL with other data analysis and machine
learning toolsets; and use SQL with unstructured data sets.
2. INSTRUCTOR PROFILE
3. INTRODUCTION OF THE COURSE
As data collection has increased exponentially, so has the need for people skilled at using and
interacting with data: people who can think critically and provide insights that lead to better
decisions and better-run businesses. That person is the data scientist, “part mathematician, part
computer scientist, and part trend spotter” (SAS Institute, Inc.). According to Glassdoor, data
scientist is the best job in America, with a median base salary of $110,000 and
thousands of job openings at a time. A good data scientist must be able to retrieve and work
with data, and doing that requires being well versed in SQL,
the standard language for communicating with database systems.
You will create new tables and move data into them. You will learn
common operators and how to combine data. You will use CASE statements and
work with concepts such as data governance and profiling. You will discuss
data-related topics and practice with real-world programming assignments. You will
interpret the structure, meaning, and relationships in source data and use SQL as
a professional would to shape your data for targeted analysis purposes.
4. COURSE TIMELINE
DATE OF ENROLLMENT: 24th August 2022
DATE OF COMPLETION: 2nd November 2022
5. SYLLABUS
Week 1: Selecting and retrieving data with SQL
Week 2: Filtering, sorting, and calculating data with SQL
Week 3: Subqueries and joins in SQL
Week 4: Modifying and analysing data with SQL
6. WEEK-1 CONTENTS
Selecting and retrieving data with SQL
In this module, you will be able to define SQL and discuss how
SQL differs from other computer languages. You will be able to
compare the roles of a database administrator and a data
scientist, and explain the differences between one-to-one,
one-to-many, and many-to-many relationships in databases. You
will be able to use the SELECT statement and describe some
basic syntax rules. You will be able to add comments to your
code and explain why they are important.
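As a minimal sketch of the SELECT statement and SQL comments described above, the example below runs a query through Python's built-in sqlite3 module. The `products` table, its columns, and its rows are hypothetical and exist only for illustration; the SQLite dialect is an assumption, since the report does not name the database used in the course.

```python
import sqlite3

# Hypothetical in-memory table, created only for this illustration.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE products (prod_id INTEGER PRIMARY KEY, prod_name TEXT, price REAL)"
)
conn.executemany(
    "INSERT INTO products (prod_id, prod_name, price) VALUES (?, ?, ?)",
    [(1, "Pen", 1.5), (2, "Notebook", 3.0), (3, "Backpack", 25.0)],
)

query = """
-- Single-line comment: retrieve only the columns that are needed
SELECT prod_name, price
FROM products
/* Multi-line comment:
   LIMIT caps the number of rows returned (SQLite syntax) */
LIMIT 2;
"""
rows = conn.execute(query).fetchall()
print(rows)  # two (prod_name, price) tuples
```

Naming the columns explicitly instead of using `SELECT *` keeps the result small and makes the intent of the query clear to the next reader.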
7. WEEK-2 CONTENTS
Filtering, sorting, and calculating data with SQL
In this module, you will be able to use several new clauses
and operators including WHERE, BETWEEN, IN, OR, NOT,
LIKE, ORDER BY, and GROUP BY. You will be able to use
wildcards to search for specific records or parts of records,
discuss their advantages and disadvantages, and decide how best to
use them. You will be able to use basic math
operators, as well as aggregate functions like AVG (average),
COUNT, MAX, MIN, and others to begin analyzing data.
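The clauses and aggregate functions listed above can be sketched as follows. This is an illustrative example under stated assumptions: the `orders` table and its data are hypothetical, and the queries use SQLite through Python's sqlite3 module.

```python
import sqlite3

# Hypothetical orders table used only for illustration.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer TEXT, city TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?, ?)",
    [
        (1, "Asha", "Chennai", 120.0),
        (2, "Ravi", "Chennai", 80.0),
        (3, "Meena", "Mumbai", 200.0),
        (4, "Arun", "Delhi", 50.0),
        (5, "Anil", "Mumbai", 100.0),
    ],
)

# Filter rows with WHERE, BETWEEN, IN and LIKE, then sort with ORDER BY.
filtered = conn.execute(
    """
    SELECT customer, amount
    FROM orders
    WHERE amount BETWEEN 60 AND 150
      AND city IN ('Chennai', 'Mumbai')
      AND customer LIKE 'A%'   -- % is a wildcard matching any run of characters
    ORDER BY amount DESC;
    """
).fetchall()
print(filtered)  # [('Asha', 120.0), ('Anil', 100.0)]

# Summarise rows per city with GROUP BY and aggregate functions.
summary = conn.execute(
    """
    SELECT city, COUNT(*) AS n_orders, AVG(amount) AS avg_amount, MAX(amount) AS max_amount
    FROM orders
    GROUP BY city
    ORDER BY city;
    """
).fetchall()
print(summary)
# [('Chennai', 2, 100.0, 120.0), ('Delhi', 1, 50.0, 50.0), ('Mumbai', 2, 150.0, 200.0)]
```

A leading wildcard such as `LIKE '%a'` is usually slower than a trailing one because the database generally cannot use an index for it, which is one of the trade-offs of wildcard searches.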
8. WEEK-3 CONTENTS
Subqueries and joins in SQL
In this module, you will be able to discuss subqueries,
including their advantages and disadvantages, and when
to use them. You will be able to recall the concept of a key
field and discuss how key fields help link data together with
JOINs. You will be able to identify and define several
types of JOINs, including Cartesian joins, inner joins,
left and right joins, full outer joins, and self joins. You will
be able to use aliases and pre-qualifiers to make your
SQL code cleaner and more efficient.
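A short sketch of the ideas above: an inner join, a left join, and a subquery, all linked through a key field and written with table aliases. The `customers` and `orders` tables, the alias names `c` and `o`, and the data are hypothetical; SQLite via Python's sqlite3 module is assumed.

```python
import sqlite3

# Hypothetical tables linked by the key field cust_id.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE customers (cust_id INTEGER PRIMARY KEY, cust_name TEXT);
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY, cust_id INTEGER, amount REAL);
    INSERT INTO customers VALUES (1, 'Asha'), (2, 'Ravi'), (3, 'Meena');
    INSERT INTO orders VALUES (10, 1, 120.0), (11, 1, 30.0), (12, 2, 80.0);
    """
)

# INNER JOIN: only customers that have at least one matching order.
inner_rows = conn.execute(
    """
    SELECT c.cust_name, o.amount
    FROM customers AS c
    INNER JOIN orders AS o ON o.cust_id = c.cust_id
    ORDER BY o.order_id;
    """
).fetchall()
print(inner_rows)  # [('Asha', 120.0), ('Asha', 30.0), ('Ravi', 80.0)]

# LEFT JOIN: every customer is kept, even those without orders.
left_rows = conn.execute(
    """
    SELECT c.cust_name, COUNT(o.order_id) AS n_orders
    FROM customers AS c
    LEFT JOIN orders AS o ON o.cust_id = c.cust_id
    GROUP BY c.cust_id, c.cust_name
    ORDER BY c.cust_id;
    """
).fetchall()
print(left_rows)  # [('Asha', 2), ('Ravi', 1), ('Meena', 0)]

# Subquery: the inner SELECT runs first and feeds the outer WHERE clause.
subquery_rows = conn.execute(
    """
    SELECT cust_name
    FROM customers
    WHERE cust_id IN (SELECT cust_id FROM orders WHERE amount > 50)
    ORDER BY cust_name;
    """
).fetchall()
print(subquery_rows)  # [('Asha',), ('Ravi',)]
```

Qualifying each column with its alias (`c.cust_name`, `o.amount`) removes ambiguity when both tables contain a column with the same name, such as `cust_id` here.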
9. WEEK-4 CONTENTS
Modifying and analyzing data with SQL
In this module, you will be able to discuss how to modify
strings by concatenating, trimming, changing the case, and
using the substring function. You will be able to work with
date and time strings in particular. You will be able to use
CASE statements and finish this module by discussing data
governance and profiling. You will also be able to apply
fundamental principles, along with practical tips and tricks,
when using SQL in a data science context.
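The string, date, and CASE operations described above might look like the sketch below. The `employees` table and the salary thresholds are hypothetical, and the function names (`TRIM`, `UPPER`, `SUBSTR`, `STRFTIME`, and `||` for concatenation) follow the SQLite dialect, which is an assumption; other database systems spell some of these differently.

```python
import sqlite3

# Hypothetical employees table; dates are stored as ISO-8601 text, as is common in SQLite.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employees (first_name TEXT, last_name TEXT, hired TEXT, salary REAL)"
)
conn.executemany(
    "INSERT INTO employees VALUES (?, ?, ?, ?)",
    [("  asha ", "Rao", "2021-03-15", 95000.0), ("Ravi", "Kumar", "2019-11-02", 40000.0)],
)

rows = conn.execute(
    """
    SELECT
        TRIM(first_name) || ' ' || UPPER(last_name) AS full_name,  -- trim, concatenate, change case
        SUBSTR(last_name, 1, 3) AS short_code,                      -- first three characters
        STRFTIME('%Y', hired) AS hire_year,                         -- extract the year from a date string
        CASE
            WHEN salary >= 90000 THEN 'high'
            WHEN salary >= 50000 THEN 'medium'
            ELSE 'low'
        END AS salary_band                                          -- bucket values with CASE
    FROM employees
    ORDER BY hired;
    """
).fetchall()
print(rows)
# [('Ravi KUMAR', 'Kum', '2019', 'low'), ('asha RAO', 'Rao', '2021', 'high')]
```

CASE evaluates its WHEN branches from top to bottom and returns the first match, so the conditions should be ordered from most to least restrictive.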