
Data Analyst

The Data Analyst program at Udacity teaches students to analyze data using Python libraries like NumPy and pandas, focusing on data wrangling and visualization skills. The curriculum includes hands-on projects that cover the data analysis process, data cleaning, and effective communication of findings through visualizations. Prerequisites include basic Python knowledge and descriptive statistics, with the program designed for intermediate learners over an estimated duration of 2 months.

Uploaded by

Pawandeep Singh


School of Data Science

Data Analyst
Syllabus

udacity.com
Data Analyst

BEFORE YOU START

Overview:
Learn how to analyze data using in-demand Python libraries
like NumPy and pandas. Students will start by going over the
basics of the data analysis process, then dive into advanced
data wrangling skills to work with messy, complex real-world
datasets. Finally, you will create highly customized
visualizations using the Matplotlib Python library.

Educational Objectives:
This program prepares you for a career as a data analyst
by helping you learn to organize data, uncover patterns
and insights, draw meaningful conclusions, and clearly
communicate critical findings. You'll develop proficiency
in Python and its data analysis libraries (NumPy, pandas,
Matplotlib) as you build a portfolio of projects to
showcase in your job search.

Prerequisites:
A well-prepared learner has experience with:
Basic Python
Descriptive Statistics
Machine Learning Fluency

Length of Program*: 2 months
Skill level: Intermediate
School: School of Data Science

Software/Hardware and version requirements:
For this Nanodegree program, you will need access to the Internet.
Additional software such as Python and its common data analysis libraries (e.g., pandas and Matplotlib) will be
required, but the program includes Udacity Workspaces with all of the relevant packages installed, so students
will not need to download any additional software.

*The length of this program is an estimation of total hours the average student may take to complete all required coursework,
including lecture and project time. If you spend about 5-10 hours per week working through the program, you should finish within
the time provided. Actual hours may vary.


Course #1: Introduction to Data Analysis with Pandas and NumPy

PROJECT #1
Investigate a Dataset

In this project, you will analyze a dataset and then communicate
your findings about it. This includes asking questions, exploring
the dataset, performing basic data wrangling, drawing
conclusions, and presenting your findings with numbers and
visualizations. Your analysis will be performed in a Jupyter
Notebook using the NumPy and pandas Python libraries.

Supporting Lesson Content

The Data Analysis Process
Describe the types of problems that Data Analysts can solve
Describe the five steps in the data analysis process: Question, Wrangle, Explore, Draw Conclusions, and Communicate
Describe three important Python packages for data analysis: NumPy, pandas, and Matplotlib

Jupyter Notebooks
Explain that Jupyter Notebooks can combine explanatory text, math equations, code, and visualizations
Create a new Jupyter Notebook
Use code and Markdown cells in a Jupyter Notebook
Use keyboard shortcuts in a Jupyter Notebook
Use magic keywords in a Jupyter Notebook
Convert notebooks to other formats

Exploring and Inspecting Data
Form and ask questions about data
Define data wrangling and EDA
Gather data
Read CSV files with pandas
Use pandas to inspect and assess data

Manipulating Data Using Pandas and NumPy
Use pandas to perform simple data cleaning tasks
Use the pandas query function to filter data
Fix column data types using pandas
Use pandas concatenate and merge to combine data
Use pandas explode to expand data

Communicating Results
Use pandas to summarize a dataset
Use pandas plotting to create simple visualizations
Draw conclusions from data using descriptive statistics and visualizations
Use visuals to communicate results
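
As a hedged illustration of the pandas operations named in the Course #1 lesson topics (filtering with query, fixing column data types, combining with concatenate and merge, and expanding with explode), a minimal sketch might look like the following. The toy tables orders_jan, orders_feb, and customers, along with every column name and value, are invented for illustration and are not taken from the course material.

```python
import pandas as pd

# Hypothetical toy data; all names and values are illustrative assumptions.
orders_jan = pd.DataFrame({
    "order_id": [1, 2],
    "customer_id": [10, 11],
    "amount": ["19.99", "5.00"],          # numbers stored as strings by mistake
    "tags": [["gift", "rush"], ["standard"]],
})
orders_feb = pd.DataFrame({
    "order_id": [3],
    "customer_id": [10],
    "amount": ["42.50"],
    "tags": [["rush"]],
})
customers = pd.DataFrame({"customer_id": [10, 11], "name": ["Ada", "Grace"]})

# Concatenate: stack the two monthly tables row-wise.
orders = pd.concat([orders_jan, orders_feb], ignore_index=True)

# Fix a column data type: convert the string amounts to floats.
orders["amount"] = orders["amount"].astype(float)

# Query: keep only the rows where amount exceeds 10.
large = orders.query("amount > 10")

# Merge: attach customer names using the shared customer_id key.
labeled = orders.merge(customers, on="customer_id", how="left")

# Explode: give each tag in the list column its own row.
exploded = orders.explode("tags", ignore_index=True)

print(large[["order_id", "amount"]])
print(labeled[["order_id", "name"]])
print(exploded[["order_id", "tags"]])
```

The left merge keeps every order even if a customer record were missing, which is usually the safer default when the orders table is the one being analyzed.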


Course #2: Advanced Data Wrangling

PROJECT #2

Wrangle and Analyze Data

Real-world data rarely comes clean. Using Python and its
libraries, you will gather data from a variety of sources and in a
variety of formats, assess its quality and structure, then clean it.
This is called data wrangling. You will document your wrangling
efforts in a Jupyter Notebook, plus showcase them through
analyses and visualizations using Python (and its libraries).

Supporting Lesson Content

Introduction to Data Wrangling
Identify each step of the data wrangling process (gathering, assessing, and cleaning)
Explain why data wrangling is important

Gathering Data
Describe the gathering phase
Unzip file archives using Python
Extract gathered tabular data from flat files using pandas
Gather data by programmatically downloading files
Extract data from text files using Python
Gather data by accessing APIs
Extract gathered data from JSON files
Gather and extract data from HTML files using BeautifulSoup
Extract data from a SQL database
Identify additional file formats that data analysts might encounter

Assessing Data
Describe the assessing phase
Distinguish between dirty data (content or “quality” issues) and messy data (structural or “tidiness” issues)
Identify data quality issues and categorize them
Assess data quality visually
Assess data quality programmatically using pandas
Strategize about data structuring needed for analytical datasets
Assess data structure visually
Assess data structure using pandas

Cleaning Data
Describe the cleaning phase
Identify each step of the data cleaning process (defining, coding, and testing)
Define data cleaning tasks based on assessment findings
Clean data using Python
Test cleaning code visually
Test cleaning programmatically using Python
Store cleaned data using flat files
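
To make the assess-then-clean sequence in the Course #2 lesson topics more concrete, here is one way the define, code, and test steps could be sketched with pandas. The raw table, its columns, and the particular issues are hypothetical, chosen to show two quality ("dirty") issues (a missing value and a duplicate row) and one tidiness ("messy") issue (two variables packed into one column).

```python
import pandas as pd

# Hypothetical raw data; names and values are illustrative assumptions.
raw = pd.DataFrame({
    "patient": ["Ann Lee", "Bob Ray", "Bob Ray", "Cy Poe"],
    "weight": ["70kg", "82kg", "82kg", None],
    "city_state": ["Austin, TX", "Boise, ID", "Boise, ID", "Salem, OR"],
})

# Assess programmatically: count missing weights and fully duplicated rows.
n_missing = int(raw["weight"].isna().sum())
n_duplicates = int(raw.duplicated().sum())

# Define: drop duplicate rows, store weight as a number in kilograms,
# and split city_state into separate city and state columns.

# Code: apply each defined task to a copy of the raw data.
clean = raw.drop_duplicates().copy()
clean["weight_kg"] = pd.to_numeric(clean["weight"].str.replace("kg", "", regex=False))
clean[["city", "state"]] = clean["city_state"].str.split(", ", expand=True)
clean = clean.drop(columns=["weight", "city_state"]).reset_index(drop=True)

# Test: check programmatically that each task had the intended effect.
assert not clean.duplicated().any()
assert pd.api.types.is_float_dtype(clean["weight_kg"])
assert list(clean.columns) == ["patient", "weight_kg", "city", "state"]

# Store: serialize to CSV text (pass a file path instead to write a flat file).
csv_text = clean.to_csv(index=False)
print(csv_text)
```

The missing weight is left as NaN here on purpose; whether to drop or fill it is an analysis decision that the assessment findings would need to justify.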


Course #3: Data Visualization with Matplotlib and Seaborn

In this course you will learn how to:
Implement a broad variety of visualizations to communicate key metrics and features of a dataset using exploratory analysis.
Apply appropriate plots, limits, transformations, and aesthetics for exploratory analysis of a dataset, to understand variable distributions and features.
Utilize encodings and design principles to effectively respond to business questions using explanatory analysis.

PROJECT #3
Communicate Data Findings

In Part I, Exploratory data visualization, you will use Python
visualization libraries to systematically explore your selected
dataset, starting with plots of single variables and building up
to plots of multiple variables.

In Part II, Explanatory data visualization, you will produce a
short presentation that illustrates interesting properties,
trends, and relationships that you discovered in your selected
dataset. The primary method of conveying your findings will
be through transforming your exploratory visualizations from
the first part into polished, explanatory visualizations.

Supporting Lesson Content

Data Visualization in Data Analysis
Understand why visualization is important in the practice of data analysis.
Know what distinguishes exploratory analysis from explanatory analysis, and the role of data visualization in each.

Design of Visualizations
Interpret features in terms of the level of measurement.
Know different encodings that can be used to depict data in visualizations.
Understand various pitfalls that can affect the effectiveness and truthfulness of visualizations.

Univariate Exploration of Data
Use bar charts to depict distributions of categorical variables.
Use histograms to depict distributions of numeric variables.
Use axis limits and different scales to change how your data is interpreted.

Bivariate Exploration of Data
Use scatterplots to depict relationships between numeric variables.
Use violin and box charts to depict relationships between categorical and numeric variables.
Use clustered bar charts to depict relationships between categorical variables.
Use faceting to create plots across different subsets of the data.

Multivariate Exploration of Data
Use encodings like size, shape, and color to encode values of the third variable in a visualization.
Explore multiple relationships between multiple variables at the same time.
Use feature engineering to capture relationships between variables.

Explanatory Visualizations
Understand what it means to tell a compelling story with data.
Choose the best plot type, encodings, and annotations to polish your plots.
Create high-quality image files using a Jupyter Notebook to convey your findings.

Visualization Case Study
Apply your knowledge of data visualization to a dataset involving the characteristics of diamonds and their prices.
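
A rough sketch of two of the Course #3 lesson topics, a histogram with explicit bins and axis limits and a scatterplot on log scales, is shown below using Matplotlib only. The carat and price arrays are synthetic stand-ins generated with NumPy, not the actual diamonds data from the case study, and the bin width, axis limit, and figure size are arbitrary assumptions.

```python
import io

import matplotlib
matplotlib.use("Agg")  # non-interactive backend so this runs without a display
import matplotlib.pyplot as plt
import numpy as np

# Synthetic stand-in data; the real case-study dataset is not reproduced here.
rng = np.random.default_rng(0)
carat = rng.lognormal(mean=-0.5, sigma=0.5, size=500)
price = 4000 * carat ** 1.7 * rng.lognormal(mean=0.0, sigma=0.2, size=500)

fig, (ax_hist, ax_scatter) = plt.subplots(1, 2, figsize=(10, 4))

# Univariate: histogram of a numeric variable with fixed-width bins
# and an explicit x-axis limit.
bins = np.arange(0, carat.max() + 0.1, 0.1)
ax_hist.hist(carat, bins=bins)
ax_hist.set_xlim(0, 3)
ax_hist.set_xlabel("carat")
ax_hist.set_ylabel("count")

# Bivariate: scatterplot of two numeric variables, with log scales on
# both axes to spread out the skewed values.
ax_scatter.scatter(carat, price, s=8, alpha=0.3)
ax_scatter.set_xscale("log")
ax_scatter.set_yscale("log")
ax_scatter.set_xlabel("carat (log scale)")
ax_scatter.set_ylabel("price (log scale)")

fig.tight_layout()

# Render a higher-resolution PNG into memory; pass a file name to
# savefig instead to write an image file.
buffer = io.BytesIO()
fig.savefig(buffer, format="png", dpi=150)
```

In a Jupyter Notebook the figure would normally display inline, so the Agg backend line and the in-memory buffer are only needed when running this as a plain script.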


Course #1 Instructor
Matt Maybeno
Principal Software Engineer

Matt is a Principal Software Engineer at SOCi. With a master's in Bioinformatics
from SDSU, he utilizes his cross-domain expertise to build solutions in NLP and
predictive analytics.

Course #2 Instructor
Ria Cheruvu
Intel NEX AI Ethics Lead Architect
Ria is Intel NEX AI Ethics Lead Architect, leading trustworthy AI. She is an emerging
industry speaker and has a master’s in data science from Harvard University. Ria
previously served as a Teaching Fellow for Harvard's 2021 Data Science graduate
curriculum and Lead Instructor for Eduonix's ML Deployment course.

Course #2 Instructor
Josh Magee
Senior Data Scientist

Josh is a Senior Data Scientist at Local Logic, where he models commercial real
estate trends, acquisitions, and sustainable cities. He was formerly Assistant
Professor of Data Analytics at Stonehill College, and was a postdoctoral researcher
in nuclear physics at Lawrence Livermore National Laboratory.

Learn More at
www.udacity.com

