Anaconda Training: Data Science Foundations

This 4-day course teaches students how to use Anaconda Enterprise and related Python tools for data science. Over the course, students will learn the core Python libraries for data processing, analysis, and machine learning. They will learn how to access tabular and database data, use Pandas for data exploration and wrangling, perform statistical analysis and modeling, and apply machine learning algorithms with Scikit-Learn. The course is aimed at all experience levels and includes hands-on exercises using tools like Jupyter Notebook.

Uploaded by

Faisal


Anaconda Training

Data Science Foundations

At the conclusion of this 4-day course you will have a solid understanding of how Anaconda Enterprise and the Python ecosystem work together to help you perform quantitative and qualitative analyses. The course covers the core libraries for data processing, analysis, and statistical computation, and gives an overview of machine learning. You’ll learn how to access tabular data stored in various file formats, along with data stored in relational databases and distributed storage systems.

Preparation
Students will connect to an Anaconda Enterprise instance maintained by the Anaconda Training department.

This course is suitable for all levels of Python and data science experience. The following DataCamp courses nonetheless provide useful background preparation:

• Intro to Python for Data Science

• Intermediate Python for Data Science

Curriculum
See the Day 1 to Day 4 sections below for a detailed outline of each section.
• Getting Started with Anaconda Enterprise (1 day)

• Essential Pandas (1 day)

• Access Big Data with Anaconda Enterprise (1/2 day)

• Statistical Modeling and Analysis (1/2 day)

• Machine Learning with Scikit-Learn (1 day)

About Anaconda, Inc.

With more than 13 million users, Anaconda is the world’s most popular data science platform and the foundation of modern machine learning. Anaconda Enterprise delivers data science and machine learning at speed and scale, unleashing the full potential of our customers’ data science and machine learning initiatives.

206 E 9th Street, Floor 18, Austin, TX 78701

anaconda.com
Day 1
Getting Started with Anaconda Enterprise
Duration: 1 day

Anaconda Enterprise Platform

• Log into Anaconda Enterprise
    Overview of Projects, Deployments, and Channels
• Working with projects
    Install packages and environments
    Commit and share projects
• Working in Sessions
    Jupyter Notebook
    JupyterLab
    Zeppelin

Review Core Python Syntax

• Python concepts & constructs
    Python object model & modules
    Flow control
    Functions
• Data structures
    Methods/attributes for common data structures
    Idioms for slicing, indexing, iteration, and comprehension
    Files & file I/O: common methods & idioms, context managers
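
As a rough illustration of the idioms listed above (slicing, indexing, iteration, comprehensions, and file I/O with context managers), the following sketch uses only the standard library. The `readings` list and the file name are hypothetical and are not taken from the course materials.

```python
import os
import tempfile

# Hypothetical sample data, for illustration only.
readings = [3, 8, 1, 9, 4, 7]

# Slicing and indexing
first_three = readings[:3]      # first three items
every_other = readings[::2]     # every second item
last_item = readings[-1]        # negative index counts from the end

# Iteration with enumerate, plus list and dict comprehensions
squares_of_even = [x * x for x in readings if x % 2 == 0]
position_of = {value: index for index, value in enumerate(readings)}

# File I/O with context managers: each file is closed automatically
# when its "with" block ends, and the temporary directory is removed.
with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "readings.txt")
    with open(path, "w", encoding="utf-8") as handle:
        handle.write("\n".join(str(x) for x in readings))
    with open(path, encoding="utf-8") as handle:
        round_trip = [int(line) for line in handle]

print(first_three, every_other, last_item)
print(squares_of_even, round_trip == readings)
```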
 

Day 2
Essential Pandas
Duration: 1 day

• Data Exploration
    reading data sources
    selections & summary statistics
    data analysis methods
    filtering data using logical conditions
    plotting in Jupyter notebooks
• Data Formats
    Flat text files: CSV, TSV
    Binary Formats: HDF5, SAS, Excel, databases
    Structured files: JSON
• Data Processing
    using vectorized operations
    transforming strings & datetimes
• Time series
    creating & using datetime indexes
    resampling time series
    using rolling windows
• Grouping
    Grouping data by column values
    resampling time series
• Merging & Joining DataFrames
    appending & concatenating DataFrames
    joins/merges on Index
    merges on multiple columns
    merging with missing values
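
A few of the Pandas topics above (filtering with logical conditions, grouping, datetime indexes, resampling, rolling windows, and merging with missing values) might look roughly like the sketch below. The `sales` and `managers` tables are invented for illustration; the course's own datasets are not given in this outline.

```python
import pandas as pd

# Hypothetical daily sales for two stores; all names and values are made up.
dates = pd.date_range("2024-01-01", periods=6, freq="D")
sales = pd.DataFrame({
    "date": list(dates) * 2,
    "store": ["A"] * 6 + ["B"] * 6,
    "units": [5, 7, 6, 9, 8, 10, 2, 3, 4, 3, 5, 6],
})

# Filtering data using a logical condition
busy_days = sales[sales["units"] >= 8]

# Grouping data by column values
units_per_store = sales.groupby("store")["units"].sum()

# Time series: datetime index, resampling, and a rolling window
store_a = sales[sales["store"] == "A"].set_index("date")["units"]
two_day_totals = store_a.resample("2D").sum()
rolling_mean = store_a.rolling(window=3).mean()

# Merging with missing values: store "B" has no entry in managers,
# so a left join leaves its "manager" column as NaN.
managers = pd.DataFrame({"store": ["A", "C"], "manager": ["Ana", "Cy"]})
merged = sales.merge(managers, on="store", how="left")

print(units_per_store)
print(two_day_totals)
print(merged["manager"].isna().sum())
```

A left join is used here so that every sales row survives the merge; an inner join would silently drop store "B".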

Day 3
Access Big Data with Anaconda Enterprise
Duration: 1/2 day

• Accessing Databases
    Connect with SQLAlchemy
    Execute queries
    Retrieve results
• PySpark
    Hive tables and storage
    DataFrame objects
    Processing Hadoop files
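
The connect, execute, and retrieve steps listed under Accessing Databases can be sketched with SQLAlchemy as follows. This is a minimal example under stated assumptions: it uses an in-memory SQLite database and a hypothetical `measurements` table so that it runs without a server, whereas the course presumably connects to databases reachable from the training instance. PySpark is not shown here because it needs a running Spark installation.

```python
from sqlalchemy import create_engine, text

# Connect: an in-memory SQLite database, chosen only so the sketch
# is self-contained. The table and its contents are hypothetical.
engine = create_engine("sqlite:///:memory:")

# engine.begin() opens a connection and commits when the block succeeds.
with engine.begin() as conn:
    # Execute statements (bound parameters avoid string formatting).
    conn.execute(text("CREATE TABLE measurements (site TEXT, value REAL)"))
    conn.execute(
        text("INSERT INTO measurements (site, value) VALUES (:site, :value)"),
        [
            {"site": "north", "value": 1.5},
            {"site": "north", "value": 2.5},
            {"site": "south", "value": 4.0},
        ],
    )

    # Execute a query and retrieve the results as plain tuples.
    result = conn.execute(
        text(
            "SELECT site, AVG(value) FROM measurements "
            "GROUP BY site ORDER BY site"
        )
    )
    rows = [tuple(row) for row in result]

print(rows)
```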

Statistical Modeling and Analysis

Duration: 1/2 day

• Overview of scipy.stats
• Sampling empirical distributions
• Construct PDF and CDF
• Hypothesis testing
• Statsmodels
    linear regression
    regression analysis
    logistic regression
    building design matrices with R-like equations
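
To make the scipy.stats items above more concrete, here is a small sketch that evaluates a PDF and CDF, builds an empirical CDF from a sample, and runs a two-sample t-test. The two sample groups are synthetic and their parameters are arbitrary assumptions. The Statsmodels topics are not covered by this example.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(seed=0)

# Hypothetical samples drawn from two normal distributions.
group_a = rng.normal(loc=10.0, scale=2.0, size=300)
group_b = rng.normal(loc=11.5, scale=2.0, size=300)

# Theoretical PDF and CDF of a standard normal distribution at 0
pdf_at_zero = stats.norm.pdf(0.0)
cdf_at_zero = stats.norm.cdf(0.0)

# Empirical CDF of group_a, evaluated at its own median:
# the fraction of observations less than or equal to that value.
sorted_a = np.sort(group_a)
count_below = np.searchsorted(sorted_a, np.median(group_a), side="right")
ecdf_at_median = count_below / sorted_a.size

# Two-sample t-test; the null hypothesis is that both groups
# have the same mean.
t_stat, p_value = stats.ttest_ind(group_a, group_b)

print(pdf_at_zero, cdf_at_zero, ecdf_at_median)
print(t_stat, p_value)
```

A small p-value suggests the difference in sample means is unlikely under the null hypothesis; it does not by itself measure how large the difference is.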

Day 4
Machine Learning with Scikit-Learn
Duration: 1 day

Supervised Learning

• Model training and validation
• Regression problems
    Linear models
    Support vector machines
    Decision trees
• Classification problems
    K-nearest neighbors classification
    Naive Bayes classification
    Support vector machines
    Decision trees and ensemble strategies
• Model building and scoring
    Scoring functions & cross-validation
    Feature selection
    Feature extraction
• Pipelines
• Grid search parameter optimization
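
Several of the supervised learning topics above (a train/validation split, k-nearest neighbors classification, pipelines, cross-validation, and grid search) fit into one short Scikit-Learn sketch. The choice of the bundled iris dataset and of the parameter grid is an assumption made for illustration, not something specified by the course outline.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Small bundled dataset, used here only as a stand-in.
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Pipeline: scale the features, then classify with k-nearest neighbors.
pipeline = Pipeline([
    ("scale", StandardScaler()),
    ("knn", KNeighborsClassifier()),
])

# Grid search over n_neighbors, scored by 5-fold cross-validated accuracy.
# The "knn__" prefix routes the parameter to the pipeline step named "knn".
search = GridSearchCV(
    pipeline,
    param_grid={"knn__n_neighbors": [1, 3, 5, 7]},
    cv=5,
    scoring="accuracy",
)
search.fit(X_train, y_train)

# Score the refitted best model on the held-out test split.
test_accuracy = search.score(X_test, y_test)
print(search.best_params_, round(test_accuracy, 3))
```

Putting the scaler inside the pipeline means it is refitted on each cross-validation training fold, which keeps information from the validation fold out of the preprocessing step.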

Unsupervised Learning

• Feature extraction
• Clustering problems
    K-means & hierarchical clustering
    DBSCAN
• Dimensionality reduction
    PCA, LDA, NMF
• Detection & treatment of outliers
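
Two of the unsupervised topics above, PCA and k-means clustering, can be combined in a brief sketch. The data are two synthetic, well-separated blobs generated for this example; real data would rarely separate this cleanly.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA

rng = np.random.default_rng(seed=0)

# Two hypothetical, well-separated blobs in 5 dimensions.
blob_a = rng.normal(loc=0.0, scale=0.5, size=(50, 5))
blob_b = rng.normal(loc=5.0, scale=0.5, size=(50, 5))
X = np.vstack([blob_a, blob_b])

# Dimensionality reduction: project onto the first two principal components.
pca = PCA(n_components=2)
X_2d = pca.fit_transform(X)

# Clustering: k-means with two clusters on the reduced data.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0)
labels = kmeans.fit_predict(X_2d)

print(X_2d.shape, np.bincount(labels))
print(pca.explained_variance_ratio_)
```

Cluster labels from k-means are arbitrary integers, so which blob is labeled 0 and which is labeled 1 carries no meaning.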
