0% found this document useful (0 votes)

12 views36 pages

Lecture 3

The document provides an overview of machine learning (ML), including its definition, types, and applications, as well as a brief introduction to the Scikit-learn library. It outlines the differences between supervised, unsupervised, and reinforcement learning, and discusses practical examples and exercises using the K-Nearest Neighbors algorithm. The content is aimed at introducing ML concepts without delving into theoretical details, with a focus on hands-on practice using Python.

Uploaded by

Edoardo Maschio

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views36 pages

Lecture 3

Uploaded by

Edoardo Maschio

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 36

Machine learning with pytho

ESCP-Paris 2021
Slides (or images, contents) adapted from D. Dligach, C. Müller, E.
Duchesnay, M.Defferrard, E. Eaton, S. Sankararaman and many others (who
,

made their course materials freely available online).

Anh-Phuong TA
Chief Data Scientist at Le Figaro CCM-Benchmark group
taanhphuong@gmail.com

1
n

Maths …

2
Today’s lecture
• Overview of ML
• Quick introduction to Scikit-learn
• No theories (i.e., we will learn them next lecture)
Machine learning is
Wiki: ML is
Machine learning is a sub eld of computer science (more
particularly soft computing) that evolved from the study of
pattern recognition and computational learning theory in
arti cial intelligence. In 1959, Arthur Samuel de ned
machine learning as a “Field of study that gives computers
the ability to learn without being explicitly programmed”.
Machine learning explores the study and construction of
algorithms that can learn from and make predictions on
data.
fi

fi
fi
Machine learning is
When Do We Use Machine Learning?
ML is used when:
• Human expertise does not exist (navigating on Mars)
• Humans cannot explain their expertise (speech recognition)
• Algorithms must be customized (personalized medicine)
• Data exists to acquire expertise (genomics)
A classic example of a task that
requires machine learning:
More tasks that are best solved
by using a learning algorithm
• Recognizing patterns
- Facial identities or facial expressions
- Handwritten or spoken word
- Medical image
• Generating patterns:
- Generating images or motion sequences
• Recognizing anomalies:
- Unusual credit card transactions
- Unusual patterns of sensor readings in a nuclear power plant
• Prediction:
- Future stock prices or currency exchange rates
s

Some applications of ML
• Web searc
• Computational biolog
• Financ
• E-commerc
• Space exploratio
• Robotic
• Information extraction
• Social network
• Debugging software
e

Types of ML

11
Types of Learning
• Supervised (inductive) learning : Learn with a teache
– Given: labeled training instances (or examples)
– Goal: learn mapping that predicts label for test instance
• Unsupervised learning : Learn without a teacher
– Given: unlabeled inputs
– Goal: learn some intrinsic structure in inputs
• Reinforcement learning: Learn by interactin
– Given agent interacting in environment (having set of
states)
– Learn policy (state to action mapping) that maximizes agent’s
reward g

Supervised learning

• Predicting the future with supervised learnin

• Classi cation vs. Regression
fi
g

Classi cation
• Predict categorical class labels
based on past observations
• Class labels are discrete
unordered values
• Email spam classi cation
example (binary)
• Handwritten digit classi cation
example (multi-class)
fi

fi
fi

Regression
• Also a kind of
supervised learnin
• Prediction of
continuous outcome
• Predicting semester
grade scores for
students
g

Unsupervised learning

• Dealing with unlabeled dat

• Cluster analysi
• Objects within a cluster share a degree of
similarity Unsupervised learning
s

Unsupervised learning
Reinforcement Learning
• Given sequence of states and actions
with (delayed) rewards
• Learn policy that maximizes agent’s
reward
Examples:
– Game playing
– Robot in maze

The Agent-Environment Interface

Designing a Learning System
• Choose training experience
• Choose exactly what is to be learned – i.e. the
target functio
• Choose how to represent the target function
• Choose learning algorithms to infer target
function from experience
n

Feature representations
Feature representations
Ex: Iris dataset
Basic terminology
Diff. steps for building ML app
Practice: we will use sklearn
• Contains many state-of-the-art machine
learning algorithms
• Offers comprehensive documentation (hp://
scikit- learn.org/stable/documentation) about
each algorithm
• Widely used, and a wealth of tutorials (hp://
scikit- learn.org/stable/user_guide.html) and
code snippets are available
• Works well with numpy, scipy, pandas,
matplotlib, …

Building ( tting) models

All classi ers in Sci-kit learn have the same API:
fi
fi
Example
from sklearn.naive_bayes import MultinomialNB
model = MultinomialNB()

model. t(X_train, y_train)

print("train score:", model.score(X_train, y_train))

print("test score:", model.score(X_test, y_test))
X_pred = model.predict(X_test)
fi

Some toy datasets available

in sklearn

from sklearn.datasets import load_iris

iris_dataset = datasets.load_iris()
X = iris_dataset.data
y = iris_dataset.target
print("Targets: {}".format(iris_dataset['target_names'])
print("Features: {}".format(iris_dataset['feature_names'])
print("Shape of data: {}".format(iris_dataset['data'].shape)
print("First 5 rows:\n{}".format(iris_dataset['data'][:5])
print("Target names: {}".format(iris_dataset['target_names'])
print("Targets:\n{}".format(iris_dataset['target']))
)

Supervised learning: rst

algorithm

fi
SuSupervised learning with
sklearn
KNN: k=1

Return the class of nearest label

KNN: k>1

for k>1: do a vote and return the majority (or

a con dence value for each class)
fi
Training and testing data
• train_test_split : splits data randomly in 75%
training and 25% test data.
X_train, X_test, y_train, y_test = train_test_split(

iris_dataset['data'], iris_dataset['target'], random_state=0)

• Some questions
• Why 75%? Are there beer ways to split?
What if one random split yields different models than
another
• What if all examples of one class all end up in the
training/test set?
?

Testing kNN with sklearn

from sklearn.neighbors import KNeighborsClassifie

• Training a kNN model

knn = KNeighborsClassifier(n_neighbors=1
knn.fit(X_train, y_train)

• Evaluating the model

y_pred = knn.predict(X_test)
print("Score: {:.2f}".format(np.mean(y_pred == y_test)))
print("Score: {:.2f}".format(knn.score(X_test, y_test) ))

• Predicting a new example

X_new = np.array([[5, 2.9, 1, 0.2]])
prediction = knn.predict(X_new)
:

Exercise
Testing kNN with boston dataset

01 Introduction
No ratings yet
01 Introduction
28 pages
Intro To ML - 1
No ratings yet
Intro To ML - 1
29 pages
Module 1
No ratings yet
Module 1
175 pages
AI Bootcamp Sarris2024
No ratings yet
AI Bootcamp Sarris2024
64 pages
1 Lecture 1: Introduction To Machine Learning
No ratings yet
1 Lecture 1: Introduction To Machine Learning
12 pages
Unit 1
No ratings yet
Unit 1
93 pages
SEng5305-chap-1-Introduction To ML
No ratings yet
SEng5305-chap-1-Introduction To ML
85 pages
Lecture Compiled
No ratings yet
Lecture Compiled
224 pages
Introduction To ML
No ratings yet
Introduction To ML
25 pages
ML Module I
No ratings yet
ML Module I
71 pages
Unit 1
No ratings yet
Unit 1
28 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
Unit I MACHINE LEARNING
No ratings yet
Unit I MACHINE LEARNING
87 pages
ML Lecture#1
No ratings yet
ML Lecture#1
52 pages
Karthik
No ratings yet
Karthik
10 pages
Unit 1
No ratings yet
Unit 1
92 pages
Introduction To Machine Learning: Agenda
No ratings yet
Introduction To Machine Learning: Agenda
13 pages
Lec 7 - 8 - Machine Learning Introduction
No ratings yet
Lec 7 - 8 - Machine Learning Introduction
55 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
10 pages
Advanced Machine Learning Tutorial
No ratings yet
Advanced Machine Learning Tutorial
37 pages
01 Introduction
No ratings yet
01 Introduction
43 pages
Lecture 1
No ratings yet
Lecture 1
43 pages
Unit 1
No ratings yet
Unit 1
62 pages
Machine Learning
No ratings yet
Machine Learning
16 pages
Week 8
No ratings yet
Week 8
70 pages
ML Unit 1 Intro ML
No ratings yet
ML Unit 1 Intro ML
43 pages
DIR Notes 1
No ratings yet
DIR Notes 1
39 pages
Machine Learning
No ratings yet
Machine Learning
5 pages
ML Unit1
No ratings yet
ML Unit1
6 pages
Introduction To Machine Learning: WWW - Seas.upenn - Edu/ Cis519
100% (1)
Introduction To Machine Learning: WWW - Seas.upenn - Edu/ Cis519
51 pages
Applied Machine Learning
No ratings yet
Applied Machine Learning
49 pages
Introduction to Machine Learning Course
No ratings yet
Introduction to Machine Learning Course
37 pages
Algorithmeknn 121213175830 Phpapp02
No ratings yet
Algorithmeknn 121213175830 Phpapp02
52 pages
ML Short U1-4
No ratings yet
ML Short U1-4
60 pages
Lecture 01 Introduction
No ratings yet
Lecture 01 Introduction
58 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
25 pages
Asset-V1 MKAU+SEng9032+DEV 01+type@asset+block@ChapOne
No ratings yet
Asset-V1 MKAU+SEng9032+DEV 01+type@asset+block@ChapOne
29 pages
Introduction to Machine Learning
No ratings yet
Introduction to Machine Learning
28 pages
ML Intro Linear Regression
No ratings yet
ML Intro Linear Regression
312 pages
MLT Uint1
No ratings yet
MLT Uint1
26 pages
ML - Machine Learning Unit 1 Notes ML - Machine Learning Unit 1 Notes
No ratings yet
ML - Machine Learning Unit 1 Notes ML - Machine Learning Unit 1 Notes
18 pages
History and Types of Machine Learning
No ratings yet
History and Types of Machine Learning
84 pages
ML 01
No ratings yet
ML 01
15 pages
Overview of Machine Learning
No ratings yet
Overview of Machine Learning
60 pages
Chapter 5 AI
No ratings yet
Chapter 5 AI
40 pages
Unit 1 ML
No ratings yet
Unit 1 ML
93 pages
Unit-1 Introduction To Machine Learning
No ratings yet
Unit-1 Introduction To Machine Learning
24 pages
Lecture 1
No ratings yet
Lecture 1
65 pages
Machine Learning - Introduction
No ratings yet
Machine Learning - Introduction
73 pages
2024 SCU ML 1 2 Introduction
No ratings yet
2024 SCU ML 1 2 Introduction
35 pages
Module 1 - Intro To ML - V2
No ratings yet
Module 1 - Intro To ML - V2
47 pages
MLP Unit-I
No ratings yet
MLP Unit-I
62 pages
Lecture01 Introduction To Machine Learning (Chapter1)
No ratings yet
Lecture01 Introduction To Machine Learning (Chapter1)
64 pages
ML Microst
No ratings yet
ML Microst
264 pages
ML Cahp 1
No ratings yet
ML Cahp 1
35 pages
ML 1
No ratings yet
ML 1
9 pages
Intro To Machine Learning
100% (1)
Intro To Machine Learning
250 pages
Ch7 Introduction To Machine Learning
No ratings yet
Ch7 Introduction To Machine Learning
29 pages
MLUnit - 1 Share
No ratings yet
MLUnit - 1 Share
162 pages
Experiment No 2
No ratings yet
Experiment No 2
11 pages
Beginner's Guide to Competitive Programming
No ratings yet
Beginner's Guide to Competitive Programming
14 pages
Data Visualization Assignment Guide
No ratings yet
Data Visualization Assignment Guide
3 pages
Single Precision and Double Precision
No ratings yet
Single Precision and Double Precision
2 pages
Graph Search Algorithms Explained
No ratings yet
Graph Search Algorithms Explained
64 pages
Quiz
No ratings yet
Quiz
2 pages
(2000) A Near-Optimal Solution To A Two-Dimensional Cutting Stock Prob
No ratings yet
(2000) A Near-Optimal Solution To A Two-Dimensional Cutting Stock Prob
13 pages
Dependency Preservation Explained
No ratings yet
Dependency Preservation Explained
24 pages
Paper 49
No ratings yet
Paper 49
10 pages
Algorithm Analysis & Complexity
No ratings yet
Algorithm Analysis & Complexity
26 pages
6.2 Worksheet - Sorting Algorithms (Oly)
No ratings yet
6.2 Worksheet - Sorting Algorithms (Oly)
2 pages
Simulation: Chapter - 13
No ratings yet
Simulation: Chapter - 13
10 pages
Brochure - Control Sys
No ratings yet
Brochure - Control Sys
2 pages
Elliptic Curve Cryptography: Presented by Nemi Chandra Rathore M.Tech WCC IWC2008013
No ratings yet
Elliptic Curve Cryptography: Presented by Nemi Chandra Rathore M.Tech WCC IWC2008013
34 pages
Linear Discriminant Analysis
No ratings yet
Linear Discriminant Analysis
27 pages
Automated Recognition of Cattles Breed Through Computer Vision A Case Study On T
No ratings yet
Automated Recognition of Cattles Breed Through Computer Vision A Case Study On T
5 pages
Fourier Series Method FIR Filter Design
No ratings yet
Fourier Series Method FIR Filter Design
16 pages
PSRC Working Group C43
No ratings yet
PSRC Working Group C43
111 pages
Linear Regression for Analysts
No ratings yet
Linear Regression for Analysts
24 pages
01 Introduction
No ratings yet
01 Introduction
14 pages
FALLSEM2023-24 MEE1014 TH VL2023240101810 2023-10-13 Reference-Material-I
No ratings yet
FALLSEM2023-24 MEE1014 TH VL2023240101810 2023-10-13 Reference-Material-I
44 pages
Language Modelling-NGRAM, NeuralLM
No ratings yet
Language Modelling-NGRAM, NeuralLM
16 pages
Detecting Low-Rate DoS/DDoS with Deep Learning
No ratings yet
Detecting Low-Rate DoS/DDoS with Deep Learning
7 pages
MLT Unit 5 12m
No ratings yet
MLT Unit 5 12m
25 pages
Chapter 6
No ratings yet
Chapter 6
35 pages
AI DL ML Dott Lezioni2019 6
No ratings yet
AI DL ML Dott Lezioni2019 6
35 pages
Chapter 3 Unity Feedback Systems
No ratings yet
Chapter 3 Unity Feedback Systems
9 pages
Bernoulli Distribution Notes
No ratings yet
Bernoulli Distribution Notes
2 pages
A-Level RDMS & Keys
No ratings yet
A-Level RDMS & Keys
18 pages
SaqlainAbbas Assignment02
No ratings yet
SaqlainAbbas Assignment02
4 pages

Lecture 3

Uploaded by

Lecture 3

Uploaded by

Machine learning with pytho

made their course materials freely available online).

• Predicting the future with supervised learnin

• Dealing with unlabeled dat

The Agent-Environment Interface

Building ( tting) models

model. t(X_train, y_train)

print("train score:", model.score(X_train, y_train))

Some toy datasets available

from sklearn.datasets import load_iris

Supervised learning: rst

Return the class of nearest label

for k>1: do a vote and return the majority (or

iris_dataset['data'], iris_dataset['target'], random_state=0)

Testing kNN with sklearn

• Training a kNN model

• Evaluating the model

• Predicting a new example

You might also like