10-601
Introduction to Machine Learning
Machine Learning Department
School of Computer Science
Carnegie Mellon University
Machine Learning in Practice + k-Nearest Neighbors
Matt Gormley
Lecture 2
January 23, 2017

Intro Readings: Mitchell 1; HTF 1, 2; Murphy 1; Bishop 1
KNN Readings: Mitchell 8.2; HTF 13.3; Murphy ---; Bishop 2.5.2
Reminders
• Background Test
– Tue, Jan. 24 at 6:30pm
– Your test location depends on your
registration status – see Piazza for details
• Background Exercises (Homework 1)
– Released: Tue, Jan. 24 after the test
– Due: Mon, Jan. 30 at 5:30pm
Machine Learning & Ethics
What ethical responsibilities do we have as machine learning experts?
(Some topics that we won't cover probably deserve an entire course.)
If our search results for news are optimized for ad revenue, might they reflect gender / racial / socio-economic biases?
http://bing.com/
http://arstechnica.com/
Should restrictions be placed on
intelligent agents that are capable of
interacting with the world?
How do autonomous vehicles make
decisions when all of the outcomes
are likely to be negative?
http://vizdoom.cs.put.edu.pl/
Outline
• Defining Learning Problems
– Artificial Intelligence (AI)
– Mitchell’s definition of learning
– Example learning problems
– Data annotation
– The Machine Learning framework
• Classification
– Binary classification
– 2D examples
– Decision rules / hypotheses
• k-Nearest Neighbors (KNN)
– KNN for binary classification
– Distance functions
– Special cases
– Choosing k (covered next lecture)
This section is based on Chapter 1 of (Mitchell, 1997)
DEFINING LEARNING PROBLEMS
Artificial Intelligence
The basic goal of AI is to develop intelligent
machines.
This consists of many sub-goals:
• Perception
• Reasoning
• Control / Motion / Manipulation
• Planning
• Communication
• Creativity
• Learning
(Figure: Machine Learning shown as a subfield of Artificial Intelligence)
Amazon Go
https://www.amazon.com/b?node=16008589011
https://www.youtube.com/watch?v=NrmMk1Myrxc
Slide from Roni Rosenfeld
Artificial Intelligence (AI):
Example Tasks:
– Identify objects in an image
– Translate from one human language to another
– Recognize speech
– Assess risk (e.g. in loan application)
– Make decisions (e.g. in loan application)
– Assess potential (e.g. in admission decisions)
– Categorize a complex situation (e.g. medical diagnosis)
– Predict outcome (e.g. medical prognosis, stock prices,
inflation, temperature)
– Predict events (default on loans, quitting school, war)
– Plan ahead under perfect knowledge (chess)
– Plan ahead under partial knowledge (Poker, Bridge)
© Roni Rosenfeld, 2016
Well-Posed Learning Problems
Three components:
1. Task, T
2. Performance measure, P
3. Experience, E
Mitchell’s definition of learning:
A computer program learns if its performance
at tasks in T, as measured by P, improves with
experience E.
Definition from (Mitchell, 1997)
Example Learning Problems
(historical perspective)
1. Learning to recognize spoken words
THEN NOW
“…the SPHINX system (e.g.
Lee 1989) learns speaker-
specific strategies for
recognizing the primitive
sounds (phonemes) and
words from the observed
speech signal…neural
network methods…hidden
Markov models…”
(Mitchell, 1997)
Source: https://www.stonetemple.com/great-knowledge-box-showdown/#VoiceStudyResults
Example Learning Problems
(historical perspective)
2. Learning to drive an autonomous vehicle
THEN NOW
“…the ALVINN system
(Pomerleau 1989) has used
its learned strategies to drive
unassisted at 70 miles per
hour for 90 miles on public
highways among other
cars…”
(Mitchell, 1997)
waymo.com
Example Learning Problems
(historical perspective)
2. Learning to drive an autonomous vehicle
THEN NOW
“…the ALVINN system
(Pomerleau 1989) has used
its learned strategies to drive
unassisted at 70 miles per
hour for 90 miles on public
highways among other
cars…”
(Mitchell, 1997)
https://www.geek.com/wp-content/uploads/2016/03/uber.jpg
Example Learning Problems
(historical perspective)
3. Learning to beat the masters at board games
THEN NOW
“…the world’s top computer
program for backgammon,
TD-GAMMON (Tesauro,
1992, 1995), learned its
strategy by playing over one
million practice games
against itself…”
(Mitchell, 1997)
Example Learning Problems
3. Learning to beat the masters at chess
1. Task, T:
2. Performance measure, P:
3. Experience, E:
Example Learning Problems
4. Learning to respond to voice commands (Siri)
1. Task, T:
2. Performance measure, P:
3. Experience, E:
Capturing the Knowledge of Experts
1980 1990 2000 2010
Solution #1: Expert Systems
• Over 20 years ago, we had rule-based systems
• Ask the expert to
1. Obtain a PhD in Linguistics
2. Introspect about the structure of their native language
3. Write down the rules they devise

Example rules:
"Give me directions to Starbucks"
If: "give me directions to X"    Then: directions(here, nearest(X))
"How do I get to Starbucks?"
If: "how do i get to X"    Then: directions(here, nearest(X))
"Where is the nearest Starbucks?"
If: "where is the nearest X"    Then: directions(here, nearest(X))
Capturing the Knowledge of Experts
1980 1990 2000 2010
Solution #1: Expert Systems
• Over 20 years ago, we had rule-based systems
• Ask the expert to
1. Obtain a PhD in Linguistics
2. Introspect about the structure of their native language
3. Write down the rules they devise

But users keep phrasing the same request in new ways, and each new phrasing needs another hand-written rule:
"I need directions to Starbucks"
If: "I need directions to X"    Then: directions(here, nearest(X))
"Starbucks directions"
If: "X directions"    Then: directions(here, nearest(X))
"Is there a Starbucks nearby?"
If: "Is there an X nearby"    Then: directions(here, nearest(X))
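To make this concrete, here is a minimal sketch of what such a hand-written rule system might look like in Python. The rule patterns and the directions/nearest helpers are hypothetical stand-ins for illustration, not code from the lecture.

import re

# Hypothetical rule-based "expert system" for direction requests.
# Each pattern is a rule hand-written by an expert.
RULES = [
    r"give me directions to (?P<place>.+)",
    r"how do i get to (?P<place>.+)",
    r"where is the nearest (?P<place>.+)",
    r"i need directions to (?P<place>.+)",  # every new phrasing needs another rule
]

def directions(origin, destination):
    # Placeholder for a real routing lookup.
    return "route from %s to nearest %s" % (origin, destination)

def respond(utterance):
    text = utterance.lower().rstrip("?")
    for pattern in RULES:
        match = re.match(pattern, text)
        if match:
            return directions("here", match.group("place"))
    return "Sorry, I don't understand."  # phrasings with no rule fall through

print(respond("How do I get to Starbucks?"))    # matched by a rule
print(respond("Is there a Starbucks nearby?"))  # no rule yet, so it fails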
Capturing the Knowledge of Experts
1980 1990 2000 2010
Solution #2: Annotate Data and Learn
• Experts:
– Very good at answering questions about specific
cases
– Not very good at telling HOW they do it
• 1990s: So why not just have them tell you what
they do on SPECIFIC CASES and then let
MACHINE LEARNING tell you how to come to
the same decisions that they did
Capturing the Knowledge of Experts
1980 1990 2000 2010
Solution #2: Annotate Data and Learn
1. Collect raw sentences {x1, …, xn}
2. Experts annotate their meaning {y1, …, yn}
x1: How do I get to Starbucks?
y1: directions(here, nearest(Starbucks))
x2: Show me the closest Starbucks
y2: map(nearest(Starbucks))
x3: Send a text to John that I'll be late
y3: txtmsg(John, I'll be late)
x4: Set an alarm for seven in the morning
y4: setalarm(7:00AM)
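In code, the annotated data might simply be a list of (sentence, meaning) pairs that a learning algorithm consumes; the list structure below is an assumed representation, and the logical forms are just the slide's examples.

# Hypothetical training set: (raw sentence, expert-annotated meaning) pairs.
training_data = [
    ("How do I get to Starbucks?", "directions(here, nearest(Starbucks))"),
    ("Show me the closest Starbucks", "map(nearest(Starbucks))"),
    ("Send a text to John that I'll be late", "txtmsg(John, I'll be late)"),
    ("Set an alarm for seven in the morning", "setalarm(7:00AM)"),
]

# A learning algorithm takes these (x, y) pairs and returns a function
# that maps new, unseen sentences to the corresponding actions.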
Example Learning Problems
4. Learning to respond to voice commands (Siri)
1. Task, T:
predicting action from speech
2. Performance measure, P:
percent of correct actions taken in user pilot
study
3. Experience, E:
examples of (speech, action) pairs
Slide from Roni Rosenfeld
The Machine Learning Framework
• Formulate a task as a mapping from input to output
– Task examples will usually be pairs: (input, correct_output)
• Formulate performance as an error measure
– or more generally, as an objective function (aka Loss function)
• Examples:
– Medical Diagnosis
• mapping input to one of several classes/categories → Classification
– Predict tomorrow's Temperature
• mapping input to a number → Regression
– Chance of Survival: From patient data to p(survive >= 5 years)
• mapping input to probability → Density estimation
– Driving recommendation
• mapping input into a plan → Planning
© Roni Rosenfeld, 2016
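A minimal sketch of this framework in Python, under the assumption that a hypothesis is a function from inputs to outputs and that performance is an average loss over examples; all names here are illustrative, not from the slides.

from typing import Callable, List, Tuple, TypeVar

X = TypeVar("X")  # input type, e.g. patient features
O = TypeVar("O")  # output type, e.g. a class label or a number

def average_loss(h: Callable[[X], O],
                 examples: List[Tuple[X, O]],
                 loss: Callable[[O, O], float]) -> float:
    # Performance measure P: mean loss of hypothesis h on (input, correct_output) pairs.
    return sum(loss(h(x), y) for x, y in examples) / len(examples)

# Classification: zero-one loss (error rate).
zero_one = lambda prediction, truth: 0.0 if prediction == truth else 1.0

# Regression: squared error.
squared = lambda prediction, truth: (prediction - truth) ** 2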
Slide from Roni Rosenfeld
Choices in ML Formulation
Often, the same task can be formulated in more than one way:
• Ex. 1: Loan applications
– creditworthiness/score (regression)
– probability of default (density estimation)
– loan decision (classification)
• Ex. 2: Chess
– Nature of available training examples/experience:
• expert advice (painful to experts)
• games against experts (less painful but limited, and not much control)
• experts' games (almost unlimited, but only "found data" – no control)
• games against self (unlimited, flexible, but can you learn this way?)
– Choice of target function: board→move vs. board→score
© Roni Rosenfeld, 2016
Slide from Roni Rosenfeld
How to Approach a Machine Learning Problem
1. Consider your goal → definition of task T
– E.g. make good loan decisions, win chess competitions, …
2. Consider the nature of available (or potential) experience E
– How much data can you get? What would it cost (in money, time or effort)?
3. Choose type of output O to learn
– (Numerical? Category? Probability? Plan?)
4. Choose the Performance measure P (error/loss function)
5. Choose a representation for the input X
6. Choose a set of possible solutions H (hypothesis space)
– set of functions h: X → O
– (often, by choosing a representation for them)
7. Choose or design a learning algorithm
– for using examples (E) to converge on a member of H that optimizes P
© Roni Rosenfeld, 2016
CLASSIFICATION
Fisher Iris Dataset
Fisher (1936) used measurements of 150 flowers from 3 different species: Iris setosa (0), Iris virginica (1), and Iris versicolor (2), collected by Anderson (1936)
Species   Sepal Length (cm)   Sepal Width (cm)   Petal Length (cm)   Petal Width (cm)
0 4.3 3.0 1.1 0.1
0 4.9 3.6 1.4 0.1
0 5.3 3.7 1.5 0.2
1 4.9 2.4 3.3 1.0
1 5.7 2.8 4.1 1.3
1 6.3 3.3 4.7 1.6
1 6.7 3.0 5.0 1.7
Full dataset: https://en.wikipedia.org/wiki/Iris_flower_data_set
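For readers who want to reproduce these numbers, here is a short sketch that loads the full dataset with scikit-learn (assuming it is installed); variable names are illustrative.

from sklearn.datasets import load_iris

iris = load_iris()
X = iris.data    # shape (150, 4): sepal length, sepal width, petal length, petal width
y = iris.target  # shape (150,): species labels 0, 1, 2
                 # (note: scikit-learn's 0/1/2 numbering may differ from the table above)

# Keep just two features for an easy 2D view of the classification problem.
X2 = X[:, :2]
print(X2[:3], y[:3])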
Fisher Iris Dataset
Classification
Whiteboard:
– Binary classification
– 2D examples
– Decision rules / hypotheses
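To preview the whiteboard discussion, a decision rule (hypothesis) for binary classification is just a function from a feature vector to a label. The threshold and feature choice below are made up for illustration, not taken from the lecture.

# A hypothesis h: R^2 -> {0, 1} given by a simple hand-picked decision rule.
# Hypothetical rule: predict class 1 if the petal length exceeds a threshold.
def h(x, threshold=2.5):
    sepal_length, petal_length = x
    return 1 if petal_length > threshold else 0

print(h((4.9, 1.4)))  # -> 0
print(h((6.3, 4.7)))  # -> 1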
K-NEAREST NEIGHBORS
k-Nearest Neighbors
Whiteboard:
– KNN for binary classification
– Distance functions
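A minimal sketch of KNN for binary classification with a Euclidean distance function, to make the whiteboard material concrete; this is a generic illustration (function and variable names are my own), not code from the course.

import math
from collections import Counter

def euclidean(a, b):
    # One common choice of distance function.
    return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))

def knn_predict(x, train, k=3, dist=euclidean):
    # train is a list of (feature_vector, label) pairs with labels in {0, 1}.
    neighbors = sorted(train, key=lambda pair: dist(x, pair[0]))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]  # majority vote among the k nearest

train = [((4.3, 1.1), 0), ((4.9, 1.4), 0), ((5.7, 4.1), 1), ((6.3, 4.7), 1)]
print(knn_predict((5.0, 1.3), train, k=3))  # -> 0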
Takeaways
• Learning Problems
– Defining a learning problem is tricky
– Formalizing exposes the many possibilities
• k-Nearest Neighbors
– KNN is an extremely simple algorithm for
classification