CS 677 Pattern Recognition
Week 1: Introduction
                     Dr. Amr El-Wakeel
Lane Department of Computer Science and Electrical Engineering
                          Spring 2024
                     Course Instructor
• Dr. Amr El-Wakeel, Assistant Professor, Lane Department of CSEE
• Research Interests: CAVs, ITS, internet of things, and healthcare informatics
e-mail: ase00006@mix.wvu.edu
Office: AERB 253,
Office hours: Mondays 1-2 pm
Or by e-mail appointment
Acknowledgment: Dr. Omid Dehzangi (course design and material)
           Class Overview
• Textbook
• Homeworks, exams, grading
• Course topics
                            Textbook
• Main Book:
   – Pattern Classification by Duda, Hart and Stork, Second Edition, ISBN: 978-0471056690
• Suggested Material that will help you:
   – C. M. Bishop, "Pattern Recognition and Machine Learning", 2006
   – Angel R. Martinez and Wendy L. Martinez, Computational Statistics
     Handbook with MATLAB (3rd Edition or later)
      Class Overview
   Homework                                          15%
   Course project (reports, code, presentations)     50%
   Exams (dates will be discussed in class)          35%
      – Exam 1: 15%
      – Exam 2: 20%
                         Course Topics
•   Course description: The course will introduce graduate students to several topics
    in machine learning and pattern recognition, including the following suggested
    topics:
      -Introduction to pattern recognition
      -Bayesian decision theory
      -Nearest-neighbor
      -Linear discriminant functions
      -Linear regression
      -Logistic regression
      -Gradient descent
      -Support vector machines
      -Clustering
      -Feature Extraction and Reduction
      -Neural Networks
      -Selected topics (if time permits)
                               Terminology
• Pattern Recognition: “the act of taking raw data and taking an action based on the
  category of the pattern.”
• Common Applications: speech recognition, fingerprint identification (biometrics),
  anomaly detection, DNA sequence identification
• Related Terminology:
▪ Data mining: the process of finding anomalies, patterns and correlations within large
data sets to predict outcomes.
▪ Machine Learning: The ability of a machine to improve its performance based on
previous results.
▪ Machine Understanding: acting on the intentions of the user
generating the data.
• Related Fields: artificial intelligence, signal processing and discipline-specific research
  (e.g., target recognition, speech recognition, natural language processing).
    License Plate Recognition
    Biometric Recognition
     Fingerprint Classification
     Face Detection
     Autonomous Systems
           Medical Applications
     Skin Cancer Detection   Breast Cancer Detection
     Land Cover Classification
           (from aerial or satellite images)
           Knowledge Discovery Process
   [Figure: the knowledge discovery pipeline]
   Databases -> Data Integration -> Data Cleaning -> Preprocessed Data
   -> Feature Extraction / Selection -> Data Transformation
   -> Data Mining / Machine Learning / Pattern Recognition
   -> Knowledge Interpretation
▪ “Big” data arises in many forms:
   ▪   Physical Measurements: from science (physics, astronomy)
   ▪   Medical data: biometric sequences, detailed time series
   ▪   Activity data: GPS location, body sensor activities
   ▪   Business data: customer behavior tracking at fine detail
▪ Common themes:
   ▪ Data is large, and growing
   ▪ There are important patterns
     and trends in the data
   ▪ We don’t fully know where to look
     or how to find them
                Reducing the Data
▪ Although “big” data is about more than just the volume…
         …most big data is big!
▪ It is not always possible to store the data in full
   • Many applications (telecoms, ISPs, search engines, Sensor data)
       can’t keep everything
▪ It is inconvenient to work with data in full
   • Just because we can, doesn’t mean we should (Human
       behavior)
▪ It is faster to work with a compact summary
   • Better to explore data on a laptop
   than a cluster
              Sampling the Data
▪ Sampling has an intuitive semantics
   • We obtain a smaller data set with the same structure
▪ Estimating on a sample is often straightforward
   • Run the analysis on the sample that you would on the full
     data
   • Some rescaling/reweighting may be necessary
▪ Sampling is general and agnostic to the analysis to be
  done
   • Though sampling can be tuned to optimize some criteria
▪ Sampling is (usually) easy to understand
   • So prevalent that we have an intuition about sampling
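As a toy illustration (a hypothetical Python sketch, not part of the course materials): estimate a population total from a simple random sample, with the rescaling step the slide mentions.

```python
import random

def estimate_total(population, sample_size, seed=0):
    """Estimate the population total from a simple random sample.

    The sample mean is an unbiased estimate of the population mean,
    so rescaling it by the population size estimates the total.
    """
    random.seed(seed)
    sample = random.sample(population, sample_size)
    return len(population) * sum(sample) / sample_size

population = list(range(1, 1001))   # true total = 500500
print(round(estimate_total(population, sample_size=100)))  # near 500500, not exact
```

Running the same analysis on the sample that you would on the full data, then rescaling, is exactly the pattern described above.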
Sampling as a Mediator of Constraints
   [Diagram: sampling balances three sets of constraints]
   • Data Characteristics (Correlations)
   • Resource Constraints (Bandwidth, Storage, CPU, GPU)
   • Query Requirements (Ad Hoc, Accuracy, Aggregates, Speed)
                Sampling
▪ What is your population of interest?
     • To whom do you want to generalize your
       results?
         –All doctors
         –School children
         –Nationality
         –Women aged 15-45 years
         –Other
▪ Can you sample the entire population?
   [Figure: nested sets: the sample is drawn from the study population, which lies within the target population]
         Population definition
▪ A population can be defined as including all people
  or items with the characteristic one wishes to
  understand.
▪ Because there is very rarely enough time or money
  to gather information from everyone or everything
  in a population, the goal becomes finding a
  representative sample (or subset) of that
  population.
              Data Everywhere!
▪ Lots of data is being collected and warehoused
  – Web data, e-commerce
  – Sensor data
  – Purchases at department/
    grocery stores
  – Bank/Credit Card
    transactions
  – Social Network
                 Type of Data
▪ Relational Data (Structured data:
  Tables/Transaction/Legacy Data)
▪ Matrix (Biometrics)
▪ Text Data (Unstructured data: Web)
▪ XML (Semi-structured Data)
▪ Graph Data
   • Social Network, Semantic Web (RDF), …
▪ Streaming Data
   • You can only scan the data once
      What to do with these data?
▪ Aggregation and Statistics
  – Data warehouse
▪ Indexing, Searching, and Querying
  – Keyword based search
  – Pattern matching (e.g. XML)
▪ Knowledge discovery
  – Data Analytics
  – Statistical Modeling
          Random Sample and Statistics
▪ Population: is used to refer to the set or universe of all entities
  under study.
▪ However, looking at the entire population may not be feasible,
  or may be too expensive.
▪ Instead, we draw a random sample from the population, and
  compute appropriate statistics from the sample, that give
  estimates of the corresponding population parameters of
  interest.
        Ex: Time Series Analysis
•   Example: Stock Market
•   Predict future values
•   Determine similar patterns over time
•   Classify behavior
               Nature of Data
   Many observations on many variables.

Data file (1,500,000 observations, 100 variables):

   OBS No.     Target Var.   Var. 1   Var. 2   ...   Var. 100
   1           0             63       .        ...   .
   2           1             54       .        ...   .
   3           0             44       .        ...   .
   ...         ...           ...      .        ...   .
   1,500,000   1             32       .        ...   .
         Types of Problems
▪ Customer and Student Retention
▪ Detection of patient’s symptoms
▪ Credit Scoring (Auto or Home Loans)
▪ Bond Ratings
▪ Detection of Fraudulent Insurance Claims
▪ Is a Newly Introduced Product Meeting with
  Consumer Acceptance or Rejection?
▪ Who is a likely Donor to your Charity?
▪ Early Detection of a Stolen or Compromised
  Credit Card
    Data Rich, Information Poor
▪ The Amount of Raw Data Stored in Corporate Databases is
  Exploding
▪ Most of this information is recorded instantaneously and with
  minimal cost
▪ Databases are measured in gigabytes and terabytes (one terabyte = one trillion
  bytes, equivalent to about 2 million books!)
▪ Walmart uploads 20 million point-of-sale transactions to 500
  parallel processing storage devices each day.
▪ Raw data by itself, however, does not provide much information. That is where
  data analytics comes in!
Learning/Modeling/Decision making
        Learning Task Examples
• Classification maps data into predefined groups or
  classes
   – Supervised learning
   – Pattern recognition
   – Prediction
• Regression is used to map a data item to a real valued
  prediction variable
• Clustering groups similar data together into clusters
   – Unsupervised learning
   – Segmentation
   – Anomaly detection
• Dimensionality Reduction transforms data from a high-dimensional space into a
  low-dimensional space while retaining meaningful properties of the original data
      Prediction Problem
▪ Early Detection of a Stolen or
  Compromised Credit Card
 Not So Interested in How or Why the
  Credit Card was Stolen but Instead
   Whether Recent Transactions are
Indicative of a Stolen or Compromised
              Credit Card
                      Prediction:
             Classification vs. Regression
▪ Classification:
   – predicts categorical class labels
   – classifies data (constructs a model) based on the training set
     and the values (class labels) in a classifying attribute and
     uses it in classifying new data
▪ Regression:
   – models continuous-valued functions, i.e., predicts unknown
     or missing values
▪ Typical Applications
   –   credit approval
   –   target marketing
   –   medical diagnosis
   –   treatment effectiveness analysis
         Classification: Definition
▪ Given a collection of records (training set)
   – Each record contains a set of attributes, one of the attributes is the
     class.
▪ Find a model for class attribute as a function of the
  values of other attributes.
▪ Goal: previously unseen records should be
  assigned a class as accurately as possible.
   – A test set is used to determine the accuracy of the model. Usually,
     the given data set is divided into training and test sets, with training
     set used to build the model and test set used to validate it.
    Classification—A Two-Step Process
▪ Model construction: describing a set of predetermined classes
   – Each tuple/sample is assumed to belong to a predefined class, as
     determined by the class label attribute
   – The set of tuples used for model construction: training set
   – The model is represented as classification rules, decision trees, or
     mathematical formulae
▪ Model usage: for classifying future or unknown objects
   – Estimate accuracy of the model
      • The known label of test sample is compared with the classified
         result from the model
      • Accuracy rate is the percentage of test set samples that are
         correctly classified by the model
      • Test set is independent of training set, otherwise over-fitting will
         occur
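A minimal sketch of the two-step process in Python, using made-up data and a simple 1-nearest-neighbor rule as a stand-in for whatever model is learned:

```python
def nearest_neighbor(train, query):
    """Classify query by the label of the closest training point (1-NN)."""
    def dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(train, key=lambda rec: dist(rec[0], query))[1]

# Step 1: model construction from labeled training tuples (hypothetical data).
train = [((1.0, 1.0), "no"), ((1.2, 0.9), "no"),
         ((5.0, 5.0), "yes"), ((5.2, 4.8), "yes")]

# Step 2: model usage -- estimate accuracy on an independent test set.
test = [((0.9, 1.1), "no"), ((5.1, 5.1), "yes"), ((4.9, 5.3), "yes")]
correct = sum(nearest_neighbor(train, x) == y for x, y in test)
print(f"accuracy = {correct / len(test):.2f}")
```

Keeping the test set independent of the training set, as the slide insists, is what makes the accuracy estimate meaningful.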
Illustrating Classification Task

Training Set (Induction: a learning algorithm learns a model from labeled records):

   Tid   Attrib1   Attrib2   Attrib3   Class
   1     Yes       Large     125K      No
   2     No        Medium    100K      No
   3     No        Small     70K       No
   4     Yes       Medium    120K      No
   5     No        Large     95K       Yes
   6     No        Medium    60K       No
   7     Yes       Large     220K      No
   8     No        Small     85K       Yes
   9     No        Medium    75K       No
   10    No        Small     90K       Yes

Test Set (Deduction: the learned model is applied to records with unknown class labels):

   Tid   Attrib1   Attrib2   Attrib3   Class
   11    No        Small     55K       ?
   12    Yes       Medium    80K       ?
   13    Yes       Large     110K      ?
   14    No        Small     95K       ?
   15    No        Large     67K       ?
              An Example Data Set and Decision Tree

   #    outlook   company   sailboat   sail?
   3    sunny     med       big        yes
   4    sunny     no        small      yes
   5    sunny     big       big        yes
   6    rainy     no        small      no
   7    rainy     med       small      yes
   8    rainy     big       big        yes
   9    rainy     no        big        no
   10   rainy     med       big        no

Decision tree:
   outlook = sunny  -> yes
   outlook = rainy  -> test company:
      company = no  -> no
      company = big -> yes
      company = med -> test sailboat:
         sailboat = small -> yes
         sailboat = big   -> no
  Examples of Classification Task
• Predicting tumor cells as benign or malignant
• Classifying credit card transactions
  as legitimate or fraudulent
• Classifying whether a driver is stressed or not
• Categorizing news stories as finance, weather, entertainment, sports, etc.
             Issues regarding classification
Issues (1): Data Preparation
 • Data cleaning
     – Preprocess data in order to reduce noise and handle missing values
 • Relevance analysis (feature selection)
     – Remove the irrelevant or redundant attributes
 • Data transformation
     – Generalize and/or normalize data
           Issues regarding classification
Issues (2): Evaluating Classification Methods
• Predictive accuracy
• Speed and scalability
   – time to construct the model
   – time to use the model
• Robustness
   – handling noise and missing values
• Generalization
   – how well the model generalizes to unseen data
• Interpretability:
   – understanding and insight provided by the model
• Goodness of rules
   – decision tree size
   – compactness of classification rules
Error Analysis for a Two Class Problem

[Figure: two class-conditional distributions (Negative and Positive) separated by a
decision threshold f_th, dividing the plane into four regions]
   1: True Negative (TN)
   2: False Negative (FN)
   3: True Positive (TP)
   4: False Positive (FP)
These four counts form the confusion matrix.
Evaluation Criteria
   Accuracy = (TP + TN) / (P + N)
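Accuracy is (TP + TN) / (P + N); this Python sketch tallies the confusion-matrix counts for hypothetical label lists:

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Tally the four confusion-matrix cells for a two-class problem."""
    tp = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    tn = sum(t != positive and p != positive for t, p in zip(y_true, y_pred))
    fp = sum(t != positive and p == positive for t, p in zip(y_true, y_pred))
    fn = sum(t == positive and p != positive for t, p in zip(y_true, y_pred))
    return tp, tn, fp, fn

def accuracy(y_true, y_pred):
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    return (tp + tn) / (tp + tn + fp + fn)   # (TP + TN) / (P + N)

y_true = [1, 1, 1, 0, 0, 0]   # hypothetical true labels
y_pred = [1, 1, 0, 0, 0, 1]   # hypothetical model predictions
print(accuracy(y_true, y_pred))   # 4 correct out of 6
```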
          Multi-class classification
▪ Multi-class vs. binary
  classification
   –   one vs. all (one vs. many, one
       vs. rest)
   –   N classes ➔ train N
       classifiers
   –   each classifier uses one class
       for the positive examples and
       the rest classes for the
       negative examples
▪ Combine the results: select the
  classifier with the highest
  confidence score
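A sketch of the one-vs-all combination step; the per-class scorers here are hypothetical stand-ins for the N trained classifiers:

```python
def one_vs_rest_predict(scorers, x):
    """Pick the class whose one-vs-rest classifier is most confident about x.

    `scorers` maps each class label to a function returning a confidence
    score that x belongs to that class (versus the rest).
    """
    return max(scorers, key=lambda label: scorers[label](x))

# Hypothetical per-class scorers: confidence falls with distance from a center.
centers = {"cat": 0.0, "dog": 5.0, "bird": 10.0}
scorers = {label: (lambda x, c=c: -abs(x - c)) for label, c in centers.items()}

print(one_vs_rest_predict(scorers, 4.2))   # "dog": its scorer is most confident
```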
 Bias, variance, generalization error
▪ A model underfits the training data, if it does not
  capture all of the structure available from the data.
  (b)
▪ A model overfits if it captures too many of the
  idiosyncrasies of the training data. (d)
▪ What does it mean to overfit or underfit?
▪ Assume we are doing regression.
▪ Suppose we have a training set
      S_train = {(x^(1), y^(1)), ..., (x^(m), y^(m))}
  from some distribution D.
▪ Define the average training error of a hypothesis h:
      ε_train(h) = (1/m) Σ_{i=1}^{m} (h(x^(i)) − y^(i))^2
▪ We are interested in the generalization error:
      ε(h) = E_{(x,y)~D}[ (h(x) − y)^2 ]
▪ Both underfitting and overfitting lead to high generalization error.
  (previous figure)
(a) linear regression fits of linear function to 3 different training sets
     randomly selected over the interval [0,4] ➔ low variance
(b) linear model after parameters are averaged over 50,000 trials ➔ high bias
     (underestimate in the mid-range, overestimate near the ends)
(c) linear regression fits of fourth-order polynomial to 3 random training sets
     ➔ high variance
(d) model after averaged over 50,000 trials ➔ low bias
▪ We can’t directly find out the generalization error.
▪ Instead, we estimate generalization error using the test error.
      ε_test(h) = (1/m) Σ_{i=1}^{m} (h(x_test^(i)) − y_test^(i))^2
▪ Diagram on the right
  shows how training and
  test error vary as a
  function of model
  complexity.
▪ (e.g.) model complexity:
  degree of polynomial;
  size of decision tree (depth);
  number of features
▪ Define training error as the proportion of training
  examples that are misclassified.
      ε_train(h) = (1/m) Σ_{i=1}^{m} I{ h(x_train^(i)) ≠ y_train^(i) }
where I {} is an indicator function such that I{true}=1,
I{false}=0.
▪ Generalization error is defined as the probability of a
  new example being misclassified
      ε(h) = P_{(x,y)~D}( h(x) ≠ y )
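The 0/1 training error (fraction of misclassified examples) translates directly to code; the threshold classifier below is a hypothetical example:

```python
def training_error(h, xs, ys):
    """Fraction of examples misclassified by hypothesis h (0/1 loss)."""
    return sum(h(x) != y for x, y in zip(xs, ys)) / len(xs)

h = lambda x: 1 if x >= 0.5 else 0   # hypothetical threshold classifier
xs = [0.1, 0.4, 0.6, 0.9, 0.45]
ys = [0, 0, 1, 1, 1]                 # the last example falls on the wrong side
print(training_error(h, xs, ys))     # 1 of 5 misclassified -> 0.2
```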
   Bias and variance in practice
▪ How to choose a model with a good tradeoff between bias and
  variance:
    ▪ what if your learned model gives poor generalization error?
    ▪ collect more data? use fewer features? more features? adopt a different learning
      algorithm?
▪ If your model has high bias, it is too simple.
▪ If your model has high variance, it is too complex.
▪ Compare training and test errors
    ▪ if they are very different, your model is likely to be high variance
    ▪ if they are almost the same, your model is likely to be high bias
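One way to turn the train/test comparison into a rule of thumb (the tolerance values below are arbitrary, illustrative choices, not from the course):

```python
def diagnose(train_err, test_err, gap_tol=0.05, good_err=0.05):
    """Heuristic bias/variance diagnosis from training and test error."""
    if test_err - train_err > gap_tol:
        return "high variance"   # very different errors -> likely overfitting
    if test_err > good_err:
        return "high bias"       # similar but high errors -> likely underfitting
    return "good fit"

print(diagnose(0.01, 0.20))   # large gap: high variance
print(diagnose(0.18, 0.19))   # similar, high errors: high bias
```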
        Regression: Least Squares Fitting
▪ Given: data points, functional form,
  find constants in function
▪ Example: given (xi, yi), find line through them;
  i.e., find a and b in y = ax+b
▪ Example: for fitting a line, minimize the sum of squared residuals
      χ² = Σ_i ( y_i − (a·x_i + b) )²
   [Figure: scatter of data points (x1,y1) through (x7,y7) with the fitted line y = ax + b]
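For a line, the minimization has a closed-form solution; a small self-contained Python sketch (standard normal-equation formulas, not taken from the slides):

```python
def fit_line(xs, ys):
    """Closed-form least-squares fit of y = a*x + b."""
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    sxx = sum(x * x for x in xs)
    sxy = sum(x * y for x, y in zip(xs, ys))
    a = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    b = (sy - a * sx) / n
    return a, b

# Points generated from y = 2x + 1 are recovered exactly.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]
print(fit_line(xs, ys))   # (2.0, 1.0)
```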
            Function Approximation
▪ You might do this because you actually care
  about those numbers…
  – Example: measure position of falling object,
    fit parabola
   [Figure: measured position vs. time of a falling object, with the fitted parabola
   p = −(1/2)·g·t²; g is estimated from the fit]
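A sketch of estimating g by least squares for the one-parameter model p = c·t² (so g = −2c); the measurements here are synthetic, noise-free data, not real observations:

```python
def estimate_g(times, positions):
    """Least-squares fit of p = -(1/2)*g*t^2 to estimate g.

    For the one-parameter model p = c*t^2, the least-squares
    coefficient is c = sum(p * t^2) / sum(t^4); then g = -2c.
    """
    num = sum(p * t * t for t, p in zip(times, positions))
    den = sum(t ** 4 for t in times)
    return -2.0 * num / den

# Synthetic measurements generated with g = 9.8 m/s^2.
ts = [0.1, 0.2, 0.3, 0.4]
ps = [-0.5 * 9.8 * t * t for t in ts]
print(estimate_g(ts, ps))
```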
  Supervised vs. Unsupervised Learning
• Supervised learning (classification)            teacher provides
                                                   a category label
   – Supervision: The training data (observations,
     measurements, etc.) are accompanied by labels indicating
     the class of the observations
   – New data is classified based on the training set
• Unsupervised learning (clustering)           “natural groupings”
   – The class labels of training data is unknown
   – Given a set of measurements, observations, etc. with the
     aim of establishing the existence of classes or clusters in
     the data
          Clustering
   [Figure: 3-D scatter of people by income, education, and age, showing natural clusters]
         Recognition or Understanding?
•Which of these images are most scenic?
•How can we develop a system to automatically determine
 scenic beauty?
                   Features Are Confusable
• Regions of overlap represent the classification error.
• Error rates can be computed with knowledge of the joint probability distributions.
• Context is used to reduce overlap (e.g., more features).
• In real problems, features are confusable and represent actual variation in the data.
• The traditional role of the signal processing engineer has been to develop better features.
                     Correlation
• Degrees of difficulty: real data is often much harder.
                   Feature Selection - Example
No! The same separability can be achieved by projecting the patterns onto the
blue axis, i.e., only a one-dimensional feature space is needed.
   [Figure: scatter of males and females by height (cm) vs. weight (kg); the two
   classes remain separable after projection onto a single axis]
               The Design Cycle
       Start
   Collect Data       Key issues:
                        • “There is no data like more data.”
                        • Perceptually-meaningful features?
Choose Features
                        • How do we find the best model?
                        • How do we estimate parameters?
  Choose Model
                        • How do we evaluate performance?
                      Goal of the course:
 Train Classifier       • Introduce you to mathematically
                          rigorous ways to train and evaluate
                          models.
Evaluate Classifier
       End
                 Common Mistakes
• I got 100% accuracy on...
▪ Almost any algorithm works some of the time, but few real-world
problems have ever been completely solved.
▪ Training on the evaluation data is forbidden.
▪ Once you use evaluation data, you should discard it.
• My algorithm is better because...
▪ Statistical significance and experimental design play a big role in
determining the validity of a result.
▪ There is always some probability a random choice of an algorithm will
produce a better result.
• Hence, in this course, we will also learn how to evaluate algorithms.