Copyright

© Attribution Non-Commercial (BY-NC)

HIT3002: Introduction to Artificial Intelligence

Learning from Observations

Outline
Learning agents
Inductive learning
Decision tree learning

Swinburne University of Technology

Learning
Learning is essential for unknown environments,
i.e., when the designer lacks omniscience

Learning is useful as a system construction method,
i.e., expose the agent to reality rather than trying to write it down

Learning modifies the agent's decision mechanisms to improve performance

Learning agents


Learning element
Design of a learning element is affected by
Which components of the performance element are to be learned
What feedback is available to learn these components
What representation is used for the components

Type of feedback:
Supervised learning: correct answers for each example
Unsupervised learning: correct answers not given
Reinforcement learning: occasional rewards

Inductive learning
Simplest form: learn a function from examples

f is the target function
An example is a pair (x, f(x))

Problem: find a hypothesis h such that h ≈ f, given a training set of examples

(This is a highly simplified model of real learning: it ignores prior knowledge.)


Inductive learning method

Construct/adjust h to agree with f on training set
(h is consistent if it agrees with f on all examples)
E.g., curve fitting:


Ockham's razor: prefer the simplest hypothesis consistent with the data
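The consistency test and the preference for simple hypotheses can be sketched in a few lines of Python. This is only an illustration: the training set, the three candidate hypotheses, and the use of polynomial degree as the complexity measure are assumptions made for the example, not part of the lecture.

```python
from typing import Callable, List, Tuple

Example = Tuple[float, float]            # a pair (x, f(x))
Hypothesis = Callable[[float], float]


def is_consistent(h: Hypothesis, examples: List[Example], tol: float = 1e-9) -> bool:
    """h is consistent if it agrees with f on every training example."""
    return all(abs(h(x) - y) <= tol for x, y in examples)


# Hypothetical training set, sampled from f(x) = 2x + 1.
training_set = [(0.0, 1.0), (1.0, 3.0), (2.0, 5.0)]

# Hypothetical candidates, each tagged with a crude complexity score
# (the polynomial degree) and a name.
candidates = [
    (0, "constant", lambda x: 3.0),
    (1, "linear", lambda x: 2.0 * x + 1.0),
    # The extra term vanishes at x = 0, 1, 2, so this cubic also fits the data.
    (3, "cubic", lambda x: 2.0 * x + 1.0 + x * (x - 1.0) * (x - 2.0)),
]

# Keep the hypotheses that agree with every example.
consistent = [(degree, name) for degree, name, h in candidates
              if is_consistent(h, training_set)]

# Ockham's razor: among the consistent hypotheses, prefer the lowest degree.
simplest = min(consistent)
print(consistent)
print(simplest)
```

Both the linear and the cubic hypothesis agree with all three examples, so consistency alone cannot separate them; the simplicity preference is what selects the linear one.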


Learning decision trees

Problem: decide whether to wait for a table at a restaurant, based on the following attributes:
1. Alternate: is there an alternative restaurant nearby?
2. Bar: is there a comfortable bar area to wait in?
3. Fri/Sat: is today Friday or Saturday?
4. Hungry: are we hungry?
5. Patrons: number of people in the restaurant (None, Some, Full)
6. Price: price range ($, $$, $$$)
7. Raining: is it raining outside?
8. Reservation: have we made a reservation?
9. Type: kind of restaurant (French, Italian, Thai, Burger)
10. WaitEstimate: estimated waiting time (0-10, 10-30, 30-60, >60)

Attribute-based representations
Examples described by attribute values (Boolean, discrete, continuous)
E.g., situations where I will/won't wait for a table:

Classification of examples is positive (T) or negative (F)


Decision trees
One possible representation for hypotheses
E.g., here is the true tree for deciding whether to wait:

Expressiveness
Decision trees can express any function of the input attributes.
E.g., for Boolean functions, truth table row → path to leaf:

Trivially, there is a consistent decision tree for any training set, with one path to a leaf for each example (unless f is nondeterministic in x), but it probably won't generalize to new examples.
Prefer to find more compact decision trees.
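The "truth table row → path to leaf" construction can be sketched as follows. The tree representation (a Boolean leaf, or a tuple of attribute name, false-branch, and true-branch) and the XOR target function are assumptions chosen for the illustration.

```python
from itertools import product


def tree_from_truth_table(attributes, f):
    """Build a tree with one root-to-leaf path per truth-table row of f."""
    def build(remaining, assignment):
        if not remaining:
            return f(**assignment)       # leaf: value of f for this row
        first, rest = remaining[0], remaining[1:]
        return (first,
                build(rest, {**assignment, first: False}),
                build(rest, {**assignment, first: True}))
    return build(list(attributes), {})


def classify(tree, example):
    """Follow the path selected by the example's attribute values."""
    while isinstance(tree, tuple):
        attribute, if_false, if_true = tree
        tree = if_true if example[attribute] else if_false
    return tree


def xor(A, B):
    """Hypothetical target function used only for this example."""
    return A != B


tree = tree_from_truth_table(["A", "B"], xor)

# The tree reproduces every row of the truth table.
agrees = all(classify(tree, {"A": a, "B": b}) == xor(a, b)
             for a, b in product([False, True], repeat=2))
print(tree)
print(agrees)
```

A tree built this way has one leaf per truth-table row, which is why it can represent any Boolean function but says nothing about inputs it was not built from.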


Hypothesis spaces
How many distinct decision trees with n Boolean attributes?
= number of Boolean functions
= number of distinct truth tables with $2^n$ rows = $2^{2^n}$
E.g., with 6 Boolean attributes, there are 18,446,744,073,709,551,616 trees

How many purely conjunctive hypotheses (e.g., Hungry ∧ ¬Rain)?
Each attribute can be in (positive), in (negative), or out
⇒ $3^n$ distinct conjunctive hypotheses

More expressive hypothesis space
increases chance that target function can be expressed
increases number of hypotheses consistent with training set
⇒ may get worse predictions
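The two hypothesis-space counts given above are easy to check numerically. A minimal sketch follows; the function names are made up for the illustration.

```python
def num_boolean_functions(n: int) -> int:
    """Number of distinct truth tables over n Boolean attributes: 2^(2^n)."""
    return 2 ** (2 ** n)


def num_conjunctive_hypotheses(n: int) -> int:
    """Each attribute is positive, negated, or absent: 3^n."""
    return 3 ** n


print(f"{num_boolean_functions(6):,}")    # 18,446,744,073,709,551,616
print(num_conjunctive_hypotheses(6))      # 729
```

With 6 attributes the conjunctive space has 729 hypotheses against roughly 1.8 × 10^19 Boolean functions, which makes the expressiveness trade-off concrete.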


Decision tree learning

Aim: find a small tree consistent with the training examples
Idea: (recursively) choose "most significant" attribute as root of (sub)tree
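The recursive idea can be sketched in Python as below. This is a sketch under stated assumptions, not the lecture's own code: the tree representation, the five toy examples, and the `pure_count` scoring function are all hypothetical. In particular, `pure_count` is a simple stand-in for "most significant" (it counts examples that land in single-class subsets) and is not an information-theoretic measure.

```python
from collections import Counter


def mode(examples):
    """Most common classification among the examples."""
    return Counter(label for _, label in examples).most_common(1)[0][0]


def pure_count(attribute, examples):
    """Stand-in score: how many examples fall into single-class subsets."""
    by_value = {}
    for features, label in examples:
        by_value.setdefault(features[attribute], []).append(label)
    return sum(len(labels) for labels in by_value.values()
               if len(set(labels)) == 1)


def dtl(examples, attributes, default, choose=pure_count):
    """Return a leaf (a label) or a pair (attribute, {value: subtree})."""
    if not examples:
        return default
    labels = {label for _, label in examples}
    if len(labels) == 1:                 # all examples agree: make a leaf
        return labels.pop()
    if not attributes:                   # nothing left to test: majority vote
        return mode(examples)
    best = max(attributes, key=lambda a: choose(a, examples))
    rest = [a for a in attributes if a != best]
    branches = {}
    # Only values that occur in the examples get a branch in this sketch.
    for value in sorted({features[best] for features, _ in examples}):
        subset = [(f, l) for f, l in examples if f[best] == value]
        branches[value] = dtl(subset, rest, mode(examples), choose)
    return (best, branches)


# Hypothetical toy data (not the 12 restaurant examples from the lecture).
examples = [
    ({"Patrons": "None", "Hungry": False}, False),
    ({"Patrons": "Some", "Hungry": True}, True),
    ({"Patrons": "Some", "Hungry": False}, True),
    ({"Patrons": "Full", "Hungry": True}, True),
    ({"Patrons": "Full", "Hungry": False}, False),
]
tree = dtl(examples, ["Hungry", "Patrons"], default=False)
print(tree)
```

On this toy data the root test is Patrons, and only the Full branch needs a further test on Hungry. Passing a different `choose` function changes the attribute ordering without touching the recursion.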

Choosing an attribute
Idea: a good attribute splits the examples into subsets that are (ideally) "all positive" or "all negative"

Patrons? is a better choice


Using information theory

To implement Choose-Attribute in the DTL algorithm
Information Content (Entropy):

$$I(P(v_1), \ldots, P(v_n)) = \sum_{i=1}^{n} -P(v_i) \log_2 P(v_i)$$

For a training set containing p positive examples and n negative examples:

$$I\left(\frac{p}{p+n}, \frac{n}{p+n}\right) = -\frac{p}{p+n} \log_2 \frac{p}{p+n} - \frac{n}{p+n} \log_2 \frac{n}{p+n}$$
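The entropy definition translates directly into code. In the sketch below the function names are assumptions, and terms with probability 0 are skipped, following the usual convention that 0 · log2 0 is taken as 0.

```python
import math


def information_content(probabilities):
    """I(P(v1), ..., P(vn)) in bits; terms with P = 0 contribute nothing."""
    return sum(-p * math.log2(p) for p in probabilities if p > 0)


def boolean_entropy(p, n):
    """Entropy of a set with p positive and n negative examples."""
    total = p + n
    return information_content([p / total, n / total])


print(boolean_entropy(6, 6))    # evenly split set: 1 bit
print(boolean_entropy(4, 0))    # pure set: 0 bits
print(boolean_entropy(2, 4))    # about 0.918 bits
```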

Information gain
A chosen attribute A divides the training set E into subsets $E_1, \ldots, E_v$ according to their values for A, where A has v distinct values.

$$remainder(A) = \sum_{i=1}^{v} \frac{p_i + n_i}{p + n} \, I\left(\frac{p_i}{p_i + n_i}, \frac{n_i}{p_i + n_i}\right)$$

Information Gain (IG) or reduction in entropy from the attribute test:

$$IG(A) = I\left(\frac{p}{p+n}, \frac{n}{p+n}\right) - remainder(A)$$

Choose the attribute with the largest IG
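Remainder and information gain can be computed from a list of labelled examples, as in the following sketch. The data layout (a feature dictionary paired with a Boolean label) and the four toy examples are assumptions for the illustration.

```python
import math
from collections import Counter, defaultdict


def entropy(labels):
    """Entropy in bits of a list of class labels."""
    total = len(labels)
    return sum(-(c / total) * math.log2(c / total)
               for c in Counter(labels).values())


def information_gain(attribute, examples):
    """IG(A) = entropy of the whole set minus remainder(A)."""
    subsets = defaultdict(list)
    for features, label in examples:
        subsets[features[attribute]].append(label)   # group labels by value of A
    total = len(examples)
    remainder = sum(len(s) / total * entropy(s) for s in subsets.values())
    return entropy([label for _, label in examples]) - remainder


# Hypothetical data: A determines the label, B carries no information.
examples = [
    ({"A": 1, "B": 0}, True),
    ({"A": 1, "B": 1}, True),
    ({"A": 0, "B": 0}, False),
    ({"A": 0, "B": 1}, False),
]
best = max(["A", "B"], key=lambda a: information_gain(a, examples))
print(information_gain("A", examples), information_gain("B", examples), best)
```

Here A removes all uncertainty (a gain of 1 bit) while B removes none, so A is the attribute selected.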


Information gain
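For the training set, p = n = 6, I(6/12, 6/12) = 1 bit
Consider the attributes Patrons and Type (and others too):

$$IG(\mathit{Patrons}) = 1 - \left[\frac{2}{12} I(0,1) + \frac{4}{12} I(1,0) + \frac{6}{12} I\left(\frac{2}{6}, \frac{4}{6}\right)\right] = 0.541 \text{ bits}$$

$$IG(\mathit{Type}) = 1 - \left[\frac{2}{12} I\left(\frac{1}{2}, \frac{1}{2}\right) + \frac{2}{12} I\left(\frac{1}{2}, \frac{1}{2}\right) + \frac{4}{12} I\left(\frac{2}{4}, \frac{2}{4}\right) + \frac{4}{12} I\left(\frac{2}{4}, \frac{2}{4}\right)\right] = 0 \text{ bits}$$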

Patrons has the highest IG of all attributes and so is chosen by the DTL algorithm as the root
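The two gains can be reproduced from the subset counts that appear in the formulas. In this sketch each subset is written as a (positive, negative) pair read off the fractions above; the function names are assumptions, and the Type subsets are listed by size only, since the formulas do not say which restaurant type each one corresponds to.

```python
import math


def I(*probabilities):
    """Information content in bits; zero-probability terms are skipped."""
    return sum(-q * math.log2(q) for q in probabilities if q > 0)


def gain_from_counts(subsets):
    """subsets: one (p_i, n_i) pair per value of the attribute."""
    p = sum(pi for pi, _ in subsets)
    n = sum(ni for _, ni in subsets)
    remainder = sum((pi + ni) / (p + n) * I(pi / (pi + ni), ni / (pi + ni))
                    for pi, ni in subsets)
    return I(p / (p + n), n / (p + n)) - remainder


# Patrons: 2 examples all negative, 4 all positive, 6 split 2 positive / 4 negative.
patrons = [(0, 2), (4, 0), (2, 4)]
# Type: every subset is an even split, so the test removes no uncertainty.
type_ = [(1, 1), (1, 1), (2, 2), (2, 2)]

ig_patrons = gain_from_counts(patrons)
ig_type = gain_from_counts(type_)
print(ig_patrons)   # approximately 0.541
print(ig_type)      # 0, up to floating-point rounding
```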

Example contd.
Decision tree learned from the 12 examples:

Substantially simpler than the true tree; a more complex hypothesis isn't justified by the small amount of data


Performance measurement
How do we know that h ≈ f?
1. Use theorems of computational/statistical learning theory
2. Try h on a new test set of examples
   (use same distribution over example space as training set)

Learning curve = % correct on test set as a function of training set size
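Test-set accuracy and a learning curve can be sketched as follows. Everything concrete here is an assumption made for the illustration: the threshold learner, the synthetic examples, the training-set sizes, and the averaging over random splits. Accuracy is returned as a fraction rather than a percentage.

```python
import random


def accuracy(h, test_set):
    """Fraction of test examples (x, y) on which h(x) == y."""
    return sum(h(x) == y for x, y in test_set) / len(test_set)


def learning_curve(learner, examples, sizes, trials=20, seed=0):
    """Average held-out accuracy for each training-set size."""
    rng = random.Random(seed)
    curve = []
    for m in sizes:                      # each m must be smaller than len(examples)
        scores = []
        for _ in range(trials):
            shuffled = examples[:]
            rng.shuffle(shuffled)
            train, test = shuffled[:m], shuffled[m:]   # disjoint train/test split
            scores.append(accuracy(learner(train), test))
        curve.append((m, sum(scores) / trials))
    return curve


def threshold_learner(train):
    """Hypothetical learner: predict True above the largest negative x seen."""
    negatives = [x for x, y in train if not y]
    cutoff = max(negatives, default=float("-inf"))
    return lambda x: x > cutoff


# Hypothetical data drawn from the target f(x) = (x >= 10).
examples = [(x, x >= 10) for x in range(40)]
curve = learning_curve(threshold_learner, examples, sizes=[2, 10, 30])
for m, score in curve:
    print(m, round(score, 3))
```

Training and test sets come from the same shuffled pool, which matches the requirement that the test set follow the same distribution as the training set.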

Summary
Learning needed for unknown environments, lazy designers
Learning agent = performance element + learning element
For supervised learning, the aim is to find a simple hypothesis approximately consistent with training examples
Decision tree learning using information gain
Learning performance = prediction accuracy measured on test set
