Welcome to BT4222
Lecturer: Yiliang ZHAO
● PhD in Computer Science
● Director, Head of Data Science, Openspace Ventures (Current)
● Adjunct Faculty, MITB (Artificial Intelligence), SMU (Current)
● J/APAC Machine Learning Practice Lead, Google
● Senior Data Scientist, Shopee
Teaching Assistant: Ta YU
● Ph.D. student (2020 Aug intake) in Information Systems (current)
● Master in MIS, National Chengchi University, Taiwan
○ Decision and Quantitative Analysis Lab
○ Machine Learning - Recommendation system
● Office: IS Research Lab 2 [COM2-01-03]
● E0546019@u.nus.edu
● https://www.linkedin.com/in/yutanccu/
Teaching Assistant: Jingqiao TAO
● Ph.D. student (2020 Aug intake) in Information Systems & Analytics
● Bachelor in MIS, Zhejiang University, China
● Office: IS Research Lab 2 [COM2-01-03]
● tao_jingqiao@u.nus.edu
● https://www.linkedin.com/in/jingqiao-tao-b62812223
Teaching Assistant: Zhang Xinyi
● Ph.D. student (2020 Aug intake) in Information Systems & Analytics
● Bachelor in Financial Management, SCUT
● Master in Business Analytics, HKU
● Office: IS Research Lab 1 [COM2-01-02]
● xinyizhang@u.nus.edu
● https://www.linkedin.com/in/xinyi-zhang-8324b4176/
Ice Breaker
● Tell us about your background
● Tell us what you would like to get out of the course
Some Expectations
● Knowledge sharing instead of teaching
○ Interactive
○ Initiative
○ Innovative
● Tuned towards more industry-focused learning
○ Try to be less theoretical
○ Focus on project/report/presentation
● Ask questions verbally instead of using chat
Agenda
● Introduction to Natural Language Processing
● Introduction to Deep Learning
● Deep learning and NLP
● K-Nearest Neighbour Classifier
● Hands-On
Terms
● Artificial Intelligence: Intelligence exhibited by machines to mimic a human mind
● Machine Learning: Computers being able to learn without hand-coding each step
● Deep Learning: Multi-layered algorithms for learning from data
● Data Science: Methods, processes, and systems to extract insights from data
● Data Analytics: Discovery of meaningful patterns in data
What is what
Goodfellow, Ian, et al. Deep learning. Vol. 1. Cambridge: MIT press, 2016.
Natural Language Processing
What is Natural Language Processing (NLP)?
● Natural Language Processing: a field at the intersection of three disciplines:
○ Computer Science
○ Artificial Intelligence
○ Linguistics
● NLP enables computers to understand and process human languages.
● One definition of AI-complete is perfect language understanding.
Easy NLP Tasks
● Spell Checking
● Keyword Search
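As a flavour of how simple these "easy" tasks can be, here is a minimal spell-checking sketch based on edit distance; the vocabulary and the misspelled word are illustrative, not from the lecture.

```python
# A tiny spell checker: suggest the vocabulary word with the smallest
# Levenshtein (edit) distance to the input.

def edit_distance(a, b):
    """Classic Levenshtein distance via dynamic programming."""
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i in range(len(a) + 1):
        dp[i][0] = i
    for j in range(len(b) + 1):
        dp[0][j] = j
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            dp[i][j] = min(dp[i - 1][j] + 1,         # deletion
                           dp[i][j - 1] + 1,         # insertion
                           dp[i - 1][j - 1] + cost)  # substitution
    return dp[len(a)][len(b)]

def suggest(word, vocabulary):
    """Suggest the closest vocabulary word to a (possibly misspelled) input."""
    return min(vocabulary, key=lambda w: edit_distance(word, w))

vocab = ["language", "learning", "machine", "linguistics"]
print(suggest("machnie", vocab))  # -> machine
```

Real spell checkers add word frequencies and keyboard-aware error models, but the core idea is this distance comparison.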
Medium-Level NLP Tasks
● Named Entity Recognition
● Convert unstructured text into a well-structured document
Medium-Level NLP Tasks
● Topic Classification
● Assign a topic to each document/piece of text
Hard NLP Tasks
● Sentiment Analysis
● Aspect-based sentiment Analysis
● Analyze opinions/sentiment behind text
Hard NLP Tasks
● Machine Translation
● Question Answering
● Visual Question Answering
NLP is very challenging
● AI-complete
● Ambiguity of Language
○ Lexical/Semantic Ambiguity: The fisherman went to the bank.
○ Syntactic/Structural Ambiguity: He watched her paint with enthusiasm.
● Data Variation
○ Images have ImageNet, but text has no labelled dataset of comparable scale
● Complexity in representing, learning, and using linguistic/situational/word/visual knowledge
Some Machine Translation Examples
Cloud Natural Language
● Extract entities
● Detect sentiment
● Analyze syntax
● Classify content
https://cloud.google.com/natural-language/
Machine Learning
Machine Learning
Machine Learning can be decomposed into three components:
● Representation (Model and Data Level)
● Evaluation (Loss Function/ Target Function)
● Optimization: How to search representation to obtain better evaluation
Representation Learning
● Given a task: how to classify these following shapes:
● Our system should work as:
○ Input: Image
○ Representation: Number of corners.
○ Model: Fed with representation and based on mathematical models or rules to make prediction
● Designing features is a complex process, which requires deep domain expertise.
● Deep learning tries to learn the features within the model itself.
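The hand-designed pipeline above can be sketched in a few lines; the shape names and the rule table are illustrative, and the corner count is given directly rather than detected from an image.

```python
# Representation: the number of corners.
# Model: a simple rule table fed with that representation.

def extract_representation(shape_corners):
    # In a real system this step would detect corners in an image;
    # here the corner count is supplied directly for illustration.
    return shape_corners

def classify(num_corners):
    rules = {3: "triangle", 4: "rectangle", 5: "pentagon"}
    return rules.get(num_corners, "unknown")

print(classify(extract_representation(3)))  # -> triangle
print(classify(extract_representation(4)))  # -> rectangle
```

The fragile part is `extract_representation`: designing it by hand is exactly the feature-engineering burden that deep learning tries to remove.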
Deep Learning
Deep Learning
● Deep learning is a subfield of machine learning
● Most machine learning methods work well
because of high-quality feature engineering.
○ SIFT or HOG features for images
○ MFCC or LPC features for speech
○ Features about words parts (suffix, capitalized)
● Optimization in conventional machine learning focuses only on the model level to improve evaluation.
Deep Learning
DL focuses on representation learning instead of feature engineering:
○ Representation learning attempts to automatically learn good features or representation
○ It will learn multiple levels of representation
○ From “raw” inputs x
Deep Learning for Speech
One of the first real-world tasks addressed by deep learning was speech recognition.
Deep Learning for Computer Vision
● Computer vision may be the most well-known breakthrough of DL.
● ImageNet Classification with Deep Convolutional Neural Networks.
ImageNet Scoreboard
Deep Learning For Arts
Style transfer based on Deep Learning: use one image to stylize another.
Deep Learning For Data Generation
Glow, a reversible generative model using invertible 1×1 convolutions, learns a
latent space where certain directions capture attributes such as age, hair color,
and so on (Kingma & Dhariwal, 2018).
Why is Deep Learning Powerful Now?
● Feature engineering requires expert knowledge, and hand-designed features are easily over-specified and incomplete.
● Large amounts of training data
● Modern multi-core CPUs/GPUs/TPUs
● Better deep learning ‘tricks’ such as regularization, optimization, transfer etc.
● Better context modeling due to fewer independence assumptions
● Effective method for end-to-end system optimization.
Deep Learning meets NLP
Deep Learning Meets NLP
● Deep learning methods are used to solve NLP problems with a focus on
representation learning, i.e. better vectors.
● Based on different levels of natural language, DL has achieved several big
improvements:
○ Linguistic Levels: word, syntax
○ Intermediate tasks/tools: entities, parsing, parts-of-speech
○ Full applications: sentiment analysis, machine translation, question answering
Word Vector
Each word is represented as a dense and real-valued vector in a low dimensional
space.
[Figure from He et al., 2014]
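Once words are dense vectors, similarity becomes geometry. A minimal sketch, assuming tiny made-up 3-D vectors (real word vectors, e.g. from word2vec, have hundreds of dimensions):

```python
import math

# Illustrative word vectors; in practice these are learned from data.
vectors = {
    "king":  [0.8, 0.6, 0.1],
    "queen": [0.7, 0.7, 0.2],
    "apple": [0.1, 0.2, 0.9],
}

def cosine(u, v):
    """Cosine similarity: dot product divided by the product of norms."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm

# Related words point in similar directions in the vector space.
print(cosine(vectors["king"], vectors["queen"]) >
      cosine(vectors["king"], vectors["apple"]))  # -> True
```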
Semantic Vector
● Semantic behind sentences/documents
can be encoded as vectors.
● Deep learning is able to do the composition:
○ Every word is a vector
○ A neural network (CNN or RNN) does the composition
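The simplest possible composition, averaging word vectors into a sentence vector, can be sketched as follows; the 2-D vectors are made up for illustration, and real models replace the average with a CNN or RNN.

```python
# Illustrative word vectors in 2 dimensions.
word_vecs = {
    "the":   [0.1, 0.0],
    "movie": [0.4, 0.3],
    "was":   [0.0, 0.1],
    "great": [0.9, 0.8],
}

def sentence_vector(tokens):
    """Compose a sentence vector as the element-wise mean of word vectors."""
    vecs = [word_vecs[t] for t in tokens]
    return [sum(dim) / len(vecs) for dim in zip(*vecs)]

sv = sentence_vector(["the", "movie", "was", "great"])
print([round(v, 2) for v in sv])  # -> [0.35, 0.3]
```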
Sentiment Analysis
● Traditional approaches:
○ Bag-of-words features are fed into classifiers.
○ Sentiment word lists are used, containing positive and negative words.
● Deep learning models
○ Same semantic vector models
○ Word vectors or even char vectors
as input
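The traditional word-list approach can be sketched in a few lines; the word lists below are tiny illustrative samples, not a real sentiment lexicon.

```python
# Score a text by counting positive vs. negative words from a lexicon.
POSITIVE = {"good", "great", "excellent", "love"}
NEGATIVE = {"bad", "terrible", "boring", "hate"}

def sentiment(text):
    tokens = text.lower().split()
    score = sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

print(sentiment("the movie was great and i love it"))  # -> positive
print(sentiment("a boring and terrible plot"))         # -> negative
```

Word counting ignores negation and context ("not great" scores positive), which is one motivation for the semantic-vector models above.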
Question Answering
● Traditional approaches
○ Hand-crafted rules are designed to capture word-level and other knowledge.
○ Regular expressions are used heavily.
● Deep learning approaches:
○ Same semantic vector models
○ Questions and answers are projected
into the same vector space.
[Figure from Tan et al., 2016]
Chatbot
● Traditional approaches:
○ Hand-crafted knowledge bases are used.
○ Cannot address out-of-domain questions.
● Deep learning approaches:
○ Neural language models which can generate language.
Machine Translation
● Traditional approaches:
○ Statistical model (Moses)
○ Very large complex system
● Deep learning approaches:
○ The source sentence is mapped to a vector, then the output sentence is generated from it.
KNN Classifier
Different Learning Methods
● Eager Learning
○ Explicit description of target function on the whole training set
● Instance-based Learning
○ Learning = storing all training instances
○ Classification = assigning a target value to a new instance
○ Referred to as "lazy" learning
K Nearest Neighbour Classifier
● All instances correspond to points in an n-dimensional Euclidean space
● Classification is delayed until a new instance arrives
● Classification is done by comparing the new instance's feature vector with the stored points
● Target function may be discrete or real-valued
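The whole classifier fits in a few lines; the 2-D points and labels below are illustrative, and `k=3` is an arbitrary choice.

```python
import math
from collections import Counter

# "Learning" is just storing the labelled points (lazy learning).
train = [
    ((1.0, 1.0), "A"), ((1.2, 0.8), "A"), ((0.9, 1.1), "A"),
    ((5.0, 5.0), "B"), ((5.2, 4.9), "B"), ((4.8, 5.1), "B"),
]

def knn_predict(x, k=3):
    # All the work happens at query time: sort stored points by
    # Euclidean distance to x and take a majority vote among the k nearest.
    nearest = sorted(train, key=lambda item: math.dist(x, item[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

print(knn_predict((1.1, 0.9)))  # -> A
print(knn_predict((4.9, 5.2)))  # -> B
```

For a real-valued target function the vote becomes an average of the k neighbours' values.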
Summary
● NLP enables interaction between computers and human languages;
● ML = Representation + Loss/Target + Optimization;
● Deep Learning is promising these days given large datasets and faster
computation resources
● Deep Learning has lots of applications in NLP
● KNN is a simple instance-based learning approach