Knowledge Based NLP and Statistical NLP
Each has its place
Knowledge Based NLP: a linguist writes rules; the computer applies them.
Statistical NLP: the computer learns rules/probabilities from a corpus.
"Science without religion is lame; religion without science is blind" – Einstein
NLP = Computation + Linguistics
NLP without Linguistics is blind
and
NLP without Computation is lame
Key difference between Statistical/ML-based NLP and Knowledge-based/linguistics-based NLP
Stat NLP: speed and robustness are the main concerns
KB NLP: phenomena based
Example: for boys, toys, toes, remove "s" to get the root
But how about foxes, boxes, ladies? (see the rule sketch below)
Understand the phenomenon: go deeper
Slower processing
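A minimal sketch, in Python, of the kind of suffix-handling rules a knowledge-based stemmer must encode once the naive "remove s" rule breaks down; the rule set and function name are illustrative only, not an exhaustive stemmer:

```python
# Illustrative suffix-stripping rules for English plurals (not exhaustive).
def strip_plural(word):
    if word.endswith("ies") and len(word) > 4:                # ladies -> lady
        return word[:-3] + "y"
    if word.endswith(("xes", "ses", "zes", "ches", "shes")):  # foxes, boxes -> fox, box
        return word[:-2]
    if word.endswith("s") and not word.endswith("ss"):        # boys, toys, toes -> boy, toy, toe
        return word[:-1]
    return word

print([strip_plural(w) for w in ["boys", "toys", "toes", "foxes", "boxes", "ladies"]])
# ['boy', 'toy', 'toe', 'fox', 'box', 'lady']
```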
Noisy Channel Model
(wn, wn-1, … , w1)  →  Noisy Channel  →  (tm, tm-1, … , t1)
Sequence w is transformed into sequence t.
Bayesian Decision Theory and the Noisy Channel Model are close to each other.
Bayes' Theorem: given the random variables A and B,
P(A|B) = P(A) P(B|A) / P(B)
P(A|B): posterior probability
P(A): prior probability
P(B|A): likelihood
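As a quick worked illustration of the formula (the numbers below are invented for illustration only):

```python
# Toy numbers: P(A) = 0.01, P(B|A) = 0.9, P(B) = 0.05
prior = 0.01          # P(A)
likelihood = 0.9      # P(B|A)
evidence = 0.05       # P(B)

posterior = prior * likelihood / evidence   # P(A|B) = P(A) P(B|A) / P(B)
print(posterior)                            # 0.18
```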
Discriminative vs. Generative Model
W* = argmax_W P(W|SS)
Discriminative model: compute directly from P(W|SS)
Generative model: compute from P(W).P(SS|W)
Corpus
A collection of text, called a corpus, is used for collecting various language data.
With annotation: more information, but manual-labor intensive
Practice: label automatically, then correct manually
The famous Brown Corpus contains 1 million tagged words.
Switchboard: a very famous corpus with 2400 conversations, 543 speakers, many US dialects, annotated with orthography and phonetics
Example-1 of Application of Noisy Channel Model: Probabilistic Speech Recognition (Isolated Word) [8]
Problem Definition: Given a sequence of speech signals, identify the words.
2 steps:
Segmentation (Word Boundary Detection)
Identify the word
Isolated Word Recognition:
Identify W given SS (speech signal)
Ŵ = argmax_W P(W|SS)
Identifying the word
Ŵ = argmax_W P(W|SS)
  = argmax_W P(W) P(SS|W)
P(SS|W) = likelihood, called the "phonological model"; intuitively more tractable!
P(W) = prior probability, called the "language model"
P(W) = #(times W appears in the corpus) / #(words in the corpus)
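A small illustrative sketch of estimating the language-model prior P(W) from corpus counts and combining it with a phonological-model likelihood in the argmax; the toy corpus and the likelihood table are invented, not from the lecture:

```python
from collections import Counter

# Toy corpus (invented); in practice this would be a large corpus such as Brown.
corpus = "call me tomorrow call the office tomorrow".split()
counts = Counter(corpus)
total = len(corpus)

def prior(w):
    # P(W) = #(times W appears in the corpus) / #(words in the corpus)
    return counts[w] / total

# Hypothetical phonological-model scores P(SS|W) for one observed signal SS.
likelihood = {"call": 0.6, "tall": 0.3, "hall": 0.1}

# W* = argmax_W P(W) * P(SS|W); unseen words get prior 0 here (no smoothing).
best = max(likelihood, key=lambda w: prior(w) * likelihood[w])
print(best)   # "call"
```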
Pronunciation Dictionary
Pronunciation automaton for the word "Tomato" (states s1 … s7):
t (1.0) → o (1.0) → m (1.0) → branch: ae (0.73, upper path via s4) or aa (0.27, lower path via s5) → t (1.0) → o (1.0) → end
P(SS|W) is maintained in this way.
P(t o m ae t o | Word is "tomato") = product of arc probabilities
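A minimal sketch of the arc-probability product for the automaton above; the helper function is illustrative, and only the 0.73/0.27 branch probabilities come from the slide:

```python
from functools import reduce

def path_probability(phones, branch_prob):
    # Every arc of the "tomato" automaton has probability 1.0
    # except the ae/aa branch, whose probabilities are given in branch_prob.
    probs = [branch_prob.get(p, 1.0) for p in phones]
    return reduce(lambda a, b: a * b, probs, 1.0)

branch_prob = {"ae": 0.73, "aa": 0.27}

print(path_probability(["t", "o", "m", "ae", "t", "o"], branch_prob))  # 0.73
print(path_probability(["t", "o", "m", "aa", "t", "o"], branch_prob))  # 0.27
```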
Example Problem-2
Analyse the sentiment of the text: positive or negative polarity
Challenges:
Unclean corpora
Thwarted expression: "The movie has everything: cast, drama, scene, photography, story; the director has managed to make a mess of all this."
Sarcasm: "The movie has everything: cast, drama, scene, photography, story; see at your own risk."
Sentiment Classification
Positive, negative, neutral – 3 classes
Create a representation for the document
Classify the representation
The most popular way of representing a document is as a feature vector (indicator sequence), as sketched below.
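A minimal sketch of such an indicator (feature) vector over a fixed vocabulary; the vocabulary and example document are invented for illustration:

```python
# Fixed feature vocabulary (illustrative); 1 if the token occurs in the document, else 0.
vocabulary = ["good", "bad", "excellent", "boring", "mess"]

def indicator_vector(document, vocab=vocabulary):
    tokens = set(document.lower().split())
    return [1 if term in tokens else 0 for term in vocab]

print(indicator_vector("The movie was good but the ending was boring"))
# [1, 0, 0, 1, 0]
```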
Established Techniques
Naïve Bayes Classifier (NBC)
Support Vector Machines (SVM)
Neural Networks
K nearest neighbor classifier
Latent Semantic Indexing
Decision Tree ID3
Concept based indexing
Successful Approaches
The following are successful approaches as reported in the literature:
NBC – simple to understand and implement
SVM – complex, requires foundations of perceptrons
Mathematical Setting
Indicator/feature vectors to be formed
We have a training set:
A: Positive Sentiment Docs
B: Negative Sentiment Docs
Let the classes of positive and negative documents be C+ and C-, respectively.
Given a new document D, label it positive if P(C+|D) > P(C-|D)
Prior Probability
Document   Vector   Classification
D1         V1       +
D2         V2       -
D3         V3       +
..         ..       ..
D4000      V4000    -
Let T = total no. of documents, and let |+| = M, so |-| = T - M.
P(D being positive) = M/T
Prior probability is calculated without considering any features of the new document.
Apply Bayes Theorem
Steps followed for the NBC algorithm:
Calculate the prior probabilities of the classes, P(C+) and P(C-).
Calculate the feature probabilities of the new document, P(D|C+) and P(D|C-).
The probability of a document D belonging to a class C can be calculated by Bayes' Theorem as follows:
P(C|D) = P(C) * P(D|C) / P(D)
Document belongs to C+ if
P(C+) * P(D|C+) > P(C-) * P(D|C-)
Calculating P(D|C+)
P(D|C+) is the probability of document D given class C+. This is calculated as follows:
Identify a set of features/indicators to evaluate a document and generate a feature vector VD = <x1, x2, x3, …, xn>
Hence, P(D|C+) = P(VD|C+)
= P(<x1, x2, x3, …, xn> | C+)
= |<x1, x2, x3, …, xn>, C+| / |C+|
Based on the assumption that all features are independently and identically distributed (i.i.d.):
P(<x1, x2, x3, …, xn> | C+)
= P(x1|C+) * P(x2|C+) * P(x3|C+) * … * P(xn|C+)
= ∏ i=1..n P(xi|C+)
P(xi|C+) can now be calculated as |xi, C+| / |C+|, i.e., the number of C+ documents in which feature xi occurs divided by the number of C+ documents.
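Putting the NBC steps together, a compact illustrative sketch: priors from class counts, P(xi|C) from feature counts within each class (with add-one smoothing, an assumption made here so unseen features do not zero out the product), and the comparison of P(C+) * P(D|C+) against P(C-) * P(D|C-). The toy training documents are invented:

```python
from collections import Counter

# Toy training data (invented): (tokens, label) pairs.
train = [
    ("a great and moving film".split(), "+"),
    ("brilliant cast and story".split(), "+"),
    ("a boring mess of a film".split(), "-"),
    ("dull story and weak cast".split(), "-"),
]

labels = [y for _, y in train]
prior = {c: labels.count(c) / len(labels) for c in ("+", "-")}   # P(C)

vocab = {w for doc, _ in train for w in doc}
feat_counts = {c: Counter(w for doc, y in train if y == c for w in doc) for c in ("+", "-")}
totals = {c: sum(feat_counts[c].values()) for c in ("+", "-")}

def likelihood(word, c):
    # P(xi|C) with add-one smoothing (an assumption of this sketch).
    return (feat_counts[c][word] + 1) / (totals[c] + len(vocab))

def classify(tokens):
    scores = {}
    for c in ("+", "-"):
        score = prior[c]
        for w in tokens:
            score *= likelihood(w, c)        # naive (i.i.d.) feature assumption
        scores[c] = score
    return max(scores, key=scores.get)       # label with larger P(C) * P(D|C)

print(classify("a brilliant and moving story".split()))   # "+"
```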
Baseline Accuracy
Using just tokens as features: 80% accuracy
That is a 20% probability of a document being misclassified
On large sets this is significant
To improve accuracy…
Clean the corpora
POS-tag the text
Concentrate on critical POS tags (e.g., adjectives), as in the sketch after this list
Remove 'objective' (non-opinion-bearing) sentences
Do aggregation
Use minimal to sophisticated NLP
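Concentrating on critical POS tags can be sketched with NLTK's off-the-shelf tagger (this assumes nltk is installed along with its tagger model; the example sentence is invented):

```python
import nltk  # assumes nltk and its part-of-speech tagger model are available

sentence = "The direction is brilliant but the plot is painfully dull"
tagged = nltk.pos_tag(sentence.split())            # [(word, POS tag), ...]

# Keep only adjectives (Penn Treebank tags JJ, JJR, JJS) as sentiment-bearing features.
adjectives = [w for w, tag in tagged if tag.startswith("JJ")]
print(adjectives)   # e.g. ['brilliant', 'dull']
```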
Allied Disciplines
Philosophy: Semantics, meaning of "meaning", logic (syllogism)
Linguistics: Study of syntax, lexicon, lexical semantics, etc.
Probability and Statistics: Corpus linguistics, testing of hypotheses, system evaluation
Cognitive Science: Computational models of language processing, language acquisition
Psychology: Behavioristic insights into language processing, psychological models
Brain Science: Language processing areas in the brain
Physics: Information theory, entropy, random fields
Computer Sc. & Engg.: Systems for NLP