Introduction

COMP 3361 is a Natural Language Processing course for Spring 2024, taught by Instructor Tao Yu, with classes held on Tuesdays and Fridays. The course covers core NLP techniques, large language models, and their applications, with a grading structure that includes assignments, a project, an exam, and participation. Communication will primarily occur via Slack, and students are encouraged to use AI tools like ChatGPT while adhering to academic integrity guidelines.

COMP 3361

Natural Language Processing

Spring 2024

Logistics
● Location: KB 132
● Meetings: Tuesday 9:30 am - 10:20 am and Friday 9:30 am - 11:20 am
● Instructor: Tao Yu (https://taoyds.github.io/)
● Office hours: Wednesday 4 - 5 pm @IDS

Logistics
Course website: https://taoyds.github.io/courses/comp3361

● We will maintain the website for the schedule, lecture slides, reading lists, grading policies, etc.
● Submit your reports on Moodle only.

Logistics
Slack: https://join.slack.com/t/slack-fdv4728/shared_invite/zt-2asgddr0h-6wIXbRndwKhBw2IX2~ZrJQ

● We will use Slack as the primary mode of communication. DM me on Slack instead of emailing.
○ We will answer any questions about lectures, assignments, grading, and so on.
○ Share random thoughts, highlight interesting papers, and brag about cool findings there.

● Join Slack via the invitation link above.

Course prerequisites
● COMP3314 or COMP3340; and MATH1853
● Familiarity with deep learning and machine learning
● Familiarity with Python programming
● Helpful: exposure to AI assistants such as ChatGPT

Course goals
● Understand core techniques and modern advances in NLP, especially in the era of large language models.

● Design, implement, and test NLP systems based on large language models.

Components and grading
● Assignments: 40%
○ ~2 assignments, 20% for each
● Course project: 30%
○ More guidelines will be announced soon
● In-class exam: 25%

● Class participation: 5%

Policy on ChatGPT, Copilot, and other AI assistants
● This course emphasizes understanding the capabilities and limitations of these AI systems, and there's no better way to do that than by using them! Collaboration with these systems is allowed; treat them as collaborators in the problem-solving process. However, using them to substantially complete assignments will be considered a violation of the Honor Code.

Class readings

● Readings from textbook chapters, blogs, tutorials, and papers will be posted on the course website.

● You may find it useful to do these readings before lecture as preparation or after lecture to review, but you are not expected to know everything discussed in the textbook if it isn't covered in lecture.

● Paper readings are intended to supplement the course material if you are interested in diving deeper into particular topics.

Topics and Schedule (Tentative)
● Introduction and NLP model basics
● Large language models (LLMs)
● NLP applications
● Advanced LLM topics

Introduction and NLP model basics
● Word embeddings
● Text Classification and Language Modeling
● Sequence-to-Sequence, Attention, Transformers

Large language models (LLMs)
● LLM pretraining
● LLM Prompting, in-context learning
● LLM evaluation, data, and benchmarking
● Instruction tuning for LLMs
● LLM alignment/RLHF

NLP applications
● Question answering, reasoning
● Text generation
● Semantic parsing, code generation
● LM agent, language grounding

Advanced LLM topics
● Robustness, interpretability, explainability of LLMs
● Bias, toxicity, and privacy in LLMs
● Parameter-efficient LM tuning
● Efficient LLM methods and Infrastructure
● Multimodal LM, language in robotics, and embodied interaction

What is NLP? Wait, what is language?
● Language is the abstraction of the real world!
● Natural Language Processing (NLP) aims to teach computers human languages from a computational perspective.

About NLP: teaching computers human languages

● NLP in real world applications
○ Q&A / IR - Google search

About NLP: teaching computers human languages

● NLP in real world applications
○ Q&A / IR - Google search

Input x: "When was HKU founded?"  →  AI brain/model: f(x)  →  Output y: "March 30, 1911"
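
Viewed this way, an NLP system is just a function f that maps an input string x to an output string y. The snippet below is a minimal, hypothetical sketch of that abstraction; the answer_question function and its lookup table are stand-ins for whatever trained model actually implements f.

    # Hypothetical sketch of the "AI brain/model: f(x)" abstraction.
    # A real system would implement f with a trained model (e.g. an LLM),
    # not a hand-written lookup table.

    def answer_question(x: str) -> str:
        """f(x): map an input question x to an output answer y."""
        known_answers = {
            "When was HKU founded?": "March 30, 1911",
        }
        # Fall back to a canned response when this toy "model" has no answer.
        return known_answers.get(x, "I don't know.")

    print(answer_question("When was HKU founded?"))  # -> March 30, 1911
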
About NLP: teaching computers human languages

● NLP in real world applications
○ Q&A / IR - Google search
○ Dialogs - Apple Siri / Amazon Alexa

About NLP: teaching computers human languages

● NLP in real world applications
○ Q&A / IR - Google search
○ Dialogs - Apple Siri / Amazon Alexa
○ Grammar checking (Grammarly), summarization, sentiment analysis …

What can ChatGPT do?

https://beta.openai.com/examples/
Q&A example with ChatGPT

https://beta.openai.com/examples/
More examples with ChatGPT

https://beta.openai.com/examples/
Examples with ChatGPT

https://beta.openai.com/examples/
New learning paradigm: in-context learning

Few-shot in-context learning

● Few-shot: in addition to the task description, the model sees a few examples of the task.

● No fine-tuning: GPT-3 does not update its parameters!

[Figure: a few-shot prompt consists of a task description, task examples, and a task prompt; GPT-3 generates the output by conditioning on this context.]
Source: Language Models are Few-Shot Learners (Brown et al., 2020)
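
To make this concrete, here is a minimal sketch of how such a few-shot prompt can be assembled as plain text: a task description, a few solved examples, and a final task prompt that the model completes. The build_few_shot_prompt helper is illustrative (the translation pairs follow the English-to-French figure in the GPT-3 paper); the assembled string would be sent to an LLM through whatever API you use, with no parameter updates involved.

    # Hypothetical sketch: assembling a few-shot in-context-learning prompt as plain text.
    # The model is never fine-tuned; it only conditions on this prompt at inference time.

    def build_few_shot_prompt(description, examples, query):
        lines = [description, ""]
        for source, target in examples:      # task examples shown in context
            lines.append(source + " => " + target)
        lines.append(query + " =>")          # task prompt for the model to complete
        return "\n".join(lines)

    prompt = build_few_shot_prompt(
        description="Translate English to French.",
        examples=[("sea otter", "loutre de mer"), ("cheese", "fromage")],
        query="peppermint",
    )
    print(prompt)
    # The assembled string is passed to an LLM (e.g. GPT-3); the model's completion
    # ("menthe poivrée") is the prediction, obtained without any weight updates.
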
About NLP: teaching computers human languages

● NLP in real world applications
○ Q&A / IR - Google search
○ Dialogs - Apple Siri / Amazon Alexa
○ Grammar checking (Grammarly), summarization, sentiment analysis …
○ Text to images: image creation from a text description - OpenAI’s DALLE-2

DALLE-2 demo: text to images

https://openai.com/dall-e-2/
DALLE-2: text to images

https://openai.com/dall-e-2/
Language models are powerful, but they still suffer from
● Lack of interpretability
● Inconsistency
● Limited scalability
● Restricted capabilities
● …

