0% found this document useful (0 votes)

86 views76 pages

Intro to Computer Vision Course

This document provides an overview of an introductory computer vision course. It discusses the instructor, course topics including low-level vision, geometry, recognition, and light/color. It also outlines course requirements, projects, and grading.

Uploaded by

nandhinigirijavks

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

86 views76 pages

Intro to Computer Vision Course

Uploaded by

nandhinigirijavks

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 76

CS5670: Intro to Computer Vision

Instructor: Noah Snavely

Instructor
• Noah Snavely (snavely@cs.cornell.edu)

• Research interests:
– Computer vision and graphics
– 3D reconstruction and visualization of Internet
photo collections
– Deep learning for computer graphics
– Virtual reality video
Today
1. What is computer vision?

2. Course overview

3. Image filtering
Today
• Readings
– Szeliski, Chapter 1 (Introduction)
Every image tells a story
• Goal of computer vision:
perceive the “story”
behind the picture
• Compute properties of
the world
– 3D shape
– Names of people or
objects
– What happened?
The goal of computer vision
Can the computer match human
perception?
• Yes and no (mainly no)
– computers can be better at
“easy” things
– humans are much better at
“hard” things

• But huge progress has

been made
– Accelerating in the last 4
years due to deep learning
– What is considered “hard”
keeps changing
Human perception has its
shortcomings

Sinha and Poggio, Nature, 1996

But humans can tell a lot about a
scene from a little information…

Source: “80 million tiny images” by Torralba, et al.

The goal of computer vision
The goal of computer vision
• Compute the 3D shape of the world
The goal of computer vision
• Recognize objects and people

Terminator 2, 1991
slide credit: Fei-Fei, Fergus & Torralba
sky
building

flag

face
banner
wall
street lamp
bus bus

cars slide credit: Fei-Fei, Fergus & Torralba

The goal of computer vision
• “Enhance” images
The goal of computer vision
• Forensics

Source: Nayar and Nishino, “Eyes for Relighting”

Source: Nayar and Nishino, “Eyes for Relighting”
Source: Nayar and Nishino, “Eyes for Relighting”
The goal of computer vision
• Improve photos (“Computational Photography”)

Low-light photography (credit: Hasinoff et al., SIGGRAPH ASIA 2016)

Super-resolution (source: 2d3)

Inpainting / image completion (image credit: Hays and Efros)

Why study computer vision?
• Billions of images/videos captured per day

• Huge number of useful applications

• The next slides show the current state of the art
Optical character recognition (OCR)
• If you have a scanner, it probably came with OCR software

Digit recognition, AT&T labs License plate readers

http://en.wikipedia.org/wiki/Automatic_number_plate_recognition
http://www.research.att.com/~yann/

Sudoku grabber
http://sudokugrab.blogspot.com/

Source: S. Seitz
Automatic check processing
Face detection

• Nearly all cameras detect faces in real time

– (Why?)
Face Recognition
Face recognition

Who is she? Source: S. Seitz

Vision-based biometrics

“How the Afghan Girl was Identified by Her Iris Patterns” Read the story

Source: S. Seitz
Leaf Recognition
Bird Identification

Merlin Bird ID (based on Cornell Tech technology!)

Special effects: camera tracking

Boujou, 2d3
Special effects: shape capture

The Matrix movies, ESC Entertainment, XYZRGB, NRC

Source: S. Seitz
Special effects: motion capture

Pirates of the Carribean, Industrial Light and Magic Source: S. Seitz

3D face tracking w/ consumer cameras

Snapchat Lenses

Face2Face system (Thies et al.)

Sports

Sportvision first down line

Nice explanation on www.howstuffworks.com

Source: S. Seitz
Vision-based interaction (and games)

Assistive technologies

Nintendo Wii has camera-based IR

tracking built in. See Lee’s work at
CMU on clever tricks on using it to
create a multi-touch display!
Kinect
Smart cars

• Mobileye
• Tesla Autopilot
• Safety features in many high-end cars
Self-driving cars

Google Waymo
Robotics

NASA’s Mars Curiosity Rover Amazon Picking Challenge

https://en.wikipedia.org/wiki/Curiosity_(rover) http://www.robocup2016.org/en/events
/amazon-picking-challenge/

Amazon Prime Air

Medical imaging

Image guided surgery

3D imaging
Grimson et al., MIT
MRI, CT

Source: S. Seitz
Virtual & Augmented Reality

6DoF head tracking Hand & body tracking

3D scene understanding 3D-360 video capture

My own work
• Automatic 3D reconstruction from Internet
photo collections
“Statue of Liberty” “Half Dome, Yosemite” “Colosseum, Rome”

Flickr photos

3D model
Photosynth
City-scale reconstruction

Reconstruction of Dubrovnik, Croatia, from ~40,000 images

Current state of the art
• You just saw examples of current systems.
– Most of these are less than 5 years old

• This is a very active research area, and rapidly

changing
– Many new apps in the next 5 years

• To learn more about vision applications and

companies
– David Lowe maintains an excellent overview of vision
companies
• http://www.cs.ubc.ca/spider/lowe/vision.html
Why is computer vision difficult?

Viewpoint variation

Scale
Illumination
Why is computer vision difficult?

Motion (Source: S. Lazebnik)

Intra-class variation

Background clutter Occlusion

Challenges: local ambiguity

slide credit: Fei-Fei, Fergus & Torralba

But there are lots of cues we can exploit…

Source: S. Lazebnik
Bottom line
• Perception is an inherently ambiguous problem
– Many different 3D scenes could have given rise to a
particular 2D picture

– We often need to use prior knowledge about the

structure of the world
Image source: F. Durand
CS5670: Introduction to Computer
VIsion
Teaching Assistant
• Zhengqi Li
(zl548@cornell.edu)

• Office hours:
When: TuTh 3:30 – 5pm
Where: Bear Hug
(starting next week)
Important notes
• Textbook:
Rick Szeliski, Computer Vision: Algorithms and
Applications
online at: http://szeliski.org/Book/

• Course webpage:
http://www.cs.cornell.edu/courses/cs5670/2017sp/

• Announcements/grades via Piazza/CMS

https://piazza.com/class#fall2013/cs46705670
https://cms.csuglab.cornell.edu/
Course requirements
• Prerequisites—these are essential!
– Data structures
– A good working knowledge of Python programming
– Linear algebra
– Vector calculus

• Course does not assume prior imaging experience

– computer vision, image processing, graphics, etc.
Course overview (tentative)
1. Low-level vision
– image processing, edge detection,
feature detection, cameras, image
formation
2. Geometry and algorithms
– projective geometry, stereo,
structure from motion, Markov
random fields
3. Recognition
– face detection / recognition,
category recognition, segmentation
4. Light, color, and reflectance
1. Low-level vision
• Basic image processing and image formation

* =
Filtering, edge detection

Feature extraction Image formation

Project: Hybrid images from image
pyramids

G 1/8

G 1/4

Gaussian 1/2
Project: Feature detection and matching
2. Geometry

Projective geometry
Stereo

Multi-view stereo Structure from motion

Project: Creating panoramas
Project: Photometric Stereo
3. Recognition

Face detection and recognition

Single instance recognition

Category recognition
Sources: D. Lowe, L. Fei-Fei
Project: Deep Learning for Recognition
4. Light, color, and reflectance

Light & Color Reflectance

Grading
• Occasional quizzes (at the beginning of class)
• One prelim, one final exam
– (considering final project instead of exam)

• Rough grade breakdown:

– Quizzes + class evaluation: ~5%
– Midterm: 15-20%
– Programming projects: 40-50%
– Final exam: 15-20%
Late policy

• Three free “slip days” will be available for the

semester

• Late projects will be penalized by 5% for first

late day, and 10% for each day it is late after,
and no extra credit will be awarded.
Academic Integrity
• Assignments will be done solo or in pairs (we’ll
let you know for each project)

• Please do not leave any code public on GitHub

(or the like) at the end of the semester!

• Please see the Cornell Code of Academic

Integrity (http://cuinfo.cornell.edu/aic.cfm)
Questions?

Lec00 Intro For Web
No ratings yet
Lec00 Intro For Web
81 pages
CV #1 Course Introduction-1
No ratings yet
CV #1 Course Introduction-1
61 pages
Lec00 Intro For Web Highlighted
No ratings yet
Lec00 Intro For Web Highlighted
72 pages
Computer Vision ch1
No ratings yet
Computer Vision ch1
80 pages
1 Vision Lec 1
No ratings yet
1 Vision Lec 1
49 pages
Ilovepdf Merged Compressed
No ratings yet
Ilovepdf Merged Compressed
1,100 pages
Computer Vision
No ratings yet
Computer Vision
52 pages
Computer Vision 2011
100% (1)
Computer Vision 2011
103 pages
1 Intro
No ratings yet
1 Intro
103 pages
Unit 1
No ratings yet
Unit 1
186 pages
Lec01 CT Intro
No ratings yet
Lec01 CT Intro
61 pages
Lec01 Intro
No ratings yet
Lec01 Intro
55 pages
1 Intro Visión Artificial
No ratings yet
1 Intro Visión Artificial
50 pages
CS436 CS5310 EE513 L01 Introduction
No ratings yet
CS436 CS5310 EE513 L01 Introduction
54 pages
Prerequisites: What Is Computer Vision? Vision For Measurement
No ratings yet
Prerequisites: What Is Computer Vision? Vision For Measurement
8 pages
CV Overview
No ratings yet
CV Overview
83 pages
CS7.505: Computer Vision: Spring 2022
No ratings yet
CS7.505: Computer Vision: Spring 2022
46 pages
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
No ratings yet
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
61 pages
Chapter 1 - Introduction To CV
No ratings yet
Chapter 1 - Introduction To CV
49 pages
Lecture 1
100% (1)
Lecture 1
21 pages
Lec00 Intro Computervision
No ratings yet
Lec00 Intro Computervision
58 pages
Lecture-1 CV
No ratings yet
Lecture-1 CV
18 pages
LectureNotes PDF
No ratings yet
LectureNotes PDF
212 pages
CompVisNotes PDF
No ratings yet
CompVisNotes PDF
115 pages
CV Module 1
No ratings yet
CV Module 1
166 pages
00CV Intro Full
No ratings yet
00CV Intro Full
58 pages
18cse390t U1 s1 Slo1 Content
No ratings yet
18cse390t U1 s1 Slo1 Content
15 pages
CV-1 1
No ratings yet
CV-1 1
18 pages
CV Unit 1 Overview of Computer Vison and Application
No ratings yet
CV Unit 1 Overview of Computer Vison and Application
51 pages
UNIT-I - Introduction To Computer Vision
No ratings yet
UNIT-I - Introduction To Computer Vision
45 pages
Lect1 PDF
100% (1)
Lect1 PDF
45 pages
Computer Vision
100% (1)
Computer Vision
48 pages
FALLSEM2025-26 VL BCSE407L 00100 TH 2025-07-31 Introduction
No ratings yet
FALLSEM2025-26 VL BCSE407L 00100 TH 2025-07-31 Introduction
37 pages
PDF Joiner
No ratings yet
PDF Joiner
38 pages
Computer Vision SM-1
No ratings yet
Computer Vision SM-1
26 pages
Computer Vision for Beginners
No ratings yet
Computer Vision for Beginners
26 pages
01 - Introduction
No ratings yet
01 - Introduction
37 pages
803 (A) Image Processing and Computer Vision#: Subject In-Charge: Prof Shilpa Sharma
No ratings yet
803 (A) Image Processing and Computer Vision#: Subject In-Charge: Prof Shilpa Sharma
44 pages
1 Sirg Bsu - 1
No ratings yet
1 Sirg Bsu - 1
46 pages
Computer Vision and Artificial Intelligence
No ratings yet
Computer Vision and Artificial Intelligence
55 pages
Overview of Computer Vision: CS491E/791E
No ratings yet
Overview of Computer Vision: CS491E/791E
55 pages
Computer Vision & Eye Tracking Report
No ratings yet
Computer Vision & Eye Tracking Report
31 pages
T2310 TDS3651 L01 Introduction
No ratings yet
T2310 TDS3651 L01 Introduction
73 pages
CV - Lecture 1 - Iintroduction
No ratings yet
CV - Lecture 1 - Iintroduction
24 pages
Unit 5 Introduction Robot Vision
No ratings yet
Unit 5 Introduction Robot Vision
60 pages
1a. Introduction
No ratings yet
1a. Introduction
32 pages
IT5409 Ch1 Intro New Template
No ratings yet
IT5409 Ch1 Intro New Template
14 pages
Lec 01 CompVision N DIP Intro
No ratings yet
Lec 01 CompVision N DIP Intro
91 pages
ComputerVision Intro
No ratings yet
ComputerVision Intro
50 pages
CV Lecture 1
No ratings yet
CV Lecture 1
65 pages
Lecture 01 Introduction
No ratings yet
Lecture 01 Introduction
61 pages
Intro
No ratings yet
Intro
66 pages
Computer Vision Assignment
No ratings yet
Computer Vision Assignment
10 pages
Computer Vision Research Document
No ratings yet
Computer Vision Research Document
3 pages
C280 Computer Vision C280, Computer Vision: Prof. Trevor Darrell
No ratings yet
C280 Computer Vision C280, Computer Vision: Prof. Trevor Darrell
68 pages
1907 11737
No ratings yet
1907 11737
16 pages
FULLTEXT01
No ratings yet
FULLTEXT01
244 pages
Challenging Programming in Python
No ratings yet
Challenging Programming in Python
287 pages
Introduction To Compressive Sensing 2.5
100% (1)
Introduction To Compressive Sensing 2.5
118 pages
Curriculum and Syllabi: M. Tech
No ratings yet
Curriculum and Syllabi: M. Tech
40 pages
Dependency Management in A Large Agile Environment
No ratings yet
Dependency Management in A Large Agile Environment
6 pages
Super Coolant AF-NAC - 50522
0% (1)
Super Coolant AF-NAC - 50522
6 pages
ACCT205 - Portfolio Project Directions and Rubrics
No ratings yet
ACCT205 - Portfolio Project Directions and Rubrics
10 pages
MUPROSPECTUS2023
No ratings yet
MUPROSPECTUS2023
50 pages
Form 2 English April Holiday Assignment
No ratings yet
Form 2 English April Holiday Assignment
4 pages
Steel Structure Design Resources
No ratings yet
Steel Structure Design Resources
1 page
Pharmaceutical Packaging Materials
No ratings yet
Pharmaceutical Packaging Materials
3 pages
Elements Facts at Your Fingertips Pocket Eyewitness DK Instant Download
100% (2)
Elements Facts at Your Fingertips Pocket Eyewitness DK Instant Download
55 pages
Co1 Science 1ST Quarter
No ratings yet
Co1 Science 1ST Quarter
4 pages
2nd Mid Term 2023 X KEY
No ratings yet
2nd Mid Term 2023 X KEY
12 pages
Consonant Gradation PDF
No ratings yet
Consonant Gradation PDF
10 pages
UAV Evolution at Northrop Grumman
No ratings yet
UAV Evolution at Northrop Grumman
50 pages
Class X Science Exam Paper
No ratings yet
Class X Science Exam Paper
8 pages
Polycure AC
No ratings yet
Polycure AC
71 pages
Effectiveness of Conservative Interventions After Acute Hamstrings Injuries in Athletes: A Living Systematic Review
No ratings yet
Effectiveness of Conservative Interventions After Acute Hamstrings Injuries in Athletes: A Living Systematic Review
21 pages
English Project Ok
No ratings yet
English Project Ok
19 pages
Xii Physical Education Practical
No ratings yet
Xii Physical Education Practical
65 pages
ATX-UK Exam Guidance 2024-2025
No ratings yet
ATX-UK Exam Guidance 2024-2025
7 pages
ALL MCQ QUESTIONS & ANSWERS RELATED TO NCC 'A' CERTIFICATE EXAMINATION - 062416.hi - en
No ratings yet
ALL MCQ QUESTIONS & ANSWERS RELATED TO NCC 'A' CERTIFICATE EXAMINATION - 062416.hi - en
86 pages
Sage Intelligence Reporting - Beginner Training Manual
83% (6)
Sage Intelligence Reporting - Beginner Training Manual
48 pages
21h2z7b4 FBM207
No ratings yet
21h2z7b4 FBM207
12 pages
Sayan Exam Form - Signed-1
No ratings yet
Sayan Exam Form - Signed-1
1 page
AP Physics 1 - Student Workbook
No ratings yet
AP Physics 1 - Student Workbook
358 pages
Key Partner Types: Pharm-Bio Technology and Traditional Medicine Centre (PHARMBIOTRAC)
No ratings yet
Key Partner Types: Pharm-Bio Technology and Traditional Medicine Centre (PHARMBIOTRAC)
2 pages
Log Cabin House UZES 70m² Sale
No ratings yet
Log Cabin House UZES 70m² Sale
7 pages
New Microsoft Office Word Document
No ratings yet
New Microsoft Office Word Document
23 pages
DHC 8 Sop PDF
No ratings yet
DHC 8 Sop PDF
251 pages
Computer Application Packages
No ratings yet
Computer Application Packages
50 pages
Campus Recruitment Proposal - Cermati
No ratings yet
Campus Recruitment Proposal - Cermati
25 pages
63b2 - Elizabeth Kusia Europass CV 19.01.17
No ratings yet
63b2 - Elizabeth Kusia Europass CV 19.01.17
4 pages

Intro to Computer Vision Course

Uploaded by

Intro to Computer Vision Course

Uploaded by

CS5670: Intro to Computer Vision

Instructor: Noah Snavely

• But huge progress has

Sinha and Poggio, Nature, 1996

Source: “80 million tiny images” by Torralba, et al.

cars slide credit: Fei-Fei, Fergus & Torralba

Source: Nayar and Nishino, “Eyes for Relighting”

Low-light photography (credit: Hasinoff et al., SIGGRAPH ASIA 2016)

Super-resolution (source: 2d3)

Inpainting / image completion (image credit: Hays and Efros)

• Huge number of useful applications

Digit recognition, AT&T labs License plate readers

• Nearly all cameras detect faces in real time

Who is she? Source: S. Seitz

Merlin Bird ID (based on Cornell Tech technology!)

The Matrix movies, ESC Entertainment, XYZRGB, NRC

Pirates of the Carribean, Industrial Light and Magic Source: S. Seitz

Face2Face system (Thies et al.)

Sportvision first down line

Nintendo Wii has camera-based IR

NASA’s Mars Curiosity Rover Amazon Picking Challenge

Amazon Prime Air

Image guided surgery

6DoF head tracking Hand & body tracking

3D scene understanding 3D-360 video capture

Reconstruction of Dubrovnik, Croatia, from ~40,000 images

• This is a very active research area, and rapidly

• To learn more about vision applications and

Motion (Source: S. Lazebnik)

Background clutter Occlusion

slide credit: Fei-Fei, Fergus & Torralba

– We often need to use prior knowledge about the

• Announcements/grades via Piazza/CMS

• Course does not assume prior imaging experience

Feature extraction Image formation

Multi-view stereo Structure from motion

Face detection and recognition

Light & Color Reflectance

• Rough grade breakdown:

• Three free “slip days” will be available for the

• Late projects will be penalized by 5% for first

• Please do not leave any code public on GitHub

• Please see the Cornell Code of Academic

You might also like