Stereo
16-385 Computer Vision
http://www.cs.cmu.edu/~16385/ Spring 2020, Lecture 13
Course announcements
• Homework 3 is due on March 4th.
- How many of you have looked at/started/finished homework 3?
• Take-home quiz 5 is due on March 1st.
Overview of today’s lecture
• Leftover from two-view geometry.
• Revisiting triangulation.
• Disparity.
• Stereo rectification.
• Stereo matching.
• Improving stereo matching.
• Structured light.
Slide credits
Some of these slides were adapted directly from:
• Kris Kitani (16-385, Spring 2017).
• Srinivasa Narasimhan (16-823, Spring 2017).
Revisiting triangulation
How would you reconstruct 3D points?
Left image Right image
1. Select point in one image (how?)
2. Form epipolar line for that point in second image (how?)
3. Find matching point along line (how?)
4. Perform triangulation (how?)
Triangulation
[Figure: a 3D point projects into the left image and right image; left camera and right camera each with their own camera matrix]
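The triangulation step itself is not spelled out on the slide; a minimal sketch of the standard linear (DLT) method is below. The function name and the toy parallel cameras are illustrative, not from the lecture.

```python
import numpy as np

def triangulate(P1, P2, x1, x2):
    """DLT triangulation: each image measurement (x, y) contributes two
    rows of A via the constraint x cross (P X) = 0; the 3D point is the
    null vector of A."""
    A = np.array([
        x1[0] * P1[2] - P1[0],
        x1[1] * P1[2] - P1[1],
        x2[0] * P2[2] - P2[0],
        x2[1] * P2[2] - P2[1],
    ])
    _, _, Vt = np.linalg.svd(A)
    X = Vt[-1]                    # homogeneous 3D point (null vector of A)
    return X[:3] / X[3]

# Toy parallel cameras with f = 1 and unit baseline along x:
P1 = np.hstack([np.eye(3), np.zeros((3, 1))])
P2 = np.hstack([np.eye(3), np.array([[-1.0], [0.0], [0.0]])])
print(triangulate(P1, P2, (0.0, 0.0), (-0.5, 0.0)))  # point at (0, 0, 2)
```

The SVD gives the least-squares solution when the two rays do not intersect exactly, which is the usual case with noisy matches.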
How would you reconstruct 3D points?
Left image Right image
1. Select point in one image (how?)
2. Form epipolar line for that point in second image (how?)
3. Find matching point along line (how?)
4. Perform triangulation (how?)
What are the disadvantages of this procedure?
Disparity
What’s different between these two images?
Objects that are close move more or less?
The amount of horizontal movement is
inversely proportional to …
… the distance from the camera.
More formally…

[Figure: a 3D point at depth z, seen by two parallel cameras whose centers are separated by the baseline b; each camera has focal length f]

How is X related to x?
x = f X / z

How is X related to x'?
x' = f (X − b) / z

Disparity (w.r.t. the camera origin of each image plane):
d = x − x' = b f / z

Disparity is inversely proportional to depth.
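Numerically, the inverse relationship looks like this; the focal length f and baseline b below are made-up values, not calibrated ones:

```python
import numpy as np

# depth z = b * f / d   (f in pixels, b in meters, disparity d in pixels)
def depth_from_disparity(d, f=700.0, b=0.1):
    """Hypothetical calibration: f = 700 px, baseline b = 10 cm."""
    d = np.asarray(d, dtype=float)
    return b * f / np.maximum(d, 1e-6)   # guard against zero disparity

print(depth_from_disparity([70.0, 35.0, 7.0]))  # depths 1 m, 2 m, 10 m
```

Halving the disparity doubles the depth, which is why distant objects barely move between the two views.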
Real-time stereo sensing
Nomad robot searches for meteorites in Antarctica
http://www.frc.ri.cmu.edu/projects/meteorobot/index.html
Subaru EyeSight system: pre-collision braking
What other vision system uses
disparity for depth sensing?
This is how 3D movies work
Is disparity the only depth cue
the human visual system uses?
So can I compute depth from any two
images of the same object?
1. Need sufficient baseline
2. Images need to be ‘rectified’ first (make epipolar lines horizontal)
1. Rectify images
(make epipolar lines horizontal)
2. For each pixel
a. Find epipolar line
b. Scan line for best match
c. Compute depth from disparity
How can you make the epipolar lines horizontal?
[Figure: a 3D point seen by two cameras; image planes and camera centers]
What's special about these two cameras?
When are epipolar lines horizontal?
When this relationship holds:
R = I,  t = (T, 0, 0)
Then E = [t]×, and the epipolar constraint x'ᵀ E x = T (y − y') = 0 forces y' = y.
Proof in take-home quiz 5
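A quick numeric check of the claim (the point coordinates below are arbitrary):

```python
import numpy as np

T = 2.5
# E = [t]_x for t = (T, 0, 0)
E = np.array([[0.0, 0.0, 0.0],
              [0.0, 0.0, -T ],
              [0.0, T,   0.0]])
x  = np.array([0.3, 0.7, 1.0])    # point in left image (homogeneous)
xp = np.array([0.9, 0.7, 1.0])    # same y in right image
print(xp @ E @ x)                  # 0: epipolar constraint satisfied

xp_bad = np.array([0.9, 0.2, 1.0])  # different y
print(xp_bad @ E @ x)               # nonzero: constraint violated
```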
It’s hard to make the image planes exactly parallel
How can you make the epipolar lines horizontal?
Use stereo rectification?
What is stereo rectification?

Reproject image planes onto a common plane parallel to the line between camera centers.

How can you do this?

Need two homographies (3x3 transforms), one for each input image.

C. Loop and Z. Zhang. Computing Rectifying Homographies for Stereo Vision. Computer Vision and Pattern Recognition, 1999.
Stereo Rectification
1. Rotate the right camera by R (aligns camera coordinate system orientation only)
2. Rotate (rectify) the left camera so that the epipole is at infinity
3. Rotate (rectify) the right camera so that the epipole is at infinity
4. Adjust the scale
Stereo Rectification:
1. Compute E to get R
2. Rotate right image by R
3. Rotate both images by Rrect
4. Scale both images by H
Step 1: Compute E to get R
SVD: Let E = U diag(1, 1, 0) Vᵀ and W = [[0, −1, 0], [1, 0, 0], [0, 0, 1]]
We get FOUR solutions:
two possible rotations: R = U Wᵀ Vᵀ or R = U W Vᵀ
two possible translations: t = +u₃ or t = −u₃ (u₃ is the third column of U)
Which one do we choose?
• Compute the determinant of R; a valid solution must have det(R) = 1
  (note: det(R) = −1 means rotation and reflection)
• Compute a 3D point using triangulation; a valid solution has a positive Z value
  (note: negative Z means the point is behind the camera)
Let’s visualize the four configurations…
[Camera icon: image plane, optical axis, camera center]
Find the configuration where the point is in front of both cameras.
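The four candidates can be enumerated with a short sketch of the standard SVD recipe; the cheirality test via triangulation (pick the pair that puts the point in front of both cameras) is left out here:

```python
import numpy as np

def decompose_E(E):
    """Return the four candidate (R, t) pairs from an essential matrix."""
    U, _, Vt = np.linalg.svd(E)
    # force proper rotations (det = +1), ruling out reflections
    if np.linalg.det(U) < 0:
        U = -U
    if np.linalg.det(Vt) < 0:
        Vt = -Vt
    W = np.array([[0, -1, 0],
                  [1,  0, 0],
                  [0,  0, 1]])
    Ra, Rb = U @ W @ Vt, U @ W.T @ Vt   # the two possible rotations
    t = U[:, 2]                          # translation up to sign
    return [(Ra, t), (Ra, -t), (Rb, t), (Rb, -t)]
```

Among the four, the valid pair is the one for which a triangulated point has positive depth in both cameras, exactly as described on the slides above.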
Stereo Rectification:
1. Compute E to get R
2. Rotate right image by R
3. Rotate both images by Rrect
4. Scale both images by H
When do epipolar lines become horizontal? When the cameras are parallel.
Where is the epipole? For parallel cameras, the epipole is at infinity.
Setting the epipole to infinity
(Building Rrect from e)

Given: epipole e (using SVD on E); the epipole coincides with the translation vector T.

Let
r₁ = e = T / ‖T‖  (translation from E)
r₂ = (1 / √(Tx² + Ty²)) (−Ty, Tx, 0)ᵀ  (cross product of e and the direction vector of the optical axis)
r₃ = r₁ × r₂  (orthogonal vector)

Rrect = [r₁ᵀ; r₂ᵀ; r₃ᵀ]

If r₁ = e and r₂, r₃ are orthogonal to e, then Rrect e = (1, 0, 0)ᵀ.
Where is this point located on the image plane? At x-infinity.
Stereo Rectification Algorithm
1. Estimate E using the 8 point algorithm (SVD)
2. Estimate the epipole e (SVD of E)
3. Build Rrect from e
4. Decompose E into R and T
5. Set R1=Rrect and R2 = RRrect
6. Rotate each left camera point (warp image): [x' y' z'] = R1 [x y z]
7. Compute rectified points as p = (f / z') [x' y' z']
8. Repeat 6 and 7 for right camera points using R2
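Step 3 of the algorithm (building Rrect from the epipole) can be sketched as below; the epipole value is made up for illustration:

```python
import numpy as np

def rrect_from_epipole(e):
    """Rotation whose rows are r1 = e/|e|, r2 = (-e_y, e_x, 0)/norm,
    r3 = r1 x r2. It maps the epipole onto (1, 0, 0), i.e. x-infinity."""
    r1 = e / np.linalg.norm(e)
    r2 = np.array([-r1[1], r1[0], 0.0])
    r2 /= np.linalg.norm(r2)
    r3 = np.cross(r1, r2)
    return np.stack([r1, r2, r3])

e = np.array([0.2, 0.1, 1.0])            # hypothetical epipole (from SVD of E)
Rrect = rrect_from_epipole(e)
print(Rrect @ (e / np.linalg.norm(e)))   # ~ [1, 0, 0]: epipole sent to x-infinity
```

The rows are unit length and mutually orthogonal by construction, so Rrect is a valid rotation.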
What can we do after
rectification?
Stereo matching
Depth Estimation via Stereo Matching
1. Rectify images (make epipolar lines horizontal)
2. For each pixel
   a. Find epipolar line
   b. Scan line for best match (how would you do this?)
   c. Compute depth from disparity
Reminder from filtering
How do we detect an edge?
• We filter with something that looks like an edge.
  horizontal edge filter: [1 0 −1]
  vertical edge filter: [1 0 −1]ᵀ
We can think of linear filtering as a way to evaluate how similar an image is locally to some template.
Find this template
How do we detect the template in the following image?
What will the output look like?

Solution 1: Filter the image using the template as filter kernel.
What went wrong? The output increases for higher local intensities, not only where the template matches.
Solution 2: Filter the image using a zero-mean template.
Thresholding the output yields the true detection, but also false detections.
What went wrong? Not robust to high-contrast areas.
Solution 3: Use sum of squared differences (SSD).
Thresholding 1 − output yields the true detection.
What could go wrong? Not robust to local intensity changes.
Observations so far:
• subtracting the mean deals with brightness bias
• dividing by the standard deviation removes contrast bias
Can we combine the two effects?

Solution 4: Normalized cross-correlation (NCC).
Thresholding 1 − output now yields only true detections.
What is the best method?
It depends on whether you care about speed or invariance.
• Zero-mean: Fastest, very sensitive to local intensity.
• Sum of squared differences: Medium speed, sensitive to intensity offsets.
• Normalized cross-correlation: Slowest, invariant to contrast and brightness.
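A direct, unoptimized sketch of NCC template matching (real code would use an FFT-based or library implementation; the function name and test pattern are illustrative):

```python
import numpy as np

def ncc_match(image, template):
    """Normalized cross-correlation score map over all valid positions.
    Scores lie in [-1, 1]; 1 means the patch is an affine (brightness +
    contrast) transform of the template."""
    th, tw = template.shape
    t = template - template.mean()           # zero-mean template
    t_norm = np.sqrt((t ** 2).sum())
    H, W = image.shape
    out = np.zeros((H - th + 1, W - tw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            patch = image[i:i + th, j:j + tw]
            p = patch - patch.mean()         # zero-mean patch
            denom = np.sqrt((p ** 2).sum()) * t_norm
            out[i, j] = (p * t).sum() / denom if denom > 1e-12 else 0.0
    return out
```

Because both the patch and the template are zero-meaned and normalized, the score is invariant to the brightness and contrast changes that broke Solutions 1–3.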
Stereo Block Matching
[Figure: left and right images, a scanline, and matching cost plotted against disparity]
• Slide a window along the epipolar line and compare the contents of that window with the reference window in the left image
• Matching cost: SSD or normalized correlation
Similarity measures (sums are over a window W; d is the candidate disparity):
• Sum of Absolute Differences (SAD): Σ |I₁(x, y) − I₂(x + d, y)|
• Sum of Squared Differences (SSD): Σ (I₁(x, y) − I₂(x + d, y))²
• Zero-mean SAD: Σ |(I₁(x, y) − Ī₁) − (I₂(x + d, y) − Ī₂)|
• Locally scaled SAD: Σ |I₁(x, y) − (Ī₁ / Ī₂) I₂(x + d, y)|
• Normalized Cross Correlation (NCC): Σ I₁(x, y) I₂(x + d, y) / √(Σ I₁(x, y)² · Σ I₂(x + d, y)²)
[Disparity results: SAD, SSD, NCC, ground truth]
Effect of window size (W = 3 vs. W = 20)
Smaller window: + more detail, − more noise
Larger window: + smoother disparity maps, − less detail, − fails near boundaries
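The block-matching loop, written out naively with SSD (brute force; real implementations vectorize this; the function name is illustrative):

```python
import numpy as np

def block_match(left, right, max_disp, w=3):
    """SSD block matching along horizontal scanlines of rectified images.
    disp[y, x] = d means left pixel (x, y) matches right pixel (x - d, y).
    Border pixels (within half a window) are left at disparity 0."""
    H, W = left.shape
    half = w // 2
    disp = np.zeros((H, W), dtype=int)
    for y in range(half, H - half):
        for x in range(half, W - half):
            ref = left[y - half:y + half + 1, x - half:x + half + 1]
            best_cost, best_d = np.inf, 0
            for d in range(min(max_disp, x - half) + 1):
                cand = right[y - half:y + half + 1, x - d - half:x - d + half + 1]
                cost = ((ref - cand) ** 2).sum()
                if cost < best_cost:
                    best_cost, best_d = cost, d
            disp[y, x] = best_d
    return disp
```

Changing `w` reproduces the window-size tradeoff above: small windows give detailed but noisy maps, large windows give smooth maps that fail near depth boundaries.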
When will stereo block matching fail?
textureless regions repeated patterns
specularities
Improving stereo matching
[Comparison: block matching result vs. ground truth]
What are some problems with the result? Too many discontinuities; we expect disparity values to change slowly.
How can we improve depth estimation? Let's make an assumption: depth should change smoothly.
Stereo matching as …
Energy Minimization
What defines a good stereo correspondence?
1. Match quality: want each pixel to find a good match in the other image
2. Smoothness: if two pixels are adjacent, they should (usually) move about the same amount
Energy function (for one pixel):
E(d) = E_data(d) + λ E_smooth(d)
• data term: want each pixel to find a good match in the other image (block matching result)
• smoothness term: adjacent pixels should (usually) move about the same amount (smoothness function)
Data term: the SSD distance between windows centered at I(x, y) and J(x + d(x, y), y)
Smoothness term:
E_smooth = Σ_{(p, q) ∈ N} V(d_p, d_q), where N is the set of neighboring pixels (4-connected or 8-connected neighborhood)
Common choices for V:
• L1 distance: V(d_p, d_q) = |d_p − d_q|
• “Potts model”: V(d_p, d_q) = 0 if d_p = d_q, and 1 otherwise
One possible solution… Dynamic Programming
Can minimize this independently per scanline using dynamic programming (DP).
D(x, y, d): minimum cost of a solution for the scanline up to column x such that d(x, y) = d
D(x, y, d) = C(x, y, d) + min_{d'} ( D(x − 1, y, d') + λ |d − d'| )
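A sketch of the per-scanline DP, assuming an L1 smoothness term; `cost[x, d]` stands in for the block-matching data term C, and λ weights smoothness (names are illustrative):

```python
import numpy as np

def dp_scanline(cost, lam=1.0):
    """cost[x, d]: matching cost of disparity d at column x of one scanline.
    Minimizes sum_x cost[x, d_x] + lam * |d_x - d_{x-1}| exactly by DP."""
    X, D = cost.shape
    C = np.zeros((X, D))                 # C[x, d]: min cost of columns 0..x with d_x = d
    back = np.zeros((X, D), dtype=int)   # backpointers for recovering the path
    C[0] = cost[0]
    ds = np.arange(D)
    for x in range(1, X):
        # trans[d, d'] = cost of ending column x-1 at d' then jumping to d
        trans = C[x - 1][None, :] + lam * np.abs(ds[:, None] - ds[None, :])
        back[x] = trans.argmin(axis=1)
        C[x] = cost[x] + trans.min(axis=1)
    d = np.zeros(X, dtype=int)
    d[-1] = C[-1].argmin()
    for x in range(X - 2, -1, -1):       # backtrack the optimal disparities
        d[x] = back[x + 1, d[x + 1]]
    return d
```

With a large λ the smoothness term suppresses isolated disparity jumps; with a small λ the solution follows the data term. DP is optimal per scanline but ignores vertical neighbors, which is why graph cuts (next slide) do better.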
[Comparison: match only vs. match & smoothness (via graph cut) vs. ground truth]
Y. Boykov, O. Veksler, and R. Zabih, Fast Approximate Energy Minimization via Graph Cuts, PAMI 2001
All of these cases remain difficult. What can we do?
textureless regions repeated patterns
specularities
Structured light
Use controlled (“structured”) light to make correspondences easier.
Disparity between laser points on the same scanline in the images determines the 3D coordinates of the laser point on the object.
Structured light and two cameras: the laser illuminates the scene, and disparity is computed between images I and J.
Structured light and one camera: the projector acts like a “reverse” camera (in place of image J).
Example: Laser scanner
Digital Michelangelo Project
http://graphics.stanford.edu/projects/mich/
15-463/15-663/15-862 Computational Photography
Learn about structured light and other cameras – and build some on your own!
• cameras that take video at the speed of light
• cameras that measure depth in real time
• cameras that capture entire focal stacks
• cameras that see around corners
http://graphics.cs.cmu.edu/courses/15-463/
References
Basic reading:
• Szeliski textbook, Section 8.1 (not 8.1.1-8.1.3), Chapter 11, Section 12.2.
• Hartley and Zisserman, Section 11.12.