0% found this document useful (0 votes)

90 views75 pages

Computer Vision ch2

This document discusses an introductory computer vision course. It provides an overview of topics covered in the first week, including a brief history of computer vision and different applications. It also discusses what an image is, how images can be represented as matrices, and how image transformations like convolution and filtering work. Convolution and filtering are described as powerful ways to implement complex image operations by applying a kernel or filter to local neighborhoods in an image.

Uploaded by

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

90 views75 pages

Computer Vision ch2

Uploaded by

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 75

EE5811

Topics in Computer Vision

Dr. LI Haoliang
Department of Electrical Engineering
Recap of Week 1
• A brief History of Computer Vision
• Different Applications
What is an image?
What is an image?

Digital Camera

We’ll focus on these in this course

Also image formation The Eye

Source: A. Efros
What is an image?
• A grid (matrix) of intensity values
255 255 255 255 255 255 255 255 255 255 255 255

255 255 255 255 255 255 255 255 255 255 255 255

255 255 255 20 0 255 255 255 255 255 255 255

255 255 255 75 75 75 255 255 255 255 255 255

=
255 255 75 95 95 75 255 255 255 255 255 255

255 255 96 127 145 175 255 255 255 255 255 255

255 255 127 145 175 175 175 255 255 255 255 255

255 255 127 145 200 200 175 175 95 255 255 255

255 255 127 145 200 200 175 175 95 47 255 255

255 255 127 145 145 175 127 127 95 47 255 255

255 255 74 127 127 127 95 95 95 47 255 255

255 255 255 74 74 74 74 74 74 255 255 255

255 255 255 255 255 255 255 255 255 255 255 255

(common to use one byte per value: 0 = black, 255 = white)

What is an image?
• We can think of a (grayscale) image as a function, f,
from R2 to R:
• f (x,y) gives the intensity at position (x,y)
f (x, y)

3D view

• A digital image is a discrete (sampled, quantized) version

of this function
Image transformations
• As with any function, we can apply operators to an
image

Example

g (x,y) = f (x,y) + 20 g (x,y) = f (-x,y)

Characterizing image
transformations

[i] does not mean transformation is applied at each pixel separately

Source: Deva Ramanan

Characterizing image
transformations
• Properties of “nice” functional transformation
Impulse response
• Delta function
Impulse response
• Delta function
Convolution
Convolution
Example
Example
Properties of Convolution

We can efficiently implement complex operations

Powerful way to think about ANY image transformation that

satisfies additivity, scaling, and shift-invariance.
Size
• Given F of length N and H of length M, what’s size
of G = F * H?
(Cross) Correlation

Commutative properties do not hold

Convolution vs. Correlation
Cross-correlation
Let be the image, be the kernel (of
size 2k+1 x 2k+1), and be the output
image

This is called a cross-correlation operation:

• Can think of as a “dot product” between

local neighborhood and kernel for each pixel
Convolution
• Same as cross-correlation, except that the kernel is
“flipped” (horizontally and vertically)

This is called a convolution operation:

• Convolution is commutative and associative

Linear filtering
• Cross-correlation, convolution
• Replace each pixel by a linear combination (a weighted sum)
of its neighbors
• The prescription for the linear combination is called
the “kernel” (or “mask”, “filter”)

10 5 3 0 0 0
4 6 1 0 0.5 0 8
1 1 8 0 1 0.5
Local image data kernel Modified image data

Source: L. Zhang
Filters
• Filtering
• Form a new image whose pixels are a combination of the
original pixels
• Why?
• To get useful information from images
• E.g., extract edges or contours (to understand shape)
• To enhance the image
• E.g., to remove noise
• E.g., to sharpen or to “enhance image”
Canonical Image Processing
problems
• Image Restoration
• denoising
• deblurring
• Image Compression
• JPEG, JPEG2000, MPEG..
• Computing Field Properties
• optical flow
• disparity
• Locating Structural Features
• corners
• edges
Question: Noise reduction
• Given a camera and a still scene, how can you
reduce noise?

Take lots of images and average them!

What’s the next best thing?
Source: S. Seitz
Image filtering
• Modify the pixels in an image based on some function of
a local neighborhood of each pixel

10 5 3 Convolution
4 5 1 7
1 1 7

Local image data Modified image data

Source: L. Zhang
Convolution

Adapted from F. Durand

Border effects
Border padding
Mean filtering
0 0 0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0 0 0 0 10 20 30 30 30 20 10
0 0 0 90 90 90 90 90 0 0 0 20 40 60 60 60 40 20

1 1 1 0 0 0 90 90 90 90 90 0 0 0 30 60 90 90 90 60 30

1
1
1
1
1
1
*
0
0
0
0
0
0
0
0
0
90
90
90
90
0
90
90
90
90
90
90
90
90
90
90
0
0
0
0
0
0
= 0
0
0
30
30
20
50
50
30
80
80
50
80
80
50
90
90
60
60
60
40
30
30
20
0 0 0 0 0 0 0 0 0 0 10 20 30 30 30 30 20 10
0 0 90 0 0 0 0 0 0 0 10 10 10 0 0 0 0 0

0 0 0 0 0 0 0 0 0 0
Mean filtering/Moving average
Mean filtering/Moving average
Mean filtering/Moving average
Mean filtering/Moving average
Mean filtering/Moving average
Mean filtering/Moving average
Linear filters: examples

0 0 0

=
* 0
0
1
0
0
0

Original Identical image

Source: D. Lowe
Linear filters: examples

0 0 0

=
* 1
0
0
0
0
0

Original Shifted left

By 1 pixel

Source: D. Lowe
Linear filters: examples

1 1 1

=
* 1
1
1
1
1
1

Original Blur (with a mean filter)

Source: D. Lowe
Linear filters: examples

-
0 0 0 1 1 1

=
* 0
0
2
0
0
0
1
1
1
1
1
1

Sharpening filter
Original (accentuates edges)

Source: D. Lowe
Sharpening

Source: D. Lowe
Smoothing with box filter revisited

Source: D. Forsyth
Gaussian Kernel

0.003 0.013 0.022 0.013 0.003

0.013 0.059 0.097 0.059 0.013
0.022 0.097 0.159 0.097 0.022
0.013 0.059 0.097 0.059 0.013
0.003 0.013 0.022 0.013 0.003

5 x 5,  = 1

• Constant factor at front makes volume sum to 1 (can be ignored, as

we should re-normalize weights to sum to 1 in any case)

Source: C. Rasmussen
Gaussian Kernel

Source: C. Rasmussen
Gaussian filters

= 1 pixel = 5 pixels = 10 pixels = 30 pixels

Mean vs. Gaussian filtering
Gaussian filter
• Removes “high-frequency” components from the
image (low-pass filter)
• Convolution with self is another Gaussian

* =

Source: K. Grauman
Sharpening revisited
• What does blurring take away?

– =
original smoothed (5x5) detail

Let’s add it back:

+α =
original detail sharpened
Source: S. Lazebnik
Filters: Thresholding
Image Scaling
This image is too big to fit on the
screen. How can we generate a
half-sized version?

Source: S. Seitz
Image sub-sampling

1/8

1/4

Throw away every other row and

column to create a smaller size image
- called image sub-sampling
Source: S. Seitz
Image sub-sampling

1/2 1/4 (2x zoom) 1/8 (4x zoom)

Why does this look so bad?

Source: S. Seitz
Aliasing

• Occurs when your sampling rate is not high enough to

capture the amount of detail in your image
• Can give you the wrong signal/image—an alias

• To do sampling right, need to understand the structure of

your signal/image

• To avoid aliasing:
• sampling rate ≥ 2 * max frequency in the image
• said another way: ≥ two samples per cycle
• This minimum sampling rate is called the Nyquist rate
Source: L. Zhang
Nyquist limit – 2D example

Good sampling

Bad sampling
Aliasing
• When downsampling by a factor of two
• Original image has frequencies that are too high

• How can we fix this?

Gaussian pre-filtering

G 1/8

G 1/4

Gaussian 1/2

• Solution: filter the image, then subsample

Source: S. Seitz
Subsampling with Gaussian pre-filtering

Gaussian 1/2 G 1/4 G 1/8

• Solution: filter the image, then subsample

Source: S. Seitz
Compare with...

1/2 1/4 (2x zoom) 1/8 (4x zoom)

Source: S. Seitz
Gaussian
pre-filtering
• Solution: filter
the image, then
subsample

F0 F1 F2

blur subsample blur subsample …

F0 * H F1 * H
Gaussian
pyramid

F0 F1 F2

blur subsample blur subsample …

F0 * H F1 * H
Gaussian pyramids
[Burt and Adelson, 1983]

• In computer graphics, a mip map [Williams, 1983]

• A precursor to wavelet transform

Gaussian Pyramids have all sorts of applications in computer vision

Source: S. Seitz
Upsampling
• This image is too small for this screen:
• How can we make it 10 times as big?
• Simplest approach:
repeat each row
and column 10 times
• (“Nearest neighbor
interpolation”)
Image interpolation

d = 1 in this
example

1 2 3 4 5

Recall how a digital image is formed

• It is a discrete point-sampling of a continuous function

• If we could somehow reconstruct the original function, any new
image could be generated, at any resolution and scale

Adapted from: S. Seitz

Image interpolation

d = 1 in this
example

1 2 3 4 5

Recall how a digital image is formed

• It is a discrete point-sampling of a continuous function

• If we could somehow reconstruct the original function, any new
image could be generated, at any resolution and scale

Adapted from: S. Seitz

Image interpolation

1 d = 1 in this
example

1 2 2.5 3 4 5

• What if we don’t know ?

• Guess an approximation:
• Can be done in a principled way: filtering
• Convert to a continuous function:

• Reconstruct by convolution with a reconstruction filter, h

Adapted from: S. Seitz

Image interpolation
“Ideal” reconstruction

Nearest-neighbor
interpolation

Linear interpolation

Gaussian reconstruction

Source: B. Curless
Image interpolation
• What does the 2D version of this hat function look like?

performs
linear interpolation bilinear interpolation

Better filters give better resampled images

• Bicubic is common choice

Cubic reconstruction filter

Image interpolation
Original image: x 10

Nearest-neighbor interpolation Bilinear interpolation Bicubic interpolation

Potential project: Seam carving

https://en.wikipedia.org/wiki/Seam_carving
Image interpolation
• Resizing (resampling)
• Remapping (geometrical Transformation, rotation,...)
• Inpainting (restauration of holes)
• Morphing, nonlinear transformations
Coding Exercise
• Using Python to implement image filtering with the
provided image as input (cv2.filter2D)

-
0 0 0 1 1 1
1 1 1
0 2 0 1 1 1
1 1 1 0 0 0 1 1 1
1 1 1

You can also try

other filters.
Coding Exercise
• Image Interpolation
• Nearest Neighbor Interpolation: selects the value of the
nearest point and does not consider the values of
neighboring points at all, yielding a piecewise-constant
interpolant.
• Bilinear Interpolation: uses values of only the 4 nearest
pixels, located in diagonal directions from a given pixel
• Bicubic Interpolation: considers 16 pixels (4×4).

cv2.resize(src, dsize[, dst[, fx[, fy[, interpolation]]]]) → dst

Coding Exercise

https://docs.opencv.org/3.4/dc/dff/tutorial_py_pyramids.html
Question
• Suppose that you filter an image f(x,y) with a
spatial filter mask w(x,y) using convolution, where
the mask is smaller than the image in both spatial
directions.
• Show the important property that, if the coefficients of
the mask sum to zero, then the sum of all the elements
in the resulting filtered image will be zero also (you may
assume that the border of the image has been padded
with the appropriate number of zeros).
• Would the result be the same if the filtering is
implemented using correlation?
Solution

Lecture1 2
No ratings yet
Lecture1 2
33 pages
Lec01 Filter For Web
No ratings yet
Lec01 Filter For Web
47 pages
Lec01 Filter
No ratings yet
Lec01 Filter
60 pages
Lec01 Filter For Web
No ratings yet
Lec01 Filter For Web
47 pages
Sampling and Reconstruction: 15-463: Computational Photography Alexei Efros, CMU, Fall 2007
No ratings yet
Sampling and Reconstruction: 15-463: Computational Photography Alexei Efros, CMU, Fall 2007
55 pages
Ip 1
No ratings yet
Ip 1
123 pages
Proc Imagini
No ratings yet
Proc Imagini
54 pages
Lect02 ImageProcessingReview
No ratings yet
Lect02 ImageProcessingReview
53 pages
3.2 Quizlec03 Resample For Web
No ratings yet
3.2 Quizlec03 Resample For Web
44 pages
Lecture 2 1 Image Filtering 2018
No ratings yet
Lecture 2 1 Image Filtering 2018
46 pages
Lecture 3 Siududs
No ratings yet
Lecture 3 Siududs
50 pages
Lec 20
No ratings yet
Lec 20
56 pages
Lecture 2.1 - Image Processing Image Filtering: Idar Dyrdal
No ratings yet
Lecture 2.1 - Image Processing Image Filtering: Idar Dyrdal
38 pages
SGM4-Study Guide For Module 4
No ratings yet
SGM4-Study Guide For Module 4
15 pages
Lec 5
No ratings yet
Lec 5
127 pages
Module2 ImageEnhancement Preprocess
No ratings yet
Module2 ImageEnhancement Preprocess
53 pages
Module-1 - Chapter3 Image Processing
No ratings yet
Module-1 - Chapter3 Image Processing
48 pages
Lec 4
No ratings yet
Lec 4
122 pages
Module 1 Chapter 3 CV
No ratings yet
Module 1 Chapter 3 CV
19 pages
Linear Filters
No ratings yet
Linear Filters
65 pages
Lect2 PDF
No ratings yet
Lect2 PDF
43 pages
Image Filtering: Davide Scaramuzza
No ratings yet
Image Filtering: Davide Scaramuzza
63 pages
Filtering Basics
No ratings yet
Filtering Basics
83 pages
DIP Unit 2 (Enhancement, Binary, Colour)
No ratings yet
DIP Unit 2 (Enhancement, Binary, Colour)
126 pages
Computer Vision-Lec 02
No ratings yet
Computer Vision-Lec 02
121 pages
Image Processing Seminar
No ratings yet
Image Processing Seminar
48 pages
Digital Image Processing: Assignment No. 2
No ratings yet
Digital Image Processing: Assignment No. 2
18 pages
CVR 2
No ratings yet
CVR 2
24 pages
Linear Filters
No ratings yet
Linear Filters
41 pages
Suggested Readings: Image Processing Basics
No ratings yet
Suggested Readings: Image Processing Basics
11 pages
DSP
No ratings yet
DSP
260 pages
4 Chapter2
No ratings yet
4 Chapter2
45 pages
03 - Basics of Image Processing
No ratings yet
03 - Basics of Image Processing
82 pages
Module-2 - Computer Vision Complete
No ratings yet
Module-2 - Computer Vision Complete
113 pages
CV Unit-2
No ratings yet
CV Unit-2
30 pages
Image Filtering: Associate Professor Faculty of Computer Science Institute of Business Administration - Karachi
No ratings yet
Image Filtering: Associate Professor Faculty of Computer Science Institute of Business Administration - Karachi
59 pages
Module 2 24
No ratings yet
Module 2 24
48 pages
UNIT-2 Image Enhancement (Updated)
No ratings yet
UNIT-2 Image Enhancement (Updated)
35 pages
Module-2 - Computer Vision Complete
No ratings yet
Module-2 - Computer Vision Complete
57 pages
CV Unit-2 Final
No ratings yet
CV Unit-2 Final
29 pages
04 Low Level Processing
No ratings yet
04 Low Level Processing
66 pages
Image Processing Techniques
No ratings yet
Image Processing Techniques
19 pages
Image Enhancement Techniques Guide
No ratings yet
Image Enhancement Techniques Guide
22 pages
Revision Chapter 3
No ratings yet
Revision Chapter 3
114 pages
Image Enhancement: Images Courtesy: Digital Image Processing FOURTH EDITION, Rafael C. Gonzalez - Richard E. Woods
No ratings yet
Image Enhancement: Images Courtesy: Digital Image Processing FOURTH EDITION, Rafael C. Gonzalez - Richard E. Woods
130 pages
Image Processing Husseina Ozigi Otaru
No ratings yet
Image Processing Husseina Ozigi Otaru
54 pages
Image Enhancement in Spatial Domain: Pixel Operations and Histogram Processing
No ratings yet
Image Enhancement in Spatial Domain: Pixel Operations and Histogram Processing
59 pages
Digital Image Processing
No ratings yet
Digital Image Processing
37 pages
Image Enhancement Techniques
No ratings yet
Image Enhancement Techniques
106 pages
Image Filtering
No ratings yet
Image Filtering
17 pages
Tıbbi Görüntüieme
No ratings yet
Tıbbi Görüntüieme
137 pages
Assignment 2 Img PDF
No ratings yet
Assignment 2 Img PDF
7 pages
Image Processing for Engineers
No ratings yet
Image Processing for Engineers
139 pages
Film Photography: Imaging
No ratings yet
Film Photography: Imaging
158 pages
Chapter - 3 Image Filtering
No ratings yet
Chapter - 3 Image Filtering
79 pages
Ch03 Enhancement 3
No ratings yet
Ch03 Enhancement 3
74 pages
Module 2 IVP
No ratings yet
Module 2 IVP
151 pages
Image Processing Interpolation
No ratings yet
Image Processing Interpolation
69 pages
Image Filtering 1
No ratings yet
Image Filtering 1
86 pages
Edge-Preserving Image Filters
No ratings yet
Edge-Preserving Image Filters
4 pages
Automatic Lung Nodules Segmentation and Its 3D Visualization
No ratings yet
Automatic Lung Nodules Segmentation and Its 3D Visualization
98 pages
Aiwua NSX d737
No ratings yet
Aiwua NSX d737
72 pages
Doblinger Matlab Course
No ratings yet
Doblinger Matlab Course
99 pages
Audicodes - Codec y Slic PDF
No ratings yet
Audicodes - Codec y Slic PDF
113 pages
12th International Conference On Signal and Image Processing (Signal 2025)
No ratings yet
12th International Conference On Signal and Image Processing (Signal 2025)
3 pages
Pixil
No ratings yet
Pixil
23 pages
I Jcs It 20140503357
No ratings yet
I Jcs It 20140503357
3 pages
ENG311
No ratings yet
ENG311
2 pages
Dilation and Erosion
No ratings yet
Dilation and Erosion
39 pages
Image Compression
No ratings yet
Image Compression
24 pages
DSP Exam for ECE Students
No ratings yet
DSP Exam for ECE Students
2 pages
Kramer DSP 62 Aec and Uc Um 2
No ratings yet
Kramer DSP 62 Aec and Uc Um 2
88 pages
EE 333 Digital Image Processing
No ratings yet
EE 333 Digital Image Processing
4 pages
DSP Lab File
No ratings yet
DSP Lab File
56 pages
Linear Convolution and Even and Odd Signals
No ratings yet
Linear Convolution and Even and Odd Signals
8 pages
Digital Image Processing
No ratings yet
Digital Image Processing
50 pages
3D Ultrasound Image Reconstruction Based On VTK With Marching Cube
No ratings yet
3D Ultrasound Image Reconstruction Based On VTK With Marching Cube
5 pages
Anna University Chennai Chennai 600 025
No ratings yet
Anna University Chennai Chennai 600 025
50 pages
2020 Houdini Learning
100% (2)
2020 Houdini Learning
87 pages
Lab 3
No ratings yet
Lab 3
4 pages
Texture Slicing
No ratings yet
Texture Slicing
6 pages
Laboratory 4 Linear and Non Linear Spatial Filters: Cameraman - Tif' 'Ave' 'Replicate'
No ratings yet
Laboratory 4 Linear and Non Linear Spatial Filters: Cameraman - Tif' 'Ave' 'Replicate'
10 pages
Crown - I-Tech-8000 - SM - Original (1) - 1-46
No ratings yet
Crown - I-Tech-8000 - SM - Original (1) - 1-46
46 pages
Computer Vision LAB Assignment
No ratings yet
Computer Vision LAB Assignment
17 pages
Yamaha RX v573 HTR 5065
No ratings yet
Yamaha RX v573 HTR 5065
134 pages
Double Pendulum For Analyzing Alarithms
No ratings yet
Double Pendulum For Analyzing Alarithms
20 pages
Berg 4 Instructions
No ratings yet
Berg 4 Instructions
2 pages
Onetimechane Timetable
No ratings yet
Onetimechane Timetable
55 pages
Discrete Time Systems
100% (1)
Discrete Time Systems
34 pages

Computer Vision ch2

Uploaded by

Computer Vision ch2

Uploaded by

EE5811

Topics in Computer Vision

We’ll focus on these in this course

Also image formation The Eye

255 255 255 75 75 75 255 255 255 255 255 255

255 255 74 127 127 127 95 95 95 47 255 255

255 255 255 74 74 74 74 74 74 255 255 255

(common to use one byte per value: 0 = black, 255 = white)

• A digital image is a discrete (sampled, quantized) version

g (x,y) = f (x,y) + 20 g (x,y) = f (-x,y)

[i] does not mean transformation is applied at each pixel separately

Source: Deva Ramanan

We can efficiently implement complex operations

Powerful way to think about ANY image transformation that

Commutative properties do not hold

This is called a cross-correlation operation:

• Can think of as a “dot product” between

This is called a convolution operation:

• Convolution is commutative and associative

Take lots of images and average them!

Local image data Modified image data

Adapted from F. Durand

Original Identical image

Original Shifted left

Original Blur (with a mean filter)

0.003 0.013 0.022 0.013 0.003

• Constant factor at front makes volume sum to 1 (can be ignored, as

= 1 pixel = 5 pixels = 10 pixels = 30 pixels

Let’s add it back:

Throw away every other row and

1/2 1/4 (2x zoom) 1/8 (4x zoom)

Why does this look so bad?

• Occurs when your sampling rate is not high enough to

• To do sampling right, need to understand the structure of

• How can we fix this?

• Solution: filter the image, then subsample

Gaussian 1/2 G 1/4 G 1/8

• Solution: filter the image, then subsample

1/2 1/4 (2x zoom) 1/8 (4x zoom)

blur subsample blur subsample …

blur subsample blur subsample …

• In computer graphics, a mip map [Williams, 1983]

Gaussian Pyramids have all sorts of applications in computer vision

Recall how a digital image is formed

• It is a discrete point-sampling of a continuous function

Adapted from: S. Seitz

Recall how a digital image is formed

• It is a discrete point-sampling of a continuous function

Adapted from: S. Seitz

• What if we don’t know ?

• Reconstruct by convolution with a reconstruction filter, h

Adapted from: S. Seitz

Better filters give better resampled images

Cubic reconstruction filter

Nearest-neighbor interpolation Bilinear interpolation Bicubic interpolation

Potential project: Seam carving

You can also try

cv2.resize(src, dsize[, dst[, fx[, fy[, interpolation]]]]) → dst

You might also like