Generative adversarial networks (GANs) are a type of implicit generative model that uses two neural networks, a generator and discriminator, competing against each other. The generator tries to generate realistic samples to fool the discriminator, while the discriminator tries to distinguish real samples from generated ones. GANs have achieved impressive results generating high-resolution images but evaluating their quality is challenging. Variations like CycleGAN can perform tasks like style transfer without paired training examples.


CSC321 Lecture 19: Generative Adversarial Networks

Roger Grosse

Roger Grosse CSC321 Lecture 19: Generative Adversarial Networks 1 / 25

Overview

In generative modeling, we’d like to train a network that models a distribution, such as a distribution over images.
One way to judge the quality of the model is to sample from it.
This field has seen rapid progress:

[Figure: three panels labeled 2009, 2015, and 2018]
Overview

Four modern approaches to generative modeling:

Generative adversarial networks (today)
Reversible architectures (next lecture)
Autoregressive models (Lecture 7, and next lecture)
Variational autoencoders (CSC412)
All four approaches have different pros and cons.


Implicit Generative Models

Implicit generative models implicitly define a probability distribution
Start by sampling the code vector z from a fixed, simple distribution (e.g. spherical Gaussian)
The generator network computes a differentiable function G mapping z to an x in data space
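
A minimal sketch of the two sampling steps above, in Python. The affine `generator` and its `weights` and `bias` are hypothetical stand-ins for a trained network. Only the procedure (draw z from a spherical Gaussian, then apply a differentiable G) comes from the slide.

```python
import random

def sample_code(dim, rng):
    """Draw a code vector z from a spherical (unit-variance) Gaussian."""
    return [rng.gauss(0.0, 1.0) for _ in range(dim)]

def generator(z, weights, bias):
    """Hypothetical generator: an affine map from code space to data space.

    A real generator would be a deep network; any differentiable map fits.
    """
    return [sum(w * zi for w, zi in zip(row, z)) + b
            for row, b in zip(weights, bias)]

rng = random.Random(0)
weights = [[1.0, 0.5], [-0.5, 2.0], [0.3, 0.3]]  # maps 2-D codes to 3-D "data"
bias = [0.0, 1.0, -1.0]

z = sample_code(2, rng)          # step 1: sample the code vector
x = generator(z, weights, bias)  # step 2: map it into data space
```

The distribution over x is never written down explicitly. It is defined only through the sampling procedure, which is what makes the model implicit.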


Implicit Generative Models
A 1-dimensional example:


Implicit Generative Models

https://blog.openai.com/generative-models/


Implicit Generative Models

This sort of architecture sounded preposterous to many of us, but amazingly, it works.


Generative Adversarial Networks

The advantage of implicit generative models: if you have some criterion for evaluating the quality of samples, then you can compute its gradient with respect to the network parameters, and update the network’s parameters to make the sample a little better
The idea behind Generative Adversarial Networks (GANs): train two different networks
The generator network tries to produce realistic-looking samples
The discriminator network tries to figure out whether an image came from the training set or the generator network
The generator network tries to fool the discriminator network


Generative Adversarial Networks
Let D(x) denote the discriminator’s predicted probability that x is real data
Discriminator’s cost function: cross-entropy loss for the task of classifying real vs. fake images

$$\mathcal{J}_D = \mathbb{E}_{x \sim \mathcal{D}}[-\log D(x)] + \mathbb{E}_z[-\log(1 - D(G(z)))]$$
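
As a rough illustration, the discriminator's cost can be estimated on a batch by averaging the two cross-entropy terms. The function name and the probability values below are made up for the example. In practice `d_real` and `d_fake` would be the discriminator's outputs on training images and on generated samples.

```python
import math

def discriminator_cost(d_real, d_fake):
    """Monte Carlo estimate of J_D from discriminator outputs.

    d_real: D(x) for a batch of training examples x
    d_fake: D(G(z)) for a batch of generated samples
    """
    real_term = sum(-math.log(p) for p in d_real) / len(d_real)
    fake_term = sum(-math.log(1.0 - p) for p in d_fake) / len(d_fake)
    return real_term + fake_term

# A discriminator that separates real from fake well has a low cost ...
confident = discriminator_cost(d_real=[0.9, 0.8], d_fake=[0.1, 0.2])
# ... while one that outputs 0.5 everywhere pays log 2 on each term.
fooled = discriminator_cost(d_real=[0.5, 0.5], d_fake=[0.5, 0.5])
```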

One possible cost function for the generator: the opposite of the discriminator’s

$$\mathcal{J}_G = -\mathcal{J}_D = \text{const} + \mathbb{E}_z[\log(1 - D(G(z)))]$$

This is called the minimax formulation, since the generator and discriminator are playing a zero-sum game against each other:

$$\max_G \min_D \mathcal{J}_D$$
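
Spelling out where the constant in the generator's cost comes from, as a short derivation from the discriminator's cost defined above:

```latex
\begin{align*}
\mathcal{J}_G &= -\mathcal{J}_D \\
  &= \mathbb{E}_{x \sim \mathcal{D}}[\log D(x)] + \mathbb{E}_z[\log(1 - D(G(z)))] \\
  &= \text{const} + \mathbb{E}_z[\log(1 - D(G(z)))]
\end{align*}
```

The first expectation involves only the training data and the discriminator, so it does not depend on the generator's parameters and acts as a constant when differentiating with respect to them.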


Generative Adversarial Networks
Updating the discriminator:


Generative Adversarial Networks
Updating the generator:


Generative Adversarial Networks

Alternating training of the generator and discriminator:
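
A minimal sketch of alternating updates on a toy 1-D problem, with gradients written out by hand. Everything specific here is an assumption for illustration: the linear generator `G`, the logistic discriminator `D`, the Gaussian "training set", and the hyperparameters. It uses the minimax generator cost and makes no claim about convergence.

```python
import math
import random

def sigmoid(s):
    # Numerically stable logistic function.
    if s >= 0:
        return 1.0 / (1.0 + math.exp(-s))
    e = math.exp(s)
    return e / (1.0 + e)

def D(x, w, c):
    """Discriminator: predicted probability that x is real data."""
    return sigmoid(w * x + c)

def G(z, a, b):
    """Generator: differentiable map from code z to data space."""
    return a * z + b

def disc_cost(xs, zs, w, c, a, b):
    """Batch estimate of J_D (useful for monitoring)."""
    real = sum(-math.log(D(x, w, c)) for x in xs) / len(xs)
    fake = sum(-math.log(1.0 - D(G(z, a, b), w, c)) for z in zs) / len(zs)
    return real + fake

def disc_grads(xs, zs, w, c, a, b):
    """Gradient of J_D with respect to the discriminator parameters (w, c)."""
    gw = gc = 0.0
    for x in xs:
        err = D(x, w, c) - 1.0        # d(-log D)/d(logit)
        gw += err * x / len(xs)
        gc += err / len(xs)
    for z in zs:
        g = G(z, a, b)
        err = D(g, w, c)              # d(-log(1 - D))/d(logit)
        gw += err * g / len(zs)
        gc += err / len(zs)
    return gw, gc

def gen_grads(zs, w, c, a, b):
    """Gradient of J_G = E_z[log(1 - D(G(z)))] with respect to (a, b)."""
    ga = gb = 0.0
    for z in zs:
        dlogit = -D(G(z, a, b), w, c)  # d log(1 - D)/d(logit)
        ga += dlogit * w * z / len(zs)
        gb += dlogit * w / len(zs)
    return ga, gb

def train(steps=200, batch=32, lr=0.05, seed=0):
    rng = random.Random(seed)
    w, c, a, b = 0.1, 0.0, 1.0, 0.0
    for _ in range(steps):
        xs = [rng.gauss(3.0, 0.5) for _ in range(batch)]  # toy "training set"
        zs = [rng.gauss(0.0, 1.0) for _ in range(batch)]  # code vectors
        # Discriminator step: descend J_D with the generator held fixed.
        gw, gc = disc_grads(xs, zs, w, c, a, b)
        w, c = w - lr * gw, c - lr * gc
        # Generator step: descend J_G with the discriminator held fixed.
        ga, gb = gen_grads(zs, w, c, a, b)
        a, b = a - lr * ga, b - lr * gb
    return w, c, a, b

w, c, a, b = train()
```

Each iteration takes one gradient step per network. Taking several discriminator steps per generator step is a variation on the same loop.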


A Better Cost Function

We introduced the minimax cost function for the generator:

$$\mathcal{J}_G = \mathbb{E}_z[\log(1 - D(G(z)))]$$

One problem with this is saturation.

Recall from our lecture on classification: when the prediction is really wrong,
“Logistic + squared error” gets a weak gradient signal
“Logistic + cross-entropy” gets a strong gradient signal
Here, if the generated sample is really bad, the discriminator’s prediction is close to 0, and the generator’s cost is flat.


A Better Cost Function

Original minimax cost:

$$\mathcal{J}_G = \mathbb{E}_z[\log(1 - D(G(z)))]$$

Modified generator cost:

$$\mathcal{J}_G = \mathbb{E}_z[-\log D(G(z))]$$

This fixes the saturation problem.
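
A small numerical check of the saturation argument, assuming the discriminator's output is a logistic function of a logit `s` (so D = sigmoid(s)). The helper names are illustrative. The two gradient formulas follow from differentiating each cost with respect to `s`.

```python
import math

def sigmoid(s):
    return 1.0 / (1.0 + math.exp(-s))

def minimax_grad(s):
    """d/ds of log(1 - sigmoid(s)), the original minimax generator cost."""
    return -sigmoid(s)

def modified_grad(s):
    """d/ds of -log(sigmoid(s)), the modified generator cost."""
    return sigmoid(s) - 1.0

s = -10.0                      # discriminator is very sure the sample is fake
d = sigmoid(s)                 # D(G(z)) is close to 0
g_minimax = minimax_grad(s)    # magnitude close to 0: the cost is flat here
g_modified = modified_grad(s)  # magnitude close to 1: strong learning signal
```

When the generated sample is really bad, the minimax gradient shrinks in proportion to D(G(z)) itself, while the modified cost keeps a gradient of magnitude 1 − D(G(z)).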


Generative Adversarial Networks

Since GANs were introduced in 2014, there have been hundreds of papers introducing various architectures and training methods.
Most modern architectures are based on the Deep Convolutional GAN (DC-GAN), where the generator and discriminator are both conv nets.
GAN Zoo: https://github.com/hindupuravinash/the-gan-zoo
Good source of horrible puns (VEEGAN, Chekhov GAN, etc.)


GAN Samples
Celebrities:

Karras et al., 2017. Progressive growing of GANs for improved quality, stability, and variation


GAN Samples
Bedrooms:

Karras et al., 2017. Progressive growing of GANs for improved quality, stability, and variation


GAN Samples
Objects:

Karras et al., 2017. Progressive growing of GANs for improved quality, stability, and variation


GAN Samples

GANs revolutionized generative modeling by producing crisp, high-resolution images.
The catch: we don’t know how well they’re modeling the distribution.
Can’t measure the log-likelihood they assign to held-out data.
Could they be memorizing training examples? (E.g., maybe they sometimes produce photos of real celebrities?)
We have no way to tell if they are dropping important modes from the distribution.
See Wu et al., “On the quantitative analysis of decoder-based generative models” for partial answers to these questions.


CycleGAN

Style transfer problem: change the style of an image while preserving the content.

Data: Two unrelated collections of images, one for each style

CycleGAN

If we had paired data (same content in both styles), this would be a supervised learning problem. But such data is hard to find.
The CycleGAN architecture learns to do it from unpaired data.
Train two different generator nets to go from style 1 to style 2, and vice versa.
Make sure the generated samples of style 2 are indistinguishable from real images by a discriminator net.
Make sure the generators are cycle-consistent: mapping from style 1 to style 2 and back again should give you almost the original image.
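
The cycle-consistency requirement can be sketched as a reconstruction penalty on round trips. The L1 distance and the toy "generators" below are assumptions for illustration, since the slide only asks that the round trip give "almost the original image". A full training objective would combine a term like this with the discriminator-based terms described above.

```python
def l1_distance(a, b):
    """Mean absolute difference between two equally sized images."""
    return sum(abs(p - q) for p, q in zip(a, b)) / len(a)

def cycle_consistency_loss(x1, x2, gen_1to2, gen_2to1):
    """Penalize round trips that fail to reproduce the original image."""
    forward = l1_distance(gen_2to1(gen_1to2(x1)), x1)   # style 1 -> 2 -> 1
    backward = l1_distance(gen_1to2(gen_2to1(x2)), x2)  # style 2 -> 1 -> 2
    return forward + backward

# Hypothetical stand-in "generators" acting on flattened images.
def brighten(img):
    return [p + 0.5 for p in img]

def darken(img):
    return [p - 0.5 for p in img]

def halve(img):
    return [p * 0.5 for p in img]

x1 = [0.0, 0.25, 0.5]   # toy image in style 1
x2 = [0.5, 0.75, 1.0]   # toy, unrelated image in style 2

consistent = cycle_consistency_loss(x1, x2, brighten, darken)   # exact inverses
inconsistent = cycle_consistency_loss(x1, x2, brighten, halve)  # not inverses
```

Note that `x1` and `x2` never need to depict the same content. The loss compares each image only with its own round-trip reconstruction, which is why unpaired data suffices.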


CycleGAN

Style transfer between aerial photos and maps:


CycleGAN

Style transfer between road scenes and semantic segmentations (labels of every pixel in an image by object category):
