Generate Dog Images with
Generative Adversarial Networks (GAN)
Machine Learning II Project
Group 1: Gaofeng Huang, Jun Ying, Xi Zhang
Outline
Introduction of GAN
Data Description
Model Description
Experimental Setup
Results
Potential Improvement
Introduction of GAN
Application of GAN in the Real World
◦ Shakespearean poetry
◦ Random music
◦ Snapchat baby-face filter
Introduction of GAN
Background
GAN is a very new technique with a promising future. Especially in the last year (2018), GAN research grew exponentially. In other words, it is still an infant technology.
Problem Definition
& Research Target
We are curious how the neural networks we are familiar with behave when applied in this new structure. That is our research motivation.
Data Source
Data Description
Index | Dog Breed | Number of Observations
0 | n02085620-Chihuahua | 152
1 | n02085782-Japanese_spaniel | 185
2 | n02085936-Maltese_dog | 252
3 | n02086079-Pekinese | 149
4 | n02086240-Shih-Tzu | 214
… | … | …
116 | n02113978-Mexican_hairless | 155
117 | n02115641-dingo | 156
118 | n02115913-dhole | 150
119 | n02116738-African_hunting_dog | 169
Description of Models - The Basic Concept
Structure of GAN
Random vectors are fed into the generator to produce fake images; the discriminator is responsible for classifying images as fake or real.
As the discriminator becomes stricter, the generator must generate more realistic images to fool the discriminator.
Description of Models – Adversarial Relationship
Description of Models – Discriminator Training
● Initialize the discriminator.
● Sample from the database.
● Input random vectors into the fixed generator to get generated images.
● Label the real images as 1 and the generated images as 0.
● Update the parameters of the discriminator as in classifier training.
Description of Models – Generator Training
● Fix the trained discriminator.
● Generate images.
● Get the score from the trained discriminator.
● Force the generator to generate images that can be graded close to 1.
● Like the optimizer in our familiar neural networks, but as a gradient ascent process.
Description of Models – Generator Training
When we put these two processes together, we get one whole network. If we
extract the output of the hidden layer in the middle, we get a complete
image. Details are shown in the model implementation section.
Description of Models – Math Concept
Discriminator parameters: $\theta_d$; Generator parameters: $\theta_g$
Sample from the database: $\{x^1, x^2, \dots, x^m\}$; noise samples: $\{z^1, z^2, \dots, z^m\}$
Obtain generated data $\{\tilde{x}^1, \tilde{x}^2, \dots, \tilde{x}^m\}$, where $\tilde{x}^i = G(z^i)$

Update discriminator parameters $\theta_d$ (gradient ascent):
$\tilde{V} = \frac{1}{m}\sum_{i=1}^{m} \log D(x^i) + \frac{1}{m}\sum_{i=1}^{m} \log\left(1 - D(\tilde{x}^i)\right)$
$\theta_d \leftarrow \theta_d + \eta \nabla \tilde{V}(\theta_d)$

Update generator parameters $\theta_g$ (gradient ascent):
$\tilde{V} = \frac{1}{m}\sum_{i=1}^{m} \log D\left(G(z^i)\right)$
$\theta_g \leftarrow \theta_g + \eta \nabla \tilde{V}(\theta_g)$

($\eta$: learning rate)
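The alternating updates above can be made concrete with a small numpy sketch. Everything here is a toy assumption chosen only for illustration: a linear discriminator D(x) = sigmoid(wx + b), a linear generator G(z) = az + c, 1-D "real" data from N(3, 1), and numeric finite-difference gradients instead of backpropagation.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))

def V_d(w, b, a, c, x, z):
    """Discriminator objective: mean log D(x) + mean log(1 - D(G(z)))."""
    fake = a * z + c                       # G(z) for the toy linear generator
    return (np.mean(np.log(sigmoid(w * x + b)))
            + np.mean(np.log(1.0 - sigmoid(w * fake + b))))

def V_g(w, b, a, c, z):
    """Generator objective: mean log D(G(z)) (the ascent form above)."""
    fake = a * z + c
    return np.mean(np.log(sigmoid(w * fake + b)))

m = 512                                    # minibatch size
x = rng.normal(3.0, 1.0, m)                # "real" samples
z = rng.normal(0.0, 1.0, m)                # noise samples
w, b = 0.1, 0.0                            # theta_d
a, c = 1.0, 0.0                            # theta_g
eta, eps = 0.2, 1e-6                       # learning rate, finite-diff step

# One ascent step on theta_d (generator fixed).
vd_before = V_d(w, b, a, c, x, z)
gw = (V_d(w + eps, b, a, c, x, z) - V_d(w - eps, b, a, c, x, z)) / (2 * eps)
gb = (V_d(w, b + eps, a, c, x, z) - V_d(w, b - eps, a, c, x, z)) / (2 * eps)
w, b = w + eta * gw, b + eta * gb
vd_after = V_d(w, b, a, c, x, z)

# One ascent step on theta_g (discriminator fixed).
vg_before = V_g(w, b, a, c, z)
ga = (V_g(w, b, a + eps, c, z) - V_g(w, b, a - eps, c, z)) / (2 * eps)
gc = (V_g(w, b, a, c + eps, z) - V_g(w, b, a, c - eps, z)) / (2 * eps)
a, c = a + eta * ga, c + eta * gc
vg_after = V_g(w, b, a, c, z)

assert vd_after > vd_before and vg_after > vg_before  # both objectives rose
```

A real GAN replaces the numeric gradients with backpropagation and the linear D and G with deep networks, but the alternating ascent structure is the same.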
Simple GAN with MLP
◦ To construct a GAN
◦ Generator → Upsampling
◦ Discriminator → Downsampling
◦ Upsampling
◦ MLP: increasing number of neurons
◦ Downsampling
◦ MLP: diminishing number of neurons
◦ Generator (13.28m parameters)
Input noise vector z ∈ ℝ^100
Dense, 256 + BN + LReLU
Dense, 512 + BN + LReLU
Dense, 1024 + BN + LReLU
Dense + Reshape, 64 x 64 x 3 + Tanh
Output fake images ∈ ℝ^(64×64×3)
◦ Discriminator (6.46m parameters)
Input RGB image ∈ ℝ^(64×64×3)
Flatten + Dense, 512 + LReLU
Dense, 256 + LReLU
Dense, 128 + LReLU
Dense, 1 + Sigmoid
Output probability ∈ ℝ^1
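Assuming the usual layer formulas (i·o + o parameters for a dense layer mapping i inputs to o outputs, and 2 trainable parameters per feature for BatchNorm), the listed layers reproduce the quoted counts; a quick pure-Python check:

```python
# Pure-Python check of the quoted parameter counts, assuming standard
# layer formulas: Dense(i -> o) has i*o + o parameters and BatchNorm
# contributes 2 trainable parameters (gamma, beta) per feature.

def dense(i, o):
    return i * o + o

def bn(features):
    return 2 * features

# Generator: 100 -> 256 -> 512 -> 1024 -> 64*64*3
gen = (dense(100, 256) + bn(256)
       + dense(256, 512) + bn(512)
       + dense(512, 1024) + bn(1024)
       + dense(1024, 64 * 64 * 3))

# Discriminator: flatten(64*64*3) -> 512 -> 256 -> 128 -> 1
disc = (dense(64 * 64 * 3, 512)
        + dense(512, 256)
        + dense(256, 128)
        + dense(128, 1))

print(gen, disc)   # 13281536 6456321, i.e. ~13.28m and ~6.46m
```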
Simple GAN with MLP
◦ After 1000 epochs ◦ After 3000 epochs ◦ After 10000 epochs
Simple GAN with MLP: Problems
Imbalance between Generator and Discriminator ◦ After 10000 epochs
◦ Even with more epochs, the generated images are still blurred → the model stops learning.
◦ The Generator can never compete with the Discriminator.
◦ Intuitively, creation is more difficult than criticism. In fact, it tends to be easy to tell whether an artwork is real or fake; however, without seeing the real artwork, it is really hard to create a fake that looks just like the real one.
◦ Mathematically, the Generator's gradients vanish.
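The vanishing-gradient point can be made concrete. This aside follows the standard GAN analysis rather than these slides' exact derivation: with the original minimax generator loss log(1 − D(G(z))), the gradient with respect to the discriminator's logit vanishes exactly when the Discriminator confidently rejects fakes, while the ascent objective log D(G(z)) from the math slide keeps a large gradient in the same situation.

```python
import numpy as np

def sigmoid(t):
    return 1.0 / (1.0 + np.exp(-t))

# t is the discriminator's logit for a fake image; early in training the
# discriminator confidently rejects fakes, so t is very negative.
t = -8.0                                   # D(G(z)) = sigmoid(-8) ~ 0.0003

# Saturating loss log(1 - D): d/dt = -sigmoid(t) -> ~0 (vanishes).
grad_saturating = -sigmoid(t)

# Non-saturating objective log D: d/dt = 1 - sigmoid(t) -> ~1 (survives).
grad_non_saturating = 1.0 - sigmoid(t)

assert abs(grad_saturating) < 1e-3 < abs(grad_non_saturating)
```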
Potential solutions
◦ A trivial approach is to adjust the numbers of training steps of the Generator and the Discriminator separately.
◦ In practice, it helps a little, but it also makes the training process more unstable.
◦ An MLP-based Generator cannot focus on the detailed features of an image → create a deeper Generator structure with convolution layers.
◦ Deep Convolution GAN (DCGAN)
◦ Some other approaches
◦ Spectral normalization.
◦ Finding a loss function and an activation function with stable, non-vanishing gradients.
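Spectral normalization, mentioned above, can be sketched in a few lines of numpy: the idea is to rescale a weight matrix by its largest singular value so the layer becomes (approximately) 1-Lipschitz. Production implementations estimate that value cheaply with power iteration rather than a full SVD; this sketch uses SVD for clarity.

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.normal(size=(128, 256))            # a hypothetical layer's weights

sigma = np.linalg.svd(W, compute_uv=False)[0]   # largest singular value
W_sn = W / sigma                                # spectrally normalized weights

# After normalization the spectral norm is 1 (up to float error).
assert abs(np.linalg.svd(W_sn, compute_uv=False)[0] - 1.0) < 1e-9
```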
DCGAN - Upsampling & Downsampling
◦ To construct a DCGAN
◦ Generator à Upsampling
◦ Discriminator à Downsampling
◦ Downsampling
◦ MLP: Diminishing number of neurons
◦ Convolution (with stride)
DCGAN - Upsampling & Downsampling
◦ Upsampling
◦ MLP: increasing number of neurons
◦ Transposed convolution (with stride)
◦ Much the same as convolution
◦ The stride concept is transposed
◦ The padding concept is transposed
◦ E.g. Conv2D(stride=(2, 2), padding='same') reduces a size from 6x6 to 3x3; TransposedConv2D(stride=(2, 2), padding='same') increases a size from 3x3 to 6x6.
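The 6x6 ↔ 3x3 example follows from the standard 'same'-padding size formulas, which can be checked in plain Python:

```python
import math

# Standard 'same'-padding size formulas: a strided convolution produces
# ceil(n / s), and its transposed counterpart multiplies the size back.

def conv_out(n, stride):
    return math.ceil(n / stride)       # Conv2D with padding='same'

def transp_conv_out(n, stride):
    return n * stride                  # TransposedConv2D with padding='same'

assert conv_out(6, 2) == 3             # 6x6 -> 3x3
assert transp_conv_out(3, 2) == 6      # 3x3 -> 6x6
```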
[Animations: transposed convolution with Stride(2, 2), Padding=1 and with Stride(1, 1), Padding=0]
DCGAN
Construction
◦ Upsampling
◦ Transposed convolution
◦ Downsampling
◦ Replace all max pooling
with convolutional stride.
◦ Activation
◦ LeakyReLU
DCGAN - Final architecture
◦ Generator (1.28m parameters)
Layer | Out size
Input noise vector z ∈ ℝ^100 | -
Dense + Reshape, 4 x 4 x 128 + LReLU | 4 x 4 x 128
4 x 4 TranspConv + Stride(2, 2), 128 + LReLU | 8 x 8 x 128
4 x 4 TranspConv + Stride(2, 2), 128 + LReLU | 16 x 16 x 128
4 x 4 TranspConv + Stride(2, 2), 128 + LReLU | 32 x 32 x 128
4 x 4 TranspConv + Stride(2, 2), 128 + LReLU | 64 x 64 x 128
8 x 8 Conv, 3 + Tanh | 64 x 64 x 3
Output fake images ∈ ℝ^(64×64×3)

◦ Discriminator (1.18m parameters)
Layer | Out size
Input RGB image ∈ ℝ^(64×64×3) | -
3 x 3 Conv + Stride(2, 2), 128 + LReLU | 32 x 32 x 128
3 x 3 Conv + Stride(2, 2), 128 + LReLU | 16 x 16 x 128
3 x 3 Conv + Stride(2, 2), 128 + LReLU | 8 x 8 x 128
3 x 3 Conv + Stride(2, 2), 128 + LReLU | 4 x 4 x 128
3 x 3 Conv + Stride(2, 2), 128 + LReLU | 2 x 2 x 128
Flatten + Dropout | 512
Dense, 1 + Sigmoid | 1
Output probability ∈ ℝ^1
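Assuming the standard parameter formulas (k·k·c_in·c_out + c_out per convolution, i·o + o per dense layer, and reading the final layer as an 8x8 convolution to 3 RGB channels), the generator layers above add up to the quoted ~1.28m parameters:

```python
# Pure-Python check of the DCGAN generator's quoted parameter count.

def dense(i, o):
    return i * o + o

def conv(k, c_in, c_out):
    return k * k * c_in * c_out + c_out

gen = (dense(100, 4 * 4 * 128)          # Dense + Reshape to 4x4x128
       + 4 * conv(4, 128, 128)          # four 4x4 transposed convs, 128 filters
       + conv(8, 128, 3))               # final 8x8 conv to RGB

print(gen)   # 1280515, i.e. ~1.28m as quoted
```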
Hard to tune
◦ High sensitivity to hyperparameters. The performance of a GAN even varies with different random seeds. Tuning a GAN requires a lot of patience.
◦ Solutions: watching the gradients and loss changes is the most efficient way to guide tuning. Beyond that, the only thing we need to keep is our patience.
DCGAN - Not well-tuned
◦ CNN creates block artifacts in images
◦ Compared with the MLP structure, a well-tuned deep convolution structure helps the network capture finer details inside an image. However, if the convolution structure is not tuned well, the result will be worse than the simple GAN with MLP: with the convolution operation, the generated images may be cut into blocks.
◦ Fails to learn to generate dog-like images
DCGAN - Not well-tuned
◦ Even when the block problem is mitigated, DCGAN may still fail to generate target-like images; this means both the Discriminator and the Generator are learning in the wrong direction.
◦ Mode collapse
◦ Most generated images look similar → mode collapse.
◦ Although image quality improves in some cases, the mode collapse problem still exists in DCGAN.
One trick makes a great improvement
◦ Using soft and noisy labels helps a lot to stabilize GANs; without this change, our model cannot create clear images. We smooth the positive labels (e.g. to 0.9-1.0).
◦ Soft and noisy labels balance the Discriminator and the Generator in this competition: the Discriminator does not become over-confident and the Generator does not become under-confident.
◦ Note that we should smooth only one side of the labels, specifically the positive labels (Goodfellow, 2016).
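A minimal numpy sketch of the one-sided smoothing trick: real labels are drawn softly from [0.9, 1.0] while fake labels stay at a hard 0. The function names here are our own illustration, not from any library.

```python
import numpy as np

rng = np.random.default_rng(0)

def smooth_positive_labels(n):
    """One-sided label smoothing: soft 'real' labels in [0.9, 1.0)."""
    return rng.uniform(0.9, 1.0, size=n)

def fake_labels(n):
    """Fake labels are NOT smoothed: they stay at exactly 0."""
    return np.zeros(n)

real_y = smooth_positive_labels(64)
fake_y = fake_labels(64)
assert real_y.min() >= 0.9 and real_y.max() <= 1.0
assert not fake_y.any()
```

These label arrays would replace the hard 1s fed to the discriminator for real images during its training step.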
DCGAN - Performance
◦ After 5000 epochs: in the early training steps, it can generate some clear images, but they are not dog-like.
◦ After 10000 epochs: some generated images look like a dog, but most are still not dog-like.
DCGAN - Performance
◦ After 40000 epochs: it is not easy to generate clear dog-like images without mode collapse.
◦ After 60000 epochs: the improvement is not significant; this DCGAN model has almost reached its maximum performance.
DCGAN
◦ A glance at the randomly generated dogs.
Mode collapse
◦ Issue: the Generator only produces low-diversity outputs. A complete mode collapse, which is uncommon, means the Generator simply plays a trick: it creates only one type of image to fool the Discriminator. A partial mode collapse, which happens frequently, is a hard problem to solve in GANs.
◦ Solutions: mode collapse remains a difficult problem in most GANs. Nevertheless, there are ways to disperse this kind of collapse, such as the Conditional GAN (CGAN). A CGAN feeds the label of the real data into the model as a condition, so that the model is conditioned on the label. Our dog dataset contains 120 dog breeds, so we try CGAN to address the mode collapse problem.
CGAN
◦ The Conditional Generative Adversarial Network (CGAN), an extension of the GAN, allows you to generate images with specific conditions or attributes.
◦ Difference: both the generator and the discriminator of a CGAN receive some extra conditional information, such as the class of the image, a graph, some words, or a sentence.
◦ The cost function for CGAN is the same as for GAN, with the condition $y$ added:
$\min_G \max_D V(G, D) = \mathbb{E}_{x}\left[\log D(x, y)\right] + \mathbb{E}_{z}\left[\log\left(1 - D(G(z, y), y)\right)\right]$
◦ CGAN can make the generator produce different types of images, which prevents it from generating similar images after many training steps.
◦ We can steer the generator toward an image with the properties we want.
CGAN
Construction
● Upsampling
○ Combine vector
z with label y
● Downsampling
○ Expand and
reshape label y,
then combine
with image
● Activation
○ LeakyReLU
CGAN - Final architecture
● Generator
Input noise vector z ∈ ℝ^100 → Dense + Reshape, 4 x 4 x 256 + LReLU | Out size: 4 x 4 x 256
Input label (0-119) → Embedding + Dense + Reshape, 4 x 4 | Out size: 4 x 4 x 1
Merge input noise vector and input label: Concatenate | Out size: 4 x 4 x 257
4 x 4 TranspConv + Stride(2, 2), 128 + LReLU | 8 x 8 x 128
4 x 4 TranspConv + Stride(2, 2), 128 + LReLU | 16 x 16 x 128
4 x 4 TranspConv + Stride(2, 2), 128 + LReLU | 32 x 32 x 128
4 x 4 TranspConv + Stride(2, 2), 128 + LReLU | 64 x 64 x 128
8 x 8 Conv, 3 + Tanh | 64 x 64 x 3
Output fake images ∈ ℝ^(64×64×3)
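The merge step of the conditional generator can be sanity-checked with numpy shapes: a 4x4x1 label plane concatenated with the 4x4x256 noise feature map along the channel axis gives 4x4x257. The arrays here are random stand-ins for the real layer outputs.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-ins for the two branches of the conditional generator:
noise_features = rng.normal(size=(4, 4, 256))   # from Dense + Reshape on z
label_plane = rng.normal(size=(4, 4, 1))        # from Embedding + Dense + Reshape on y

# Concatenate along the channel axis, as the Concatenate layer does.
merged = np.concatenate([noise_features, label_plane], axis=-1)
assert merged.shape == (4, 4, 257)
```

The discriminator's merge works the same way, concatenating a 64x64x1 label plane with the 64x64x3 image to get 64x64x4.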
CGAN - Final architecture
● Discriminator
Input RGB image ∈ ℝ^(64×64×3) | Out size: 64 x 64 x 3
Input label (0-119) → Embedding + Dense + Reshape, 64 x 64 | Out size: 64 x 64 x 1
Merge input RGB image and input label: Concatenate | Out size: 64 x 64 x 4
3 x 3 Conv + Stride(2, 2), 128 + LReLU | 32 x 32 x 128
3 x 3 Conv + Stride(2, 2), 128 + LReLU | 16 x 16 x 128
3 x 3 Conv + Stride(2, 2), 128 + LReLU | 8 x 8 x 128
3 x 3 Conv + Stride(2, 2), 128 + LReLU | 4 x 4 x 128
3 x 3 Conv + Stride(2, 2), 128 + LReLU | 2 x 2 x 128
Flatten + Dropout | 512
Dense, 1 + Sigmoid | 1
Output probability ∈ ℝ^1
CGAN - Performance
● In the early training steps, it can generate some clear images, but they are not dog-like and show no distinct types.
● After 6,000 epochs
CGAN - Performance
● The outputs are still mostly a jumble, but some have the shape of a dog, and some columns show similar types.
● After 15,000 epochs
CGAN - Performance
● Most of the images look like a dog, and each column roughly has its own type.
● After 30,000 epochs
CGAN - Performance
● There is mode collapse in some columns.
● After 45,000 epochs
CGAN - Performance
● At this point, the generator cannot generate more dog-like images.
● There is mode collapse in each column.
● At least we can get 120 generated dogs from the CGAN.
● After 60,000 epochs
CGAN
A glance at the randomly generated dogs.
Future Improvement of CGAN
● Input the conditional data combined with image features obtained via convolution into a dense code.
● The combined data then yield a predicted probability for each class.
References
● Avinash, H. (2017). The GAN Zoo. GitHub. https://github.com/hindupuravinash/the-gan-zoo
● Amir, J. (2019). Deep-Learning. GitHub. https://github.com/amir-jafari/Deep-Learning
● Goodfellow, I. (2016). NIPS 2016 Tutorial: Generative Adversarial Networks.
● Hongyi, L. (2018). GAN Lecture 1: Introduction. YouTube. https://www.youtube.com/watch?v=DQNNMiAP5lw&list=PLJV_el3uVTsMq6JEFPW35BCiOQTsoqwNw&index=1
● Kaggle Competition. (2019). Generative Dog Images. Kaggle. https://www.kaggle.com/c/generative-dog-images
Q&A