PyTorch Dataloader and Model Training
Aykhan Mahmudov (aykhan.mahmudov.std@bhos.edu.az)
Group IT 21
03.21.2025
1 Overview
This report documents the implementation and analysis of a PyTorch dataloader and model for a card
image classification task. We worked with a dataset containing 53 different classes of playing cards,
implementing a custom dataset class, modifying a pre-trained ResNet18 model, and training the model
with different learning rates. We then analyzed how the choice of learning rate affects model convergence
and performance, finding that a moderate learning rate of 0.05 provided the best balance between training
stability and convergence speed.
2 Dataset Analysis
After examining the dataset downloaded from Kaggle, we observed the following characteristics:
1. Number of Training Images: The dataset contains a large collection of playing card images
organized by class.
2. Number of Classes: The dataset includes 53 distinct classes, representing all 52 standard playing
cards plus the joker.
3. Data Structure: Images are organized in a hierarchical structure, with one folder per class
containing that class's card images (an indexing sketch follows).
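To make this concrete, such a folder hierarchy can be walked once to build parallel lists of image paths and integer labels. The sketch below is illustrative only: the helper name build_index, the root directory, and the accepted file extensions are assumptions, not the dataset's actual loading code.

import os

# A minimal sketch of indexing the class-folder layout; build_index and the
# accepted extensions are assumptions, not the report's actual code.
def build_index(root_dir):
    image_list, label_list = [], []
    class_names = sorted(os.listdir(root_dir))  # one sub-folder per card class
    for label, class_name in enumerate(class_names):
        class_dir = os.path.join(root_dir, class_name)
        for file_name in sorted(os.listdir(class_dir)):
            if file_name.lower().endswith((".jpg", ".jpeg", ".png")):
                image_list.append(os.path.join(class_dir, file_name))
                label_list.append(label)
    return image_list, label_list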
3 Dataset Class Implementation
We implemented the __getitem__ method for the custom dataset class to properly load and process
images:
def __getitem__(self, index):
    # Get the image path and label
    image_path = self.image_list[index]
    label = self.label_list[index]

    # Load the image from disk and convert it to RGB
    image = Image.open(image_path).convert('RGB')

    # Apply transformations if available
    if self.transforms:
        image = self.transforms(image)

    return image, torch.tensor(label)
This implementation ensures proper loading of images from disk, conversion to RGB format, application of transformations, and conversion of labels to PyTorch tensors.
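To illustrate how this __getitem__ is consumed, the dataset can be wrapped in a DataLoader. In the sketch below, the class name CardDataset, its constructor signature, the "train" root path, the transform choices, and the batch size are assumptions standing in for the report's actual setup.

from torch.utils.data import DataLoader
from torchvision import transforms

# Resize to the 224x224 input that ResNet18 expects, then convert to a tensor.
train_transforms = transforms.Compose([
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
])

# CardDataset stands in for the custom dataset class above; its name and
# constructor signature are assumptions, as is the "train" root path.
image_list, label_list = build_index("train")
train_dataset = CardDataset(image_list, label_list, transforms=train_transforms)
train_loader = DataLoader(train_dataset, batch_size=32, shuffle=True, num_workers=2)

images, labels = next(iter(train_loader))
print(images.shape)   # torch.Size([32, 3, 224, 224])
print(labels.shape)   # torch.Size([32])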
4 Model Class Implementation
We modified the forward method of the ExModel class to implement the forward pass through the
network:
def forward(self, image):
    # Pass the image through the ResNet18 backbone
    features = self.resnet18(image)

    # Flatten the output to shape [batch_size, num_features]
    features = features.view(features.size(0), -1)

    # Pass the features through the classifier head
    out = self.classifier(features)

    return out
4.1 The Meaning of Forward in Model Class
The forward method in a PyTorch model class defines the computation performed at every call to the
model. It specifies how data flows through the network layers during the forward pass, transforming
input data (in this case, images) into output predictions (card classes). While PyTorch automatically
handles the backward pass for gradient computation, the forward pass must be explicitly defined.
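As a toy illustration (not the report's ExModel), the following shows that calling a module instance dispatches to forward through nn.Module.__call__, which is also where PyTorch triggers any registered hooks:

import torch
import torch.nn as nn

# A toy module: calling model(x) invokes forward() via nn.Module.__call__.
class TinyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(4, 2)

    def forward(self, x):
        return self.linear(x)

model = TinyNet()
x = torch.randn(3, 4)
out = model(x)           # preferred: dispatches to forward() and runs hooks
same = model.forward(x)  # works, but bypasses hooks; not recommended
print(out.shape)         # torch.Size([3, 2])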
4.2 Transfer Learning Significance
Transfer learning leverages knowledge gained from solving one problem and applies it to a different but
related problem. In our implementation, we used a pre-trained ResNet18 model that had already learned
feature representations from millions of images in the ImageNet dataset; a sketch of this adaptation
follows the list below. The benefits of this approach include:
1. Efficiency: Reduces training time significantly compared to training from scratch.
2. Performance: Often leads to better model performance, especially when training data is limited.
3. Feature Reuse: Lower layers of pre-trained networks have learned generic features like edges,
textures, and shapes that are transferable across image domains.
4. Optimization: Provides a better initialization point, helping to avoid poor local minima during
training.
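The sketch below mirrors the adaptation described above using a recent torchvision API. The exact replacement code is an assumption: only the overall approach (strip ResNet18's original 1000-class head and attach a new 53-class classifier) comes from the report.

import torch
import torch.nn as nn
from torchvision import models

# Load ResNet18 with ImageNet weights (torchvision >= 0.13 API assumed).
resnet18 = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)

num_features = resnet18.fc.in_features    # 512 for ResNet18
resnet18.fc = nn.Identity()               # remove the original 1000-class head

classifier = nn.Linear(num_features, 53)  # new head for the 53 card classes

x = torch.randn(1, 3, 224, 224)
features = resnet18(x)                    # shape: [1, 512]
logits = classifier(features)             # shape: [1, 53]
print(logits.shape)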
5 Model Training and Learning Rate Analysis
We trained the model using three different learning rates to analyze their impact on model performance (the optimizer setup is sketched after the list):
1. High learning rate: 1.5
2. Medium learning rate: 0.05
3. Very low learning rate: 0.0000005
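The three runs differed only in the optimizer's learning rate. The report does not name the optimizer or loss function, so the SGD optimizer, cross-entropy loss, and the stand-in linear model below are assumptions; only the three learning rates are taken from the experiments.

import torch
import torch.nn as nn
import torch.optim as optim

learning_rates = [1.5, 0.05, 0.0000005]  # high, medium, very low

for lr in learning_rates:
    model = nn.Linear(512, 53)           # stand-in for the card classifier
    optimizer = optim.SGD(model.parameters(), lr=lr)
    criterion = nn.CrossEntropyLoss()

    # One illustrative optimization step per learning rate
    inputs = torch.randn(8, 512)
    targets = torch.randint(0, 53, (8,))
    optimizer.zero_grad()
    loss = criterion(model(inputs), targets)
    loss.backward()
    optimizer.step()
    print(f"lr={lr}: loss after one step = {loss.item():.4f}")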
5.1 Training Results
Table 1: Learning Rate Comparison

Learning Rate   Training Loss         Validation Loss       F1 Score
1.5             Highly unstable       Highly unstable       Poor
0.05            Steadily decreasing   Good convergence      Best performance
0.0000005       Minimal decrease      Minimal improvement   Minimal improvement
5.2 Learning Rate Impact Analysis
1. High Learning Rate (1.5): The high learning rate caused unstable training with significant
fluctuations in loss values. The model struggled to converge, and the optimization process often
overshot optimal parameters, leading to poor performance metrics.
2. Medium Learning Rate (0.05): This learning rate provided the best results, showing a steady
decrease in training loss, good convergence on the validation set, and consistent improvement in F1
score. The optimization process remained stable while still making meaningful progress each epoch.
3. Very Low Learning Rate (0.0000005): With such a low learning rate, the model showed
minimal improvement over training epochs. Parameter updates were too small to make significant
progress within the given number of epochs, resulting in a model that was effectively under-trained.
6 Observations and Challenges
Throughout the implementation and training process, we encountered several noteworthy challenges and
learning points:
1. Dataset Loading Efficiency: Ensuring proper loading and pre-processing of images required
careful implementation to avoid memory issues.
2. Transfer Learning Adaptation: Modifying the pre-trained ResNet18 model required understanding how to properly remove the final classification layer and add a new one tailored to our
specific classification task.
3. Learning Rate Sensitivity: The experiments clearly demonstrated that model performance is
highly sensitive to learning rate selection, with orders of magnitude differences in learning rates
resulting in dramatically different training outcomes.
4. Balancing Batch Size: Finding the appropriate batch size to balance computational efficiency
with training stability was important, especially given the memory constraints of the training
environment.
5. Tensor Dimensionality: Ensuring correct tensor shapes throughout the data pipeline and model
forward pass required careful debugging and understanding of PyTorch's tensor operations (see the
sketch after this list).
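As a small illustration of that shape bookkeeping, the flattening step from the forward method can be checked in isolation. The batch size and ResNet18's 512-channel pre-head output shape below are standard conventions, not values reported above.

import torch

# Typical ResNet18 output before the classifier head: [batch, 512, 1, 1].
features = torch.randn(32, 512, 1, 1)
flat = features.view(features.size(0), -1)  # flatten all dims except batch
print(flat.shape)                           # torch.Size([32, 512])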
7 Conclusion
This project successfully implemented a complete PyTorch pipeline for image classification using transfer
learning. We completed the dataset class by implementing the __getitem__ method, modified the model
architecture to implement the forward pass, and trained the model with different learning rates.
The analysis of different learning rates clearly demonstrated the critical importance of this hyperparameter for model training success. The medium learning rate (0.05) provided the best balance between
progress speed and training stability, achieving superior performance compared to both higher and lower
learning rates.
The use of transfer learning through the pre-trained ResNet18 model proved to be an effective approach, allowing the model to leverage previously learned features while adapting to the specific characteristics of the card classification task.
This implementation and analysis provide valuable insights into the practical aspects of deep learning
model development and the impact of hyperparameter choices on training outcomes.