An image classification project leveraging the Transformer architecture from Large Language Models (LLMs).
- Multiple LLM Architectures: BERT, ELECTRA, GPT-2, RoBERTa, T5
- Diverse Feature Extractors: VGG, ResNet, DenseNet, EfficientNet, MobileNet, ConvNeXt
- Multi-Scale Feature Extraction: Extract image features at various scales for enhanced performance
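One way to read "multi-scale feature extraction" is pooling a CNN feature map at several spatial resolutions and concatenating the results into one token sequence. The sketch below illustrates that idea in plain NumPy; the channel count, scales, and pooling scheme are illustrative assumptions, not the repository's actual implementation.

```python
import numpy as np

def avg_pool2d(x, k):
    """Non-overlapping k x k average pooling over the last two axes of (C, H, W)."""
    C, H, W = x.shape
    return x[:, :H - H % k, :W - W % k].reshape(C, H // k, k, W // k, k).mean(axis=(2, 4))

# Hypothetical CNN feature map: 64 channels over an 8 x 8 spatial grid
feat = np.random.default_rng(0).normal(size=(64, 8, 8))

# Pool at several scales, flatten each grid into tokens, and concatenate:
# 8x8 = 64 tokens, 4x4 = 16 tokens, 2x2 = 4 tokens -> 84 tokens of dim 64
tokens = np.concatenate(
    [avg_pool2d(feat, k).reshape(64, -1).T for k in (1, 2, 4)], axis=0
)
print(tokens.shape)  # (84, 64)
```

The coarser scales summarize larger receptive fields, so the Transformer sees both fine local detail and global context in a single sequence.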
```
├── models/
│   ├── bert.py                 # BERT-based image classification model
│   ├── electra.py              # ELECTRA-based image classification model
│   ├── gpt.py                  # GPT-2-based image classification model
│   ├── roberta.py              # RoBERTa-based image classification model
│   ├── t5.py                   # T5-based image classification model
│   └── feature_extractor.py    # CNN-based feature extractors
```
- Extract image features with a CNN-based feature extractor
- Project the extracted features into the Transformer embedding dimension
- Process the projected features through the LLM Transformer architecture
- Perform final classification with an MLP head
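The four steps above can be sketched end to end. Everything here is a minimal NumPy stand-in: the shapes, projection, single-head self-attention layer, and MLP head are illustrative assumptions, not the repository's actual CNN backbones or HuggingFace LLM models.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

# Hypothetical CNN output: batch of feature maps (B, C, H, W)
B, C, H, W = 2, 64, 7, 7
d_model, n_classes = 32, 10
feats = rng.normal(size=(B, C, H, W))

# Step 1-2: flatten the spatial grid into tokens, then project
# each token into the Transformer embedding dimension
tokens = feats.reshape(B, C, H * W).transpose(0, 2, 1)   # (B, H*W, C)
W_proj = rng.normal(size=(C, d_model)) * 0.02
x = tokens @ W_proj                                      # (B, H*W, d_model)

# Step 3: one single-head self-attention layer as a stand-in
# for the full LLM Transformer stack
Wq, Wk, Wv = (rng.normal(size=(d_model, d_model)) * 0.02 for _ in range(3))
q, k, v = x @ Wq, x @ Wk, x @ Wv
attn = softmax(q @ k.transpose(0, 2, 1) / np.sqrt(d_model))
x = x + attn @ v                                         # residual connection

# Step 4: mean-pool the tokens and classify with a small MLP head
pooled = x.mean(axis=1)                                  # (B, d_model)
W1 = rng.normal(size=(d_model, d_model)) * 0.02
W2 = rng.normal(size=(d_model, n_classes)) * 0.02
logits = np.maximum(pooled @ W1, 0) @ W2                 # (B, n_classes)
print(logits.shape)
```

In the actual models, step 3 would be a pretrained encoder (BERT, ELECTRA, RoBERTa, T5) or decoder (GPT-2) stack rather than this single attention layer.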