Genaitable

Uploaded by venkatesh k

Here's a comprehensive table that breaks down the concepts, algorithms, and libraries used across Generative AI (GenAI) domains such as text generation, image generation, video generation, speech synthesis, and multi-modal systems. Each row covers one GenAI application or domain and lists its key algorithms, popular libraries or models, and common use cases.

| GenAI Concept | Key Algorithms/Models | Popular Libraries/Frameworks | Primary Use Cases |
|---|---|---|---|
| Text Generation (LLMs) | Transformer (GPT, BERT, T5, LLaMA); Fine-tuning and RLHF (Reinforcement Learning from Human Feedback) | Hugging Face Transformers; OpenAI GPT; DeepSpeed; LangChain | Chatbots; Content creation; Text summarization; Question answering |
| Text-to-Text Transformation | Seq2Seq (T5, BART); Transformers with pre- and post-processing | Hugging Face Transformers; spaCy; NLTK | Translation; Paraphrasing; Summarization |
| Image Generation | Diffusion Models (Stable Diffusion, DALL-E); GANs (StyleGAN, BigGAN); Variational Autoencoders (VAEs); Transformer-based (VQ-VAE, VQ-GAN) | Hugging Face Diffusers; PyTorch; TensorFlow; StyleGAN2 | Art and illustration; Image inpainting; Super-resolution |
| Video Generation | 3D/Spatio-temporal GANs (MoCoGAN); Temporal Diffusion Models; Transformers for video (TimeSformer) | PyTorch3D; Deep Video Prior; Hugging Face Transformers | Video synthesis; Animation; Scene reconstruction |
| Audio Generation and Speech Synthesis | Tacotron; WaveNet; GAN-TTS; Diffusion-based audio models | PyTorch; Hugging Face Transformers; Google TTS | Text-to-speech; Audiobook narration; Voice cloning |
| Speech-to-Text | Convolutional Recurrent (DeepSpeech); Transformer-based (Wav2Vec 2.0, Whisper) | Hugging Face Transformers; SpeechBrain; OpenAI Whisper | Transcription; Voice assistants; Real-time speech processing |
| Image Captioning | Image Encoder + Text Decoder (CLIP, Flamingo); CNN-RNN hybrids; Vision Transformers with text output (ViT-GPT) | Hugging Face Transformers; OpenAI CLIP; PyTorch | Captioning for accessibility; Social media automation; Image indexing |
| Speech Recognition and Audio Classification | CNNs + RNNs (e.g., LSTM for audio sequences); Transformers (Audio Spectrogram Transformers) | PyTorch; Librosa; SpeechBrain | Audio analysis; Sentiment analysis; Transcription and voice analysis |
| Multi-modal Models (Image + Text) | CLIP (Contrastive Language–Image Pretraining); Flamingo; Unified Transformer models (e.g., OFA, BLIP) | Hugging Face Transformers; OpenAI CLIP; PyTorch | Visual question answering; Image-text retrieval; Enhanced search engines |
| Text-to-Image Generation (Text Prompts) | Diffusion Models (Stable Diffusion, DALL-E); GANs with text-conditioning (AttnGAN); Variational Autoencoders (VQ-VAE) | Hugging Face Diffusers; DALL-E mini; PyTorch | Illustration generation; Custom art; Concept visualization |
| Text-to-Video Generation | Temporal Diffusion Models; GANs for video (TGAN, MoCoGAN); Transformers (VideoGPT) | Video Diffusion models; PyTorch3D; Hugging Face Transformers | Marketing videos; Video synthesis; Storytelling |
| Text-to-3D Object Generation | 3D GANs (GANcraft, 3DGAN); Neural Radiance Fields (NeRF); Diffusion-based 3D synthesis | PyTorch3D; NVIDIA NeRF; Blender | 3D model generation; Game assets; AR/VR applications |
| Reinforcement Learning-based Generation | Q-learning; Actor-Critic methods (PPO, SAC); Multi-agent RL | Stable Baselines3; RLlib; OpenAI Gym | Game AI; Robotics control; Autonomous agents |
| Knowledge Retrieval and Augmentation (RAG) | Dense Passage Retrieval (DPR); Retrieval-Augmented Transformers (RAG); BM25 for retrieval | Haystack; Hugging Face Transformers; Pyserini | Document-based question answering; Knowledge-grounded chatbots; Real-time information lookup |
| Personalized Recommendations | Collaborative Filtering; Matrix Factorization (SVD, NMF); Deep Learning (NARRE, GRU4Rec) | TensorFlow; PyTorch; LightFM | Content recommendation; Product suggestions; Social media feeds |
| Large Language Model (LLM) Tuning and Optimization | RLHF; Transfer Learning; Data Augmentation (Backtranslation) | Hugging Face Transformers; DeepSpeed; OpenAI Gym | Fine-tuning for chatbots; Specialized task LLMs; Bias mitigation |

Notes on Key Components and Terminologies

• LLMs (Large Language Models): Foundation models such as GPT-4 and T5 support generative tasks, including text generation, chatbots, and question answering, by learning language patterns and semantics from a large text corpus. Encoder-only models such as BERT are pretrained on similar corpora but are used mainly for understanding tasks rather than free-form generation.

• Diffusion Models: Popular for high-quality image and video generation, these models start from random noise and denoise it step by step until a detailed image or video frame emerges.

• GANs (Generative Adversarial Networks): Used for images, audio, and even video, GANs train two networks adversarially: a generator that produces samples and a discriminator that tries to tell them apart from real data. The competition pushes the generator toward high-fidelity, realistic outputs.

• Transformers for Vision and Video: With the development of Vision Transformers (ViT) and TimeSformer, transformers have expanded into visual domains. ViT was introduced for image classification and TimeSformer for video classification, where attention must capture spatial and temporal structure; transformer-based generators such as VideoGPT extend the approach to video generation.

• Speech Models: Tacotron and WaveNet are prominent for text-to-speech, while Wav2Vec 2.0 and OpenAI's Whisper have transformed speech-to-text tasks.

• CLIP and Multi-modal Models: CLIP and other multi-modal models process and align both text and image data, enabling applications such as image-text retrieval, visual question answering, and image captioning.

• RAG (Retrieval-Augmented Generation): Integrates retrieval methods with generative models for tasks that must be grounded in external documents or databases, such as question answering over a specific knowledge base.
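
The RAG note above can be made concrete with a small retrieve-then-prompt sketch. This is an illustration under stated assumptions, not how Haystack or Pyserini implement it: the `BM25` class, `build_prompt`, the `DOCS` corpus, and the parameter values `k1=1.5` and `b=0.75` are all hypothetical choices made for this example, and the final generation step (sending the prompt to a language model) is left out.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on non-alphanumeric characters (deliberately naive)."""
    return "".join(c.lower() if c.isalnum() else " " for c in text).split()


class BM25:
    """Tiny in-memory BM25 ranker; k1 and b are illustrative defaults."""

    def __init__(self, docs, k1=1.5, b=0.75):
        self.docs = docs
        self.k1, self.b = k1, b
        self.tokens = [tokenize(d) for d in docs]
        self.tf = [Counter(t) for t in self.tokens]
        self.avgdl = sum(len(t) for t in self.tokens) / len(docs)
        # Document frequency: in how many documents each term occurs.
        df = Counter(term for t in self.tokens for term in set(t))
        n = len(docs)
        # Non-negative IDF variant: log(1 + (N - df + 0.5) / (df + 0.5)).
        self.idf = {term: math.log(1 + (n - f + 0.5) / (f + 0.5)) for term, f in df.items()}

    def score(self, query, i):
        """BM25 score of document i for the query."""
        doc_len = len(self.tokens[i])
        total = 0.0
        for term in tokenize(query):
            f = self.tf[i].get(term, 0)
            if f == 0:
                continue
            norm = f + self.k1 * (1 - self.b + self.b * doc_len / self.avgdl)
            total += self.idf[term] * f * (self.k1 + 1) / norm
        return total

    def top_k(self, query, k=2):
        """Return the k highest-scoring documents."""
        ranked = sorted(range(len(self.docs)), key=lambda i: self.score(query, i), reverse=True)
        return [self.docs[i] for i in ranked[:k]]


def build_prompt(question, passages):
    """Place retrieved passages ahead of the question so the generator can ground its answer."""
    context = "\n".join(f"[{n}] {p}" for n, p in enumerate(passages, 1))
    return f"Answer using only the context below.\n\nContext:\n{context}\n\nQuestion: {question}\nAnswer:"


# Hypothetical three-document knowledge base.
DOCS = [
    "Whisper is a speech-to-text model released by OpenAI.",
    "StyleGAN is a GAN architecture for image generation.",
    "BM25 is a ranking function used for keyword retrieval.",
]

retriever = BM25(DOCS)
question = "Which model does speech-to-text?"
prompt = build_prompt(question, retriever.top_k(question, k=1))
```

In a full pipeline, `prompt` would be passed to whichever generative model is in use; swapping BM25 for a dense retriever such as DPR changes only the `top_k` step.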
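
The step-by-step denoising described in the Diffusion Models note can be sketched on a toy 3-number "image". The sketch assumes a linear noise schedule over 1000 steps and a deterministic (DDIM-style) reverse update. It also replaces the trained noise-prediction network with a hypothetical `oracle_noise_predictor` that is handed the clean data, so it shows only the mechanics of the loop, not a working generator.

```python
import math
import random

T = 1000
# Linear schedule of per-step noise variances (an assumed, commonly used choice).
betas = [1e-4 + (0.02 - 1e-4) * t / (T - 1) for t in range(T)]

# alpha_bar[t] is the fraction of signal variance left after t noising steps;
# alpha_bar[0] = 1 means no noise has been added yet.
alpha_bar = [1.0]
for beta in betas:
    alpha_bar.append(alpha_bar[-1] * (1.0 - beta))


def add_noise(x0, t, eps):
    """Forward process in closed form: jump from clean data straight to noise level t."""
    a = alpha_bar[t]
    return [math.sqrt(a) * x + math.sqrt(1 - a) * e for x, e in zip(x0, eps)]


def oracle_noise_predictor(x_t, t, x0):
    """Stand-in for a trained network: recovers the noise exactly because it is given x0."""
    a = alpha_bar[t]
    return [(xt - math.sqrt(a) * x) / math.sqrt(1 - a) for xt, x in zip(x_t, x0)]


def denoise(x_T, x0_for_oracle):
    """Deterministic reverse loop from t = T down to t = 0."""
    x = x_T
    for t in range(T, 0, -1):
        eps_hat = oracle_noise_predictor(x, t, x0_for_oracle)
        a, a_prev = alpha_bar[t], alpha_bar[t - 1]
        # Estimate the clean sample implied by the predicted noise.
        x0_hat = [(xi - math.sqrt(1 - a) * e) / math.sqrt(a) for xi, e in zip(x, eps_hat)]
        # Re-noise that estimate to the slightly lower noise level t - 1.
        x = [math.sqrt(a_prev) * c + math.sqrt(1 - a_prev) * e for c, e in zip(x0_hat, eps_hat)]
    return x


random.seed(0)
x0 = [0.5, -1.0, 2.0]                       # toy "image"
eps = [random.gauss(0.0, 1.0) for _ in x0]  # Gaussian noise
x_T = add_noise(x0, T, eps)                 # almost pure noise at t = T
restored = denoise(x_T, x0)
```

With a real model, the oracle would be a neural network trained to predict `eps` from `x_t` and `t` alone, and sampling would start from fresh Gaussian noise rather than from a noised copy of known data.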
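
The text-image alignment mentioned in the CLIP note can be sketched as a contrastive objective over a similarity matrix. The embeddings below are hand-written placeholders rather than outputs of real CLIP encoders, the temperature is fixed at an assumed 0.07 instead of being learned, and the function names are hypothetical.

```python
import math


def normalize(v):
    """Scale a vector to unit length so dot products become cosine similarities."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]


def similarity_matrix(image_embs, text_embs, temperature=0.07):
    """Row i, column j holds the scaled cosine similarity of image i and text j."""
    imgs = [normalize(v) for v in image_embs]
    txts = [normalize(v) for v in text_embs]
    return [[sum(a * b for a, b in zip(i, t)) / temperature for t in txts] for i in imgs]


def cross_entropy(logits_row, target):
    """Negative log-softmax of the target entry, computed stably."""
    m = max(logits_row)
    log_sum = m + math.log(sum(math.exp(l - m) for l in logits_row))
    return log_sum - logits_row[target]


def contrastive_loss(logits):
    """Symmetric loss: each image should pick its own text, and each text its own image."""
    n = len(logits)
    image_to_text = sum(cross_entropy(logits[i], i) for i in range(n)) / n
    columns = [list(col) for col in zip(*logits)]
    text_to_image = sum(cross_entropy(columns[j], j) for j in range(n)) / n
    return (image_to_text + text_to_image) / 2


# Placeholder embeddings: image i is constructed to match text i.
image_embs = [[0.9, 0.1, 0.0], [0.0, 1.0, 0.2], [0.1, 0.0, 0.8]]
text_embs = [[1.0, 0.0, 0.1], [0.1, 0.9, 0.0], [0.0, 0.2, 1.0]]

logits = similarity_matrix(image_embs, text_embs)
# Image-to-text retrieval: index of the best-matching text for each image.
best_caption = [max(range(len(row)), key=row.__getitem__) for row in logits]

aligned_loss = contrastive_loss(logits)
# Reversing the texts mismatches two of the three pairs, so the loss should rise.
shuffled_loss = contrastive_loss(similarity_matrix(image_embs, text_embs[::-1]))
```

Training lowers this loss by moving matching image and text embeddings together; at inference the same similarity matrix drives image-text retrieval.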

This table and guide should serve as a starting point across the spectrum of Generative AI, offering insight into the methods and tools used to build diverse applications, from creative content generation to personalized recommendations and beyond.
