Distilling Knowledge from Large
Language Models
A Comparative Analysis of Three
Research Papers
Introduction: Distillation
• Large Language Models (LLMs) are powerful tools with remarkable
capabilities.
• Distillation aims to transfer knowledge from large LLMs to smaller, more
manageable models.
• Motivations for distillation:
- Efficiency: Smaller models are faster and cheaper to run.
- Accessibility: Enables the use of open-source models.
- Customization: Facilitates tailoring models for specific tasks.
• This presentation analyzes three papers that explore different approaches
to LLM distillation.
• We will also discuss the potential of these techniques for application in
LawGPT.
Paper 1: Personalized Distillation
• Title: Personalized Distillation: Empowering Open-Sourced LLMs with Adaptive
Learning for Code Generation
• Key Idea:
- Standard Distillation: LLMs generate data, and smaller models learn from it.
- Personalized Distillation: Adapts to the student model's learning progress.
• Process:
1. Student model attempts a task.
2. Evaluation and feedback are provided.
3. Teacher model refines the attempt if needed.
• Benefit: More efficient learning with less data.
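The three-step process above can be sketched as a single refinement round. This is an illustrative sketch, not the paper's implementation: `student_generate`, `evaluate`, and `teacher_refine` are hypothetical stand-ins for the student model, an automatic checker (e.g. unit tests for generated code), and the teacher LLM.

```python
# Sketch of one personalized-distillation round (hypothetical interfaces).
def personalized_distillation_round(task, student_generate, evaluate, teacher_refine):
    attempt = student_generate(task)             # 1. student attempts the task
    passed, feedback = evaluate(task, attempt)   # 2. evaluation + feedback
    if passed:
        return attempt                           # correct attempts need no refinement
    # 3. teacher refines the failed attempt, conditioned on the feedback,
    #    producing a training target tailored to this student's mistake
    return teacher_refine(task, attempt, feedback)
```

Because refinement only happens on failures, the teacher's effort (and the resulting training data) concentrates on what the student cannot yet do.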
Paper 1: Potential Application to LawGPT
• Relate personalized distillation to the LawGPT project.
• How can it improve LawGPT's training?
- LawGPT attempts to answer legal questions.
- A powerful LLM provides feedback and refines the answers.
- Focus on mistakes allows for efficient learning.
• LawGPT Application:
1. A base LawGPT model attempts to answer legal queries.
2. A more powerful LLM (e.g., GPT-4) reviews the answer, providing
corrections and detailed feedback.
3. The base LawGPT model learns from its mistakes, iteratively improving its
legal reasoning and accuracy.
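Steps 1-3 could be used to harvest fine-tuning data that focuses on LawGPT's mistakes. This is a hedged sketch under assumed interfaces: `lawgpt_answer` and `reviewer_feedback` are hypothetical stand-ins for the base model and the stronger reviewing LLM (e.g. GPT-4), not real APIs.

```python
# Collect (question, revised answer) pairs for fine-tuning (hypothetical stand-ins).
def collect_training_pairs(questions, lawgpt_answer, reviewer_feedback):
    pairs = []
    for q in questions:
        draft = lawgpt_answer(q)                        # 1. base LawGPT answers
        correct, revised = reviewer_feedback(q, draft)  # 2. stronger LLM reviews
        # 3. keep the reviewer's revision only when the draft was wrong,
        #    so fine-tuning concentrates on the model's mistakes
        if not correct:
            pairs.append((q, revised))
    return pairs
```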
Paper 2: Divide-and-Conquer Distillation
• Title: Divide-or-Conquer? Which Part Should You Distill Your LLM?
• Key Idea: Break down complex reasoning into two phases:
- Decomposition: Breaking down problems into smaller parts.
- Solving: Executing solutions to the sub-problems.
• Hypothesis (as framed by the authors):
- Decomposition is easier to distill (general problem-solving skills).
- Solving is harder to distill (requires more domain knowledge).
• Two decomposition approaches:
– Static Approach: The LLM first decomposes the entire problem into sub-problems, then
solves each one.
– Dynamic Approach: The LLM decomposes part of the problem, solves it, and uses the
solution to guide further decomposition.
• The authors chose the static approach for its clearer separation of stages, easier
implementation, and potential for future integration into dynamic processes.
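The static/dynamic contrast above can be made concrete with a minimal sketch, assuming hypothetical `decompose`, `decompose_next`, and `solve` callables (none are from the paper):

```python
# Static: decompose the whole problem upfront, then solve each part.
def static_divide_and_conquer(problem, decompose, solve):
    sub_problems = decompose(problem)            # one upfront decomposition pass
    return [solve(sp) for sp in sub_problems]    # solve each part independently

# Dynamic: each decomposition step can see the solutions produced so far.
def dynamic_divide_and_conquer(problem, decompose_next, solve):
    solutions = []
    while True:
        sp = decompose_next(problem, solutions)  # next sub-problem, given progress
        if sp is None:                           # decomposer signals completion
            return solutions
        solutions.append(solve(sp))
```

In the static variant the decomposer and solver are fully separable (and hence separately distillable); in the dynamic variant they are interleaved.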
Paper 2: Potential Application to LawGPT
• Apply the divide-and-conquer strategy to LawGPT.
- Decomposition Model: A smaller model breaks down legal questions.
- Solving Model: A larger model answers the sub-questions.
• Benefit: More efficient use of computational resources.
• LawGPT Application:
1. A smaller LawGPT model could be trained to decompose complex legal
questions into simpler sub-questions.
2. A larger, more knowledgeable LawGPT model then answers these sub-
questions.
3. This enables a modular approach, where different models handle different
aspects of legal reasoning.
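The modular pipeline in steps 1-3 might look like the following sketch. `small_decomposer`, `large_solver`, and `combine` are assumed names for the two models and an answer-aggregation step; nothing here is from an actual LawGPT codebase.

```python
# Hypothetical decomposer/solver pipeline for a legal query.
def answer_legal_query(query, small_decomposer, large_solver, combine):
    sub_questions = small_decomposer(query)                   # 1. small model decomposes
    sub_answers = [large_solver(sq) for sq in sub_questions]  # 2. large model solves each
    return combine(query, sub_answers)                        # 3. merge into one answer
```

Because each stage is a separate callable, either model can be swapped or distilled independently.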
Paper 3: Distillation with Explanations
• Title: Distillation with Explanations from Large Language Models
• Key Idea: LLM-generated explanations can aid distillation even when the LLM's
answers are incorrect.
• Observation: LLM explanations are often consistent with their (incorrect) answers.
• Method:
– Combine ground truth labels with LLM-generated explanations to train a smaller model.
– LLM explanations have value even if the answer is wrong because they show the model's
reasoning.
– Smaller models can learn valuable reasoning steps from these explanations.
– By combining these explanations with correct labels, we can train models to be both accurate
and capable of reasoning.
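The method above amounts to building training targets that keep the LLM's reasoning but substitute the verified label. A minimal sketch, with illustrative field names and target format that are not from the paper:

```python
# Pair the question with the gold label plus the LLM's explanation,
# discarding the LLM's own (possibly wrong) answer.
def build_distillation_example(question, gold_label, llm_explanation):
    return {
        "input": question,
        # target combines the LLM's reasoning with the *correct* label,
        # even when the LLM's original answer disagreed with gold_label
        "target": f"{llm_explanation}\nAnswer: {gold_label}",
    }
```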
• Challenge: LLMs can be incorrect:
- LLMs have shown impressive capabilities in language tasks.
- However, they can generate incorrect or inaccurate answers.
- Noisy data from incorrect answers can negatively affect model training.
Paper 3: Potential Application to LawGPT
• Use LLMs to generate explanations for legal concepts, even with occasional errors.
• Train LawGPT using a combination of:
- LLM-generated explanations.
- Ground truth labels.
• Benefit: Leverage LLM reasoning while maintaining accuracy.
• LawGPT Application:
1. When training LawGPT, use a larger LLM to generate explanations for its
answers.
2. Combine these explanations with a dataset of legal questions and verified
correct answers.
3. This helps LawGPT learn to provide both accurate answers and sound legal
reasoning.
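Steps 1-3 could be realized as a dataset-augmentation pass over a verified legal QA corpus. This is a sketch under assumptions: `explain` stands in for the larger LLM's explanation call, and the record fields are illustrative.

```python
# Augment verified legal QA pairs with LLM-generated explanations (hypothetical interface).
def augment_legal_dataset(qa_pairs, explain):
    augmented = []
    for question, verified_answer in qa_pairs:
        explanation = explain(question)   # 1. larger LLM explains its reasoning
        augmented.append({                # 2. pair explanation with the verified answer
            "question": question,
            "answer": verified_answer,
            "explanation": explanation,
        })
    return augmented                      # 3. fine-tune LawGPT on answers + reasoning
```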