Skip to content

tigerchen52/awesome_role_of_small_models

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

51 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

The Role of Small Models

Awesome PDF GitHub License

This work is ongoing, and we welcome any comments or suggestions.

Please feel free to reach out if you find we have overlooked any relevant papers.

What is the Role of Small Models in the LLM Era: A Survey

Lihu Chen1   Gaël Varoquaux2  

1 Imperial College London, UK    2 Soda, Inria Saclay, France   



Content List

Collaboration

SMs Enhance LLMs

Data Curation

Curating pre-training data

Title Topic Venue Code
Data selection for language models via importance resampling Data Selection PDF Badge PDF Badge
When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale Data Selection PDF Badge
CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data Data Selection PDF Badge PDF Badge
QuRating: Selecting High-Quality Data for Training Language Models Data Selection PDF Badge PDF Badge
DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining Data Reweighting PDF Badge PDF Badge

Curating Instruction-tuning Data

Title Topic Venue Code
MoDS: Model-oriented Data Selection for Instruction Tuning Data Selection PDF Badge PDF Badge
LESS: Selecting Influential Data for Targeted Instruction Tuning Data Selection PDF Badge PDF Badge
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning Data Selection PDF Badge PDF Badge

Weak-to-Strong Paradigm

Using weaker (smaller) models to align stronger (larger) models

Title Topic Venue Code
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision Weak-to-Strong PDF Badge PDF Badge
Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models Weak-to-Strong PDF Badge PDF Badge
Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts Weak-to-Strong PDF Badge PDF Badge
Improving Weak-to-Strong Generalization with Reliability-Aware Alignment Weak-to-Strong PDF Badge PDF Badge
Aligner: Efficient Alignment by Learning to Correct Weak-to-Strong PDF Badge PDF Badge
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models Weak-to-Strong PDF Badge PDF Badge
Theoretical Analysis of Weak-to-Strong Generalization Weak-to-Strong PDF Badge

Efficient Inference

Ensembling different-size models to reduce inference costs

Title Topic Venue Code
Efficient Edge Inference by Selective Query Model Cascading PDF Badge PDF Badge
FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance Model Cascading PDF Badge
Data Shunt: Collaboration of Small and Large Models for Lower Costs and Better Performance Model Cascading PDF Badge PDF Badge
AutoMix: Automatically Mixing Language Models Model Cascading PDF Badge PDF Badge
FrugalML: How to use ML Prediction APIs more accurately and cheaply Model Cascading PDF Badge PDF Badge
Model Cascading: Towards Jointly Improving Efficiency and Accuracy of NLP Systems Model Cascading PDF Badge
Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models Model Routing PDF Badge
Tryage: Real-time, intelligent Routing of User Prompts to Large Language Models Model Routing PDF Badge
OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking Model Routing PDF Badge
RouteLLM: Learning to Route LLMs with Preference Data Model Routing PDF Badge PDF Badge
Fly-Swat or Cannon? Cost-Effective Language Model Choice via Meta-Modeling Model Routing PDF Badge PDF Badge
Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing Model Routing PDF Badge PDF Badge
LLM-BLENDER: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion Model Routing PDF Badge PDF Badge
RouterBench: A Benchmark for Multi-LLM Routing System Model Routing PDF Badge PDF Badge
Large Language Model Routing with Benchmark Datasets Model Routing PDF Badge

Speculative Decoding

Title Topic Venue Code
Fast Inference from Transformers via Speculative Decoding Speculative Decoding PDF Badge PDF Badge
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding Speculative Decoding PDF Badge PDF Badge
Accelerating Large Language Model Decoding with Speculative Sampling Speculative Decoding PDF Badge PDF Badge

Evaluating LLMs

Using SMs to evaluate LLM's generations

Title Topic Venue Code
BERTScore: Evaluating Text Generation with BERT General Evaluation PDF Badge PDF Badge
BARTScore: Evaluating Generated Text as Text Generation General Evaluation PDF Badge PDF Badge
Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation Uncertainty PDF Badge PDF Badge
Selfcheckgpt: Zero-resource black-box hallucination detection for generative large language models Uncertainty PDF Badge PDF Badge
ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models Performance Prediction PDF Badge PDF Badge

Domain Adaptation

Using domain-specific SMs to adjust token probability of LLMs at decoding time

Title Topic Venue Code
CombLM: Adapting Black-Box Language Models through Small Fine-Tuned Models White-box Domain Adaptation PDF Badge
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning White-box Domain Adaptation PDF Badge PDF Badge
Tuning Language Models by Proxy White-box Domain Adaptation PDF Badge PDF Badge

Using domain-specific SMs to generate knowledge for LLMs at reasoning time

Title Topic Venue Code
Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models Black-box Domain Adaptation PDF Badge PDF Badge
BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models Black-box Domain Adaptation PDF Badge

Retrieval Augmented Generation

Using SMs to retrieve knowledge for enhancing generations:

Title Topic Venue Code
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks Documents PDF Badge
KnowledGPT: Enhancing Large Language Models with Retrieval and Storage Access on Knowledge Bases Knowledge Bases PDF Badge
End-to-End Table Question Answering via Retrieval-Augmented Generation Tables PDF Badge
DocPrompting: Generating Code by Retrieving the Docs Codes PDF Badge PDF Badge
Toolformer: Language Models Can Teach Themselves to Use Tools Tools PDF Badge PDF Badge
Retrieval-Augmented Multimodal Language Modeling Images PDF Badge

Prompt-based Reasoning

Using SMs to augment prompts for LLMs

Title Topic Venue Code
UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation Retrieving Prompts PDF Badge PDF Badge
Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning Decomposing Complex Problems PDF Badge PDF Badge
Small Models are Valuable Plug-ins for Large Language Models Generating Pseudo Labels PDF Badge PDF Badge
Can Small Language Models Help Large Language Models Reason Better?: LM-Guided Chain-of-Thought Generating Pseudo Labels PDF Badge
CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation Generating Feedback PDF Badge
Small Language Models Improve Giants by Rewriting Their Outputs Generating Feedback PDF Badge PDF Badge

Deficiency Repair

Developing SM plugins to repair deficiencies:

Title Topic Venue Code
Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector Hallucinations PDF Badge PDF Badge
Reconfidencing LLMs from the Grouping Loss Perspective Hallucinations PDF Badge
Imputing Out-of-Vocabulary Embeddings with LOVE Makes LanguageModels Robust with Little Cost Out-Of-Vocabulary Words PDF Badge PDF Badge

Contrasting LLMs and SMs for better generations:

Title Topic Venue Code
Contrastive Decoding: Open-ended Text Generation as Optimization Reducing Repeated Texts PDF Badge PDF Badge
Alleviating Hallucinations of Large Language Models through Induced Hallucinations Mitigating Hallucinations PDF Badge
Contrastive Decoding Improves Reasoning in Large Language Models Augmenting Reasoning Capabilities PDF Badge
CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following Safeguarding Privacy PDF Badge

LLMs Enhance SMs

Knowledge Distillation

Black-box Distillation:

Title Topic Venue Code
Explanations from Large Language Models Make Small Reasoners Better Chain-Of-Thought Distillation PDF Badge
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes Chain-Of-Thought Distillation PDF Badge PDF Badge
Distilling Reasoning Capabilities into Smaller Language Models Chain-Of-Thought Distillation PDF Badge PDF Badge
Teaching Small Language Models to Reason Chain-Of-Thought Distillation PDF Badge
Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think" Step-by-Step Chain-Of-Thought Distillation PDF Badge PDF Badge
Specializing Smaller Language Models towards Multi-Step Reasoning Chain-Of-Thought Distillation PDF Badge
TinyLLM: Learning a Small Student from Multiple Large Language Models Chain-Of-Thought Distillation PDF Badge
Lion: Adversarial Distillation of Proprietary Large Language Models Instruction Following Distillation PDF Badge PDF Badge
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning Instruction Following Distillation PDF Badge PDF Badge

White-box Distillation:

Title Topic Venue Code
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter Logits PDF Badge
ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers Intermediate Features PDF Badge PDF Badge
Less is More: Task-aware Layer-wise Distillation for Language Model Compression Intermediate Features PDF Badge PDF Badge
MiniLLM: Knowledge Distillation of Large Language Models Intermediate Features PDF Badge PDF Badge
LLM-QAT: Data-Free Quantization Aware Training for Large Language Models Intermediate Features PDF Badge

Data Synthesis

Data Augmentation:

Title Topic Venue Code
Improving data augmentation for low resource speech-to-text translation with diverse paraphrasing Text Paraphrase PDF Badge
Paraphrasing with Large Language Models Text Paraphrase PDF Badge
Query Rewriting for Retrieval-Augmented Large Language Models Query Rewriting PDF Badge PDF Badge
LLMvsSmall Model? Large Language Model Based Text Augmentation Enhanced Personality Detection Model Specific Tasks PDF Badge
Data Augmentation for Intent Classification with Off-the-shelf Large Language Models Specific Tasks PDF Badge PDF Badge
Weakly Supervised Data Augmentation Through Prompting for Dialogue Understanding Specific Tasks PDF Badge

Training Data Generation:

Title Topic Venue Code
Want To Reduce Labeling Cost? GPT-3 Can Help Label Annotation PDF Badge
Self-Guided Noise-Free Data Generation for Efficient Zero-Shot Learning Label Annotation PDF Badge
ZeroGen: Efficient Zero-shot Learning via Dataset Generation Dataset Generation PDF Badge PDF Badge
Generating Training Data with Language Models: Towards Zero-Shot Language Understanding Dataset Generation PDF Badge PDF Badge
Increasing Diversity While Maintaining Accuracy: Text Data Generation with Large Language Models and Human Interventions Dataset Generation PDF Badge
Synthetic Data Generation with Large Language Models for Text Classification: Potential and Limitations Dataset Generation PDF Badge
Does Synthetic Data Generation of LLMs Help Clinical Text Mining? Dataset Generation PDF Badge
Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction Dataset Generation PDF Badge PDF Badge
ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection Dataset Generation PDF Badge PDF Badge

Competition

Computation-constrained Environment

Task-specific Environment

Interpretability-required Environment

Citation

@misc{chen2024rolesmallmodelsllm,
      title={What is the Role of Small Models in the LLM Era: A Survey}, 
      author={Lihu Chen and Gaël Varoquaux},
      year={2024},
      eprint={2409.06857},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2409.06857}, 
}

About

a curated list of the role of small models in the LLM era

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages