Authors: Danilo Croce & Claudiu Daniel Hromei
This repository hosts materials from the CLiC-IT 2023 tutorial, whose objectives are to:
- Introduce Transformer-based architectures, including encoder-decoder, encoder-only, and decoder-only structures.
- Demonstrate fine-tuning of Large Language Models (LLMs) on diverse datasets in a multi-task framework.
- Utilize Low-Rank Adaptation (LoRA) for sustainable and efficient tuning on "modest" hardware (e.g., a single GPU with 16GB of memory).
The repository includes code for fine-tuning a Large Language Model (based on LLaMA) with instructions to solve all the tasks from EVALITA 2023. In particular, this tutorial shows how to encode data from the different tasks into specific prompts and how to fine-tune the LLM using QLoRA. The code can also be run in Google Colab on an NVIDIA T4 GPU with 15GB of memory.
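As a minimal illustration of the prompt-encoding idea, a task example can be verbalized into an instruction/input/response prompt that the LLM learns to complete. The instruction wording, field layout, and the sentiment task below are illustrative assumptions, not the exact format used by ExtremITA:

```python
from typing import Optional

# Sketch of encoding a task example into an instruction prompt.
# The instruction text and prompt layout are illustrative assumptions,
# not the exact format used in this repository.
def encode_example(task_instruction: str, input_text: str, target: Optional[str] = None) -> str:
    """Build a prompt; the target is appended only for training examples."""
    prompt = (
        f"### Instruction:\n{task_instruction}\n\n"
        f"### Input:\n{input_text}\n\n"
        f"### Response:\n"
    )
    if target is not None:
        prompt += target
    return prompt

# Hypothetical sentiment-classification example in EVALITA style.
print(encode_example(
    task_instruction="Classify the sentiment of the following Italian text as positive or negative.",
    input_text="Che bella giornata!",
    target="positive",
))
```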
The code is heavily based on that of the ExtremITA system, which participated in EVALITA 2023:
The overall process is divided into four steps:
- Step 1 - Encoding the data: shows how to encode data from an EVALITA task into prompts for the LLM (as sketched above)
- Step 2 - Fine-tuning the LLaMA model: shows how to fine-tune the LLM on the generated prompts (see the sketch after this list)
- Step 3 - Inference: generating answers: shows how to use the fine-tuned model to generate answers
- Step 4 - Decoding the data: shows how to convert the generated answers into the format required for evaluation in the EVALITA challenge
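For context, the sketch below shows one common way to set up QLoRA-style training with the Hugging Face `transformers`, `peft`, and `bitsandbytes` libraries: the base model is loaded in 4-bit and LoRA adapters are attached, so only a small number of parameters are trained. The model id and hyperparameters are illustrative assumptions, not necessarily the settings used in this repository:

```python
# Sketch of a QLoRA setup: 4-bit quantized base model plus LoRA adapters.
# Model id, rank, alpha, and target modules are illustrative, not the repo's settings.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

base_model = "meta-llama/Llama-2-7b-hf"  # placeholder model id

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                    # 4-bit quantization (the "Q" in QLoRA)
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

tokenizer = AutoTokenizer.from_pretrained(base_model)
model = AutoModelForCausalLM.from_pretrained(
    base_model,
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=8,                                  # low-rank dimension of the adapters
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    bias="none",
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()        # only the LoRA adapters are trainable
```

The resulting `model` can then be trained on the encoded prompts with a standard causal language modeling objective.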
The repository also features tutorial slides (LINK).
For queries or suggestions, raise an Issue in this repository or email croce@info.uniroma2.it or hromei@ing.uniroma2.it.