Title generator

The goal of this project is to have an (L)LM to generate titles for conversations. For that, a language model was fine-tuned to always output a 3-5 words title summarizing the input.

Model Information

Setup

If you only want to do inference:

# 1. Clone the repo
git clone https://github.com/ogrnz/generate-title-llm.git
cd generate-title-llm

# 2. Create a python virtual env and activate it

# 3. Install transformers
pip install transformers

# 4. Use it!
python src/infer.py --message "I have troubles with my self-hosted docker setup. A traefik2 reverse proxy handles requests from the internet to my different services."

If you want to use the full project (generate new synthetic dataset, fine-tune another model, modify a script...):

# 1. Clone the repo
git clone https://github.com/ogrnz/generate-title-llm.git
cd generate-title-llm

# 2. Create a python virtual env and activate it

# 3. Install transformers
pip install -r requirements.txt

Usage

Basic

For basic inference:

python src/infer.py --message "I have troubles with my self-hosted docker setup. A traefik2 reverse proxy handles requests from the internet to my different services."

You can also pass a JSONL file with the --file_path argument.

`src/finetune.py`

# Sane default
python src/finetune.py 

# Custom
python src/finetune.py --checkpoint "google-t5/t5-small" --hf-data "ogrnz/chat-titles" --output-dir results

Custom training arguments (for example to not use fp16) can be written in a json file and loaded via the --training-config flag:

{
    "fp16": false,
}

# Custom
python src/finetune.py --training-config training_config.json

`src/infer.py`

Script used to perform inference. See Basic usage section.

`src/dataset.py`

Script used to generate the synthetic dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
src		src
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Title generator

Model Information

Setup

Usage

Basic

`src/finetune.py`

`src/infer.py`

`src/dataset.py`

About

Releases

Packages

Languages

ogrnz/generate-title-llm

Folders and files

Latest commit

History

Repository files navigation

Title generator

Model Information

Setup

Usage

Basic

src/finetune.py

src/infer.py

src/dataset.py

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

`src/finetune.py`

`src/infer.py`

`src/dataset.py`

Packages