G2T LLM Evaluation

Here you can find scripts for LLM evaluation on the WEBNLG-2020 dataset

How to Use

Run python llm_evaluator.py --llm=<NAME OF LLM> --dataset_folder=<PATH TO FOLDER WITH WEBNLG DATASET> --dataset_filename=<FILENAME OF WEBNLG DATASET> --output_path=<WHERE TO STORE GENERATED GRAPH DESCRIPTIONS> to generate graph descriptions Supported LLMs are:

llama3:8b
gemma2:9b
gpt-4o
gpt-4o-mini

Run python metrics_evaluator.py --preds_path=<PATH TO FILE WITH GRAPH DESCRIPTIONS FROM LLM> --dataset_folder=<PATH TO FOLDER WITH WEBNLG DATASET> --dataset_filename=<FILENAME OF WEBNLG DATASET> --output_path=<WHERE TO STORE DETAILED METRICS> to evaluate WEBNLG metrics and alignscore_evaluator.py for the AlignScore.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
README.md		README.md
align_score_evaluator.py		align_score_evaluator.py
llm_evaluator.py		llm_evaluator.py
metrics_evaluator.py		metrics_evaluator.py
requirements.txt		requirements.txt
webnlg_dataset_reader.py		webnlg_dataset_reader.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

G2T LLM Evaluation

How to Use

About

Releases

Packages

Languages

s-nlp/llm_g2t

Folders and files

Latest commit

History

Repository files navigation

G2T LLM Evaluation

How to Use

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages