Arabic-eval

ALUE_main

This folder contains the ALUE datasets and the programs that convert them into a format suitable for training and testing. See the folder itself for more details.

translate_to_review

This program translates the GPTreview results into English or Chinese.

bert-baselines

This program computes the evaluation metrics ourselves. See the folder itself for more details.
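
As a rough illustration of what computing metrics by hand can look like, the sketch below calculates accuracy and macro-averaged F1 from two label lists. It is not taken from this folder: the function names `accuracy` and `macro_f1` and the sample labels are hypothetical, and the actual scripts may use different metrics or a library.

```python
from collections import Counter


def accuracy(gold, pred):
    """Fraction of positions where the prediction equals the gold label."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    if not gold:
        return 0.0
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)


def macro_f1(gold, pred):
    """Unweighted mean of per-class F1 over every label seen in gold or pred."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1  # predicted label p was wrong
            fn[g] += 1  # gold label g was missed
    labels = set(gold) | set(pred)
    if not labels:
        return 0.0
    scores = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical labels, for illustration only.
gold_labels = ["pos", "neg", "pos", "neg"]
pred_labels = ["pos", "pos", "pos", "neg"]
acc = accuracy(gold_labels, pred_labels)
f1 = macro_f1(gold_labels, pred_labels)
```

Macro averaging weights every class equally, which is often preferred when the label distribution is skewed.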

eval_human

This program lets native speakers compare the generations of different models and judge which one is better.
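
One way such pairwise judgments could be aggregated is sketched below. This is an assumption about the workflow, not the folder's actual code: the tuple layout `(model_a, model_b, winner)`, the `"tie"` marker, and the function name `tally_preferences` are all hypothetical.

```python
from collections import Counter


def tally_preferences(judgments):
    """Count wins per model and the number of ties.

    Each judgment is a (model_a, model_b, winner) tuple, where winner
    is one of the two model names or the string "tie".
    """
    wins = Counter()
    ties = 0
    for model_a, model_b, winner in judgments:
        if winner not in (model_a, model_b, "tie"):
            raise ValueError(f"unexpected winner: {winner!r}")
        # Register both models so a model with zero wins still appears.
        wins[model_a] += 0
        wins[model_b] += 0
        if winner == "tie":
            ties += 1
        else:
            wins[winner] += 1
    return dict(wins), ties


# Hypothetical annotator judgments, for illustration only.
sample = [
    ("model_x", "model_y", "model_x"),
    ("model_x", "model_y", "model_y"),
    ("model_x", "model_y", "model_x"),
    ("model_x", "model_y", "tie"),
]
win_counts, tie_count = tally_preferences(sample)
```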

data

The translated Vicuna dataset.

gen_data

Answers to the Vicuna questions generated by three models.

gpt, GPT4-...

Alternative GPT APIs from Freedom Intelligence.

LLM

The evaluation pipeline from Freedom Intelligence.

mmlu-chatgpt

This folder contains the eval-harness library.

vicuna_eval_results

Another results folder.

AI-Initiative-KAUST/Arabic-eval
