Code for "Prediction-Powered Ranking of Large Language Models", Arxiv 2024.
ranking-algorithm
llm-eval
llm-evaluation
llm-evaluation-framework
prediction-powered-inference
rank-sets
-
Updated
May 27, 2024 - Python