faithfulness

Here are 9 public repositories matching this topic...

pkuserc / ChatGPT_for_IE

Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness

performance evaluation information-extraction calibration named-entity-recognition event-detection event-extraction relation-extraction entity-typing relation-classification explainability large-language-models chatgpt faithfulness

Updated Aug 17, 2024
Python

MinhVuong2000 / LLMReasonCert

Official implementation of the ACL 2024 paper "Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs" (https://arxiv.org/abs/2402.11199).

framework evaluation knowledge-graph reasoning evaluation-framework llms faithfulness

Updated Jul 27, 2024
Python

khuangaf / CHOCOLATE

Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"

factuality faithfulness large-vision-language-models chart-understanding chart-captioning chart-summarization

Updated Jun 5, 2024
Jupyter Notebook

YisongMiao / DiSQ-Score

The dataset and official implementation for "Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understanding of Discourse Relations" (ACL 2024)

evaluation discourse language-model faithfulness socratic-method

Updated Aug 7, 2024
Python

The corresponding code from our paper "Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning". Do not hesitate to open an issue if you run into any trouble!

nlp reasoning faithfulness chain-of-thought-reasoning

Updated Sep 6, 2024

KomeijiForce / Active_Passive_Constraint_Koishiday_2024

[NeurIPS 2024] An advanced persona-driven role-playing system with global faithfulness quantification and optimization. In memory of Koishi's Day 2024.

role-playing metrics global-optimization quantification factuality-checking faithfulness komeiji ai-character