Skip to content

Commit

Permalink
Fix stray tag
Browse files Browse the repository at this point in the history
  • Loading branch information
mjpost committed Nov 1, 2024
1 parent 5378880 commit cd8314a
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion data/xml/2024.findings.xml
Original file line number Diff line number Diff line change
Expand Up @@ -21760,7 +21760,7 @@
<author><first>Xingdi</first><last>Yuan</last><affiliation>Microsoft Research</affiliation></author>
<author><first>Bang</first><last>Liu</last><affiliation>University of Montreal</affiliation></author>
<pages>3534-3568</pages>
<abstract>Large language models (LLMs), trained on vast amounts of internet data, have developed a broad understanding of the world, enhancing the decision-making capabilities of embodied agents. This success is largely due to the comprehensive and in-depth domain knowledge within their training datasets. However, the extent of this knowledge can vary across different domains, and existing methods often assume that LLMs have a complete understanding of their environment, overlooking potential gaps in their grasp of actual world dynamics. To address this gap, we introduce <b>Di</b>scover, <b>V</b>erify, and <b>E</b>volve (<b/>), a framework that <b>discovers</b> world dynamics from a small number of demonstrations, <b>verifies</b> the correctness of these dynamics, and <b>evolves</b> new, advanced dynamics tailored to the current situation. Through extensive evaluations, we analyze the impact of each component on performance and compare the automatically generated dynamics from with human-annotated world dynamics. Our results demonstrate that LLMs guided by can make better decisions, achieving rewards comparable to human players in the Crafter environment.</abstract>
<abstract>Large language models (LLMs), trained on vast amounts of internet data, have developed a broad understanding of the world, enhancing the decision-making capabilities of embodied agents. This success is largely due to the comprehensive and in-depth domain knowledge within their training datasets. However, the extent of this knowledge can vary across different domains, and existing methods often assume that LLMs have a complete understanding of their environment, overlooking potential gaps in their grasp of actual world dynamics. To address this gap, we introduce <b>Di</b>scover, <b>V</b>erify, and <b>E</b>volve (DiVE), a framework that <b>discovers</b> world dynamics from a small number of demonstrations, <b>verifies</b> the correctness of these dynamics, and <b>evolves</b> new, advanced dynamics tailored to the current situation. Through extensive evaluations, we analyze the impact of each component on performance and compare the automatically generated dynamics from with human-annotated world dynamics. Our results demonstrate that LLMs guided by can make better decisions, achieving rewards comparable to human players in the Crafter environment.</abstract>
<url hash="100611d1">2024.findings-emnlp.202</url>
<bibkey>sun-etal-2024-enhancing-agent</bibkey>
</paper>
Expand Down

0 comments on commit cd8314a

Please sign in to comment.