LLMs-and-ARC

This repository hosts experimental results as described in the paper: LLMs and the Abstraction and Reasoning Corpus: Successes, Failures, and the Importance of Object-based Representations.

Structure

The output-logs directory includes two subdirectories, direct-grid and object-based, each representing different approaches in our experiment. Both contain:

ARC-subset: Results for various prompting methods on the 50 ARC tasks subset, with filenames representing the prompting method as outlined in Table 1 and 3 of our paper.
1D-ARC and Vertical-Horizontal: Results for generated tasks using the best performing prompting methods, with filenames based on the generated task types as seen in Table 2 and 4 of our paper. The exact mapping of the 1D-ARC filenames can be found here.

Note: If a results file ends with 3.5 or 4, it contains the result for GPT-3.5 or GPT-4, respectively. If it does not have such an ending, it contains results for both models.

Visualization

For a visualization of GPT-4 solutions, visit our visualizations page.

Dataset

The "1D-ARC" dataset used in our experiments is available at this GitHub repository.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

LLMs-and-ARC

Structure

Visualization

Dataset

Files

README.md

Latest commit

History

README.md

File metadata and controls

LLMs-and-ARC

Structure

Visualization

Dataset