Before training a model, ensure that you have generated the metadata for your dataset. You can do this by running the following command:
export PYTHONPATH="./"
python -m scripts.preprocess_dataset --dataset_lists DATASETNAMES --dataset_path /data/datasets
If you already have the files DATASET_NAME/house_id_to_sub_house_id_train.json and DATASET_NAME/house_id_to_sub_house_id_val.json, you can skip this step.
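For example, a quick way to check for these files (assuming, as in the command above, that the dataset root is /data/datasets; adjust the paths to your setup):
[ -f /data/datasets/DATASET_NAME/house_id_to_sub_house_id_train.json ] && \
[ -f /data/datasets/DATASET_NAME/house_id_to_sub_house_id_val.json ] && \
echo "metadata already generated; preprocessing can be skipped"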
To train a model, use the following command:
python -m training.offline.train_pl --dataset_version DATASET_NAME --eval_every 50 --precision PRECISION_TYPE --input_sensors LIST_OF_SENSORS --per_gpu_batch PER_GPU_BATCH_SIZE --sliding_window 50 --model MODEL_NAME --model_version base --lr BASE_LEARNING_RATE
You can set the precision to full (32-bit) precision:
--precision 32-true
Or mixed precision:
--precision 16-mixed
Example Full Command:
python -m training.offline.train_pl --max_samples MAX_TRAIN_SAMPLES --eval_max_samples MAX_EVAL_SAMPLES --eval_every EVAL_EPOCH_FREQ --save_every SAVE_EPOCH_FREQ --model_version siglip_base_3 --sliding_window 4 --per_gpu_batch 3 --lr 0.0002 --data_dir DATASET_DIR --dataset_version CHORES --model EarlyFusionCnnTransformer --input_sensors raw_navigation_camera raw_manipulation_camera last_actions an_object_is_in_hand --precision 16-mixed --loss action --max_epochs MAX_EPOCH --resume_local --output_dir /path/to/save/logs --num_nodes NUM_NODES
To train a model on multiple datasets, first define your mixture in training/offline/dataset_mixtures.py, for example TWO_TASK_MIX = ['ObjectNavType', 'ObjectNavDescription']. Then pass the mixture name as an argument: --dataset_version TWO_TASK_MIX.
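Concretely, the entry in training/offline/dataset_mixtures.py might look like the following minimal sketch (any other contents of that file are assumptions; only the mixture definition itself comes from this guide):

```python
# training/offline/dataset_mixtures.py (sketch)
# A mixture is a named list of dataset/task names; pass the variable name
# via --dataset_version to train on all of them.
TWO_TASK_MIX = ['ObjectNavType', 'ObjectNavDescription']
```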
To log the metrics and model weights locally, use the following flags:
--wandb_logging False --output_dir /path/to/save/logs
If you prefer to use wandb, use the following flags:
--wandb_logging True --wandb_entity_name WANDB_ENTITY_NAME --wandb_project_name WANDB_PROJECT_NAME
Note that if you are using wandb, you should also specify a local folder where your data will be saved before being uploaded to wandb. You can do so by setting the WANDB_DIR environment variable:
export WANDB_DIR=/local/path/to/save/files
Note that during training an experiment ID is assigned to your run. This training ID can be used later to evaluate the trained model.
To evaluate a model using online evaluation, use the following command:
python -m training.offline.online_eval --shuffle --eval_subset minival --output_basedir /path/to/save/logs --test_augmentation --task_type TASK_TYPE --input_sensors raw_navigation_camera raw_manipulation_camera last_actions an_object_is_in_hand --house_set objaverse --training_run_id TRAINING_ID --ckptStep STEP_NUMBER --num_workers NUM_WORKERS --gpu_devices LIST_OF_GPU_IDS
The weights and metrics will be saved in /path/to/save/logs. Use the following flags to evaluate the trained model. Note that TRAINING_ID is the identifier that was automatically generated for your training run (see the training instructions above).
For loading local weights:
--wandb_logging False --training_run_id TRAINING_ID --ckptStep STEP_NUMBER --local_checkpoint_dir /path/to/local/checkpoints/
If you would prefer to use wandb, the TRAINING_ID will be your wandb run ID, and you can use the following flags for evaluation:
--wandb_logging True --wandb_entity_name WANDB_ENTITY_NAME --wandb_project_name WANDB_PROJECT_NAME --training_run_id TRAINING_ID
To create your own model, follow these steps:
- Define a new model class in architecture/models/transformer_models (e.g., early_fusion_tsfm_models.py). Requirements for the implementation (see the sketch after this list):
  - A class describing your model.
  - A forward(self, model_inputs, actions) function that calculates the loss and returns a dictionary with the keys "logits" and "loss".
  - A class method called build_model that takes "model_version", "input_sensors", "loss", and "ckpt_path" as input and returns the model and preprocessor as output.
  - A class method called build_agent that takes "model_version", "input_sensors", "loss", "device", "sampling" (greedy or sampling), and "ckpt_path" as input and returns the online evaluation agent as output.
- Register your model in architecture/models/transformer_models/__init__.py.
- Implement the Agent class, which caches inputs and performs the forward pass during online evaluation.
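Below is a minimal, hypothetical sketch of this interface. The class names, tensor shapes, the "features" input key, and the stand-in MyAgent wrapper are illustrative assumptions rather than the repository's actual implementation; only the required method names and argument lists come from the steps above.

```python
import torch
import torch.nn as nn


class MyAgent:
    """Stand-in for the online-evaluation Agent described in the last step."""

    def __init__(self, model, preprocessor, sampling="greedy"):
        self.model = model
        self.preprocessor = preprocessor
        self.sampling = sampling  # "greedy" or "sampling"


class MyEarlyFusionModel(nn.Module):
    def __init__(self, model_version, input_sensors, loss, hidden_dim=512, num_actions=20):
        super().__init__()
        self.input_sensors = input_sensors
        self.loss_type = loss
        # Assumed: the preprocessor/encoder turns the sensor observations into a
        # single (batch, hidden_dim) feature tensor.
        self.action_head = nn.Linear(hidden_dim, num_actions)

    def forward(self, model_inputs, actions):
        # model_inputs: dict of sensor tensors; "features" is an assumed key.
        logits = self.action_head(model_inputs["features"])
        loss = nn.functional.cross_entropy(logits, actions)
        return {"logits": logits, "loss": loss}

    @classmethod
    def build_model(cls, model_version, input_sensors, loss, ckpt_path=None):
        model = cls(model_version, input_sensors, loss)
        if ckpt_path is not None:
            model.load_state_dict(torch.load(ckpt_path, map_location="cpu"))
        preprocessor = None  # return the preprocessor that matches your sensors
        return model, preprocessor

    @classmethod
    def build_agent(cls, model_version, input_sensors, loss, device, sampling, ckpt_path=None):
        model, preprocessor = cls.build_model(model_version, input_sensors, loss, ckpt_path)
        return MyAgent(model.to(device), preprocessor, sampling=sampling)
```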
First, export the Objaverse data directory and the houses directory:
export OBJAVERSE_DATA_DIR=/path/to/objaverse_assets
export OBJAVERSE_HOUSES_DIR=/path/to/objaverse_houses
To train a model, use the following command:
python -m allenact.main training/online/siglip_vitb_gru_rgb_augment_objectnav
The results will be saved to the experiment_output directory. Please refer to AllenAct for more details.
You can use wandb callbacks with WANDB_PROJECT and WANDB_ENTITY specified:
export WANDB_PROJECT=YOUR_WANDB_PROJECT_NAME
export WANDB_ENTITY=YOUR_WANDB_ENTITY_NAME
python -m allenact.main training/online/siglip_vitb_gru_rgb_augment_objectnav --callbacks wandb_logging_callback
To evaluate, use the following command:
python -m allenact.main training/online/siglip_vitb_gru_rgb_augment_objectnav -c /path/to/checkpoint --eval
You can find the checkpoint in the experiment_output directory.
Pick a directory /path/to/local/checkpoints where you'd like to save our pretrained models. Then run
python -m scripts.download_trained_ckpt --save_dir /path/to/local/checkpoints --ckpt_ids SigLIP-ViTb-3-CHORES-S
to download our SigLIP-ViTb-3-CHORES-S model. Omitting --ckpt_ids will download DINOv2-ViTs-3-CHORES-S, SigLIP-ViTb-3-CHORES-L, SigLIP-ViTb-3-CHORES-S, SigLIP-ViTb-3-CHORESNav-L, SigLIP-ViTb-3-CHORESNav-S, SigLIP-ViTb-3-double-det-CHORES-L, and SigLIP-ViTb-3-double-det-CHORES-S to that directory.
Then, we can evaluate the model without detection:
python -m training.offline.online_eval --shuffle --eval_subset minival --output_basedir /path/to/save/logs \
--test_augmentation --task_type TASK_TYPE \
--input_sensors raw_navigation_camera raw_manipulation_camera last_actions an_object_is_in_hand \
--house_set objaverse --wandb_logging False --num_workers NUM_WORKERS \
--gpu_devices LIST_OF_GPU_IDS --training_run_id TRAINING_ID --local_checkpoint_dir /path/to/local/checkpoints
and the model with detection:
python -m training.offline.online_eval --shuffle --eval_subset minival --output_basedir /path/to/save/logs \
--test_augmentation --task_type TASK_TYPE \
--input_sensors raw_navigation_camera raw_manipulation_camera last_actions an_object_is_in_hand \
nav_task_relevant_object_bbox manip_task_relevant_object_bbox nav_accurate_object_bbox manip_accurate_object_bbox \
--house_set objaverse --wandb_logging False --num_workers NUM_WORKERS \
--gpu_devices LIST_OF_GPU_IDS --training_run_id TRAINING_ID --local_checkpoint_dir /path/to/local/checkpoints
TASK_TYPE can be a specific task or one of the options CHORES or CHORESNAV. TRAINING_ID can be one of DINOv2-ViTs-3-CHORES-S, SigLIP-ViTb-3-CHORES-L, SigLIP-ViTb-3-CHORES-S, SigLIP-ViTb-3-CHORESNav-L, SigLIP-ViTb-3-CHORESNav-S, SigLIP-ViTb-3-double-det-CHORES-L, or SigLIP-ViTb-3-double-det-CHORES-S.
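For instance, one plausible way to fill in the placeholders for the navigation-only checkpoint is shown below (the worker count and GPU choice are arbitrary illustrations; any of the checkpoint IDs above can be substituted):
python -m training.offline.online_eval --shuffle --eval_subset minival --output_basedir /path/to/save/logs \
--test_augmentation --task_type CHORESNAV \
--input_sensors raw_navigation_camera raw_manipulation_camera last_actions an_object_is_in_hand \
--house_set objaverse --wandb_logging False --num_workers 8 \
--gpu_devices 0 --training_run_id SigLIP-ViTb-3-CHORESNav-S --local_checkpoint_dir /path/to/local/checkpoints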
For an end-to-end example, please refer to the scripts/evaluate_pretrained_model.sh script:
bash scripts/evaluate_pretrained_model.sh
It downloads the Objaverse assets, the Objaverse houses, and the pretrained model SigLIP-ViTb-3-double-det-CHORES-S, and then evaluates the model on the minival set.