diff --git a/docs/index.html b/docs/index.html index c9e7ac1..7f4da24 100644 --- a/docs/index.html +++ b/docs/index.html @@ -26,13 +26,14 @@

MOPA: Modular Object Navigation with PointGoal Agents

Tommaso Campari2,3 Unnat Jain4 Manolis Savva1 - Angel X. Chang1 + Angel X. Chang1,5
1Simon Fraser University,   2University of Padova,   3FBK, -   4Meta AI +   4Meta AI, +   5Amii
@@ -58,27 +59,7 @@

MOPA: Modular Object Navigation with PointGoal Agents

Abstract

- Our work focuses on the Multi-Object Navigation (MultiON) task, where an agent needs to - navigate to - multiple objects in a given sequence. We systematically investigate the inherent modularity - of this - task by dividing our approach to contain four modules: (a) an object detection module - trained to - identify objects from RGB images, (b) a map building module to build a semantic map of the - observed - objects, (c) an exploration module enabling the agent to explore its surroundings, and - finally (d) a - navigation module to move to identified target objects. We focus on the navigation and the - exploration modules in this work. We show that we can effectively leverage a PointGoal - navigation - model in the MultiON task instead of learning to navigate from scratch. Our experiments show - that a - PointGoal agent-based navigation module outperforms analytical path planning on the MultiON - task. We - also compare exploration strategies and surprisingly find that a random exploration strategy - significantly outperforms more advanced exploration methods. We additionally create MultiON - 2.0, a - new large-scale dataset as a test-bed for our approach. + We propose a simple but effective modular approach MOPA (Modular ObjectNav with PointGoal agents) to systematically investigate the inherent modularity of the object navigation task in Embodied AI. MOPA consists of four modules: (a) an object detection module trained to identify objects from RGB images, (b) a map building module to build a semantic map of the observed objects, (c) an exploration module enabling the agent to explore the environment, and (d) a navigation module to move to identified target objects. We show that we can effectively reuse a pretrained PointGoal agent as the navigation model instead of learning to navigate from scratch, thus saving time and compute. We also compare various exploration strategies for MOPA and find that a simple uniform strategy significantly outperforms more advanced exploration methods.