This repository contains code and data required to reproduce the results in the ICRA 2018 paper
Beyond Pixels: Leveraging Geometry and Shape Cues for Online Multi-Object Tracking (arXiv)
Sarthak Sharma *, Junaid Ahmed Ansari*, J. Krishna Murthy, and K. Madhava Krishna
* The first two authors contributed equally to the work.
The project page has more qualitative results, and links to data.
If you find the code/data useful in your experiments, kindly consider citing
@inproceedings{BeyondPixels_ICRA2018,
title={Beyond Pixels: Leveraging Geometry and Shape Cues for Online Multi-Object Tracking},
author={Sarthak Sharma, Junaid Ahmed Ansari, J. Krishna Murthy, K. Madhava Krishna},
booktitle = {Procedings of the IEEE International Conference on Robotics and Automation},
year={2018}
}
We provide demo scripts for running code and visualizing results on sequences from the KITTI Tracking dataset.
To run a demo script that shows representative results on short snippets from the train and test splits run
main_script_train.m
main_script_test.m
IMPORTANT: You need to have the
Data
folder initialized, before you can run this demo. See below for details.
Before you can run this, however, make sure you download the requisite CNN appearance features and rectified images by running the following script.
sh download_data.sh
Or you can download it from here and place it (after unzipping it) in the parent directory.
To falcilitate comparision, we have also released our results on the KITTI Tracking benchmark (train and test splits). The result files, in the format specified by the evaluation server, can be downloaded from here
DISCLAIMER: The result files have been released in good faith, in the spirit of reproducible research. No misuse is permitted.
We release release object detections obtained (and filtering scripts for non-maxima suppression, along with parameters used) for all train and test sequences. We report results obtained by running RRC-Net [code] on KITTI Tracking data. They can be accessed in the Data
directory.
We also release odometry estimates obtained from ORB-SLAM. Note that, since we used monocular ORB-SLAM, odometry estimates were obtained to-scale. To get rid of the scale factor ambiguity, we empirically estimate a scale factor by four-fold cross-validation over the train set. Once this scale factor is estimated, we use the same factor across all train and test sequences in the results reported. These can be found in the Data
directory too.
AutoWare's (one of the first "all-in-one" open-source software for self-driving vehicles) implements an image based object tracker is based on our work. Link