Driving Policy (Advanced)

DISCLAIMERS

Safety: Driving policies are not perfect and may crash the robot. Always make sure you operate in a safe environment! Keep in mind, that your phone could be damaged in a collision! Make sure you always have a game controller connected and are familiar with the key mapping so you can stop the vehicle at any time. Use at your own risk!
Compute hardware: Training a driving policy requires a lot of resources and may slow down or even freeze your machine. It is recommended to use a high-end laptop or workstation with large amount of RAM and dedicated GPU, especially when training with larger batch sizes. The documentation is currently also not very detailed. Use at your own risk!
Patience required: To get a good driving policy for your custom dataset will require some patience. It is not straight-forward, involves data collection, hyperparameter tuning, etc. If you have never trained machine learning models before, it will be challenging and may even get frustrating.

You first need to setup your training environment.

Dependencies

We recommend to create a conda environment for OpenBot (if not already done). Instructions on installing conda can be found here. You can create a new environment with the following command:

conda create -n openbot pip

If you do not want install the dependencies globally, activate your conda environment first:

conda activate openbot

Make sure you are in the folder policy within your local OpenBot repository. Now, you can install all the dependencies for training with the following command:

pip install -r requirements_train.txt

You can also install pydot (pip install pydot) and graphviz (see instructions) if you want to visualize the the network architecture.

Note that training will be very slow on a laptop. So if you have access to a computer with dedicated GPU, we highly recommend to use it.

If you want to use the WebApp for data collection and training, you need to install the following dependencies in addition.

pip install -r requirements_web.txt

If you prefer to setup the environment manually, here is a list of the dependencies:

tensorflow
jupyter notebook
matplotlib
numpy
PIL
black[jupyter]

If you want to use the web interface you also need:

aiohttp
aiozeroconf
imageio

NOTES:

Remember to activate the environment before running commands in the terminal: conda activate openbot
If your tensorflow import does not work, try installing via pip install tensorflow --user. (See this issue.)

Dataset

Data Collection

In order to train an autonomous driving policy, you will first need to collect a dataset. The more data you collect, the better the resulting driving policy. For the experiments in our paper, we collected about 30 minutes worth of data. Note that the network will imitate your driving behaviour. The better and more consistent you drive, the better the network will learn to drive.

Connect a bluetooth game controller to the phone (e.g. PS4 controller: to enter pairing mode press the PS and share buttons until the LED flashes quickly).
Select the CIL-Mobile model in the app.
Now drive drive the car via a game controller and record a dataset. On the PS4 controller logging can be toggled with the X button.

You will now find a folder called Openbot on the internal storage of your smartphone. For each recording, there will be zip file. The name of the zip file will be in the format yyyymmdd_hhmmss.zip corresponding to the timestamp of when the recording was started.

The Jupyter notebook expects a folder called dataset in the same folder. In this folder, there should be two subfolders, train_data and test_data. The training data is used to learn the driving policy. The test data is used to validate the learned driving policy on unseen data during the training process. This provides some indication how well this policy will work on the robot. Even though the robot may drive along the same route as seen during training, the exact images observed will be slightly different in every run. The common split is 80% training data and 20% test data. Inside the train_data and test_data folders, you need to make a folder for each recording session and give it a name such as my_openbot_1, my_openbot_2, etc. The idea here is that each recording session may have different lighting conditions, a different robot, a different route. In the Jupyter notebook, you can then train only on a subset of these datasets or on all of them. Inside each recording session folder, you drop all the recordings from that recording session. Each recording corresponds to an extracted zip file that you have transferred from the Openbot folder on your phone. Your dataset folder should look like this:

Rather than copying all files manually from the phone, you can also upload the logs automatically to a Python server on your computer. In this case, the zip files will be uploaded and unpacked into the folder dataset/uploaded. You will still need to move them into the folder structure for training. You can simply treat the uploaded folder as a recording session and move it into train_data. The recordings will then be recognized as training data by the Jupyter notebook. If you do not already have a recording session in the test_data folder, you also need to move at least one recording from train_data/uploaded into test_data/uploaded.

Data Conversion (optional)

For better training performance, you can convert the collected dataset into a specialized format. You can create a tfrecord of the train and test datasets with the following commands:

conda activate openbot
python -m openbot.tfrecord -i dataset/train_data -o dataset/tfrecords -n train.tfrec
python -m openbot.tfrecord -i dataset/test_data -o dataset/tfrecords -n test.tfrec

By default this conversion will be done automatically at the start of training.

Policy Training

Make sure your conda environment for openbot is activated by executing the following command:

conda activate openbot

Jupyter Notebook

We provide a Jupyter Notebook that guides you through the steps for training an autonomous driving policy. Open the notebook with the following command.

jupyter notebook policy_learning.ipynb

Now a web-browser window will open automatically and load the Jupyter notebook. Follow the steps in order to train a model with your own data.

Shell

This method assumes that the data is in the correct place. To adjust the hyperparameters you can pass the following arguments.

'--no_tf_record', action='store_true', help='do not load a tfrecord but a directory of files'
'--create_tf_record', action='store_true', help='create a new tfrecord'
'--model', type=str, default='pilot_net', choices=['cil_mobile', 'cil_mobile_fast', 'cil', 'pilot_net'], help='network architecture (default: cil_mobile)'
'--batch_size', type=int, default=16, help='number of training epochs (default: 16)'
'--learning_rate', type=float, default=0.0001, help='learning rate (default: 0.0001)'
'--num_epochs', type=int, default=10, help='number of epochs (default: 10)'
'--batch_norm', action='store_true', help='use batch norm'
'--flip_aug', action='store_true', help='randomly flip images and controls for augmentation'
'--cmd_aug', action='store_true', help='add noise to command input for augmentation'
'--resume', action='store_true', help='resume previous training'

If your dataset has already been converted to a tfrecord, you can train the policy from the shell with the command:

python -m openbot.train

If you would like to convert your dataset to a tfrecord, before training, you need to add the following flag:

python -m openbot.train --create_tf_record

If you do not want to convert the dataset to a tfrecord, and train using the files direclty, you need to add the following flag:

python -m openbot.train --no_tf_record

To train a model for final deployment, you want to use a large batch size and number of epochs. Enabling batch norm usually improves training as well. The model pilot_net is larger than the default cil_mobile but can achieve better performance on some tasks while still runnining in real time on most smartphones.

python -m openbot.train --model pilot_net --batch_size 128 --num_epochs 100 --batch_norm

Deployment

At the end of the training process, two tflite files are generated: one corresponds to the best checkpoint according to the validation metrics and the other to the last checkpoint. Pick one of them and rename it to autopilot_float.tflite. Replace the existing model in Android Studio and recompile the app.

If you are looking for the folder in your local directory, you will find it at: app/src/main/assets/networks.

Web App

We provide a web app and a python web server for easy policy training. (Beta)

Features

Automatic log (session) upload
- see Troubleshooting for details
List uploaded sessions, with GIF preview
List datasets, with basic info
Move session to a dataset
Delete session
List trained models, and show plots about training
Train a model with basic parameters, show progress bar

Preview

Quickstart

conda activate openbot
python -m openbot.server

You can now open your browser to visualize the dataset and see incoming uploads by going to: http://localhost:8000/#/uploaded

Running the server

You can run the python server with the command:

python -m openbot.server

There is also a developer mode:

adev runserver openbot/server

For frontend development (react app):

FE_DEV=1 adev runserver openbot/server

When you run the server you should see something like:

Skip address 127.0.0.1 @ interface lo
Found address 192.168.x.x @ interface wlp2s0
Registration of a service, press Ctrl-C to exit...
Running frontend: 0.1.0
Frontend path: /home/USERNAME/miniconda3/envs/openbot/lib/python3.7/site-packages/openbot_frontend
======== Running on http://0.0.0.0:8000 ========
(Press CTRL+C to quit)

Troubleshooting

If the upload to the server is not working, here are some troubleshooting tips:

Try restarting the server (computer) and the OpenBot app (smartphone)
Make sure the smartphone and your computer are connected to the same WiFi network
If your router has both 2.4 GHz and 5 GHz networks with the same name, disable the 5 GHz network
Keep the phone connected to Android Studio while running the app. In the Logcat tab, select Debug from the dropdown. Type NSD into the filter field to see the debug messages concerning the server connection. Type Upload into the filter field for debug messages concerning the recording file upload.
If a published models gets downloaded continiously, make sure the time on your phone and laptop / workstation are set correctly

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Driving Policy (Advanced)

DISCLAIMERS

Dependencies

Dataset

Data Collection

Data Conversion (optional)

Policy Training

Jupyter Notebook

Shell

Deployment

Web App

Features

Preview

Quickstart

Running the server

Troubleshooting

Files

README.md

Latest commit

History

README.md

File metadata and controls

Driving Policy (Advanced)

DISCLAIMERS

Dependencies

Dataset

Data Collection

Data Conversion (optional)

Policy Training

Jupyter Notebook

Shell

Deployment

Web App

Features

Preview

Quickstart

Running the server

Troubleshooting