Segnet

This project demonstrates training on the comma10k dataset. The goal is to provide training and prediction with the PyTorch C++ "libtorch" library. The network is a simple U-Net which outputs a tensor of size [N, classes, W, H]. To classify a given pixel, the argmax over the class responses at that pixel is calculated; the index of the largest response is the predicted class.
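The per-pixel argmax described above can be sketched in plain C++ as follows. This is a minimal illustration, not the project's code: the function name `argmax_classes` and the assumption of a contiguous, row-major float buffer are hypothetical. With a libtorch tensor, the same reduction is normally a single `output.argmax(1)` call over the class dimension.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Per-pixel argmax over the class dimension of a contiguous, row-major
// [N, C, W, H] float buffer (layout is an assumption for this sketch).
// Returns an [N, W, H] buffer holding the winning class index per pixel.
std::vector<int> argmax_classes(const std::vector<float>& scores,
                                std::size_t n, std::size_t c,
                                std::size_t w, std::size_t h) {
  assert(c > 0 && scores.size() == n * c * w * h);
  const std::size_t plane = w * h;  // number of pixels in one class channel
  std::vector<int> labels(n * plane, 0);
  for (std::size_t b = 0; b < n; ++b) {
    for (std::size_t p = 0; p < plane; ++p) {
      const std::size_t base = b * c * plane + p;
      float best = scores[base];  // start with the response of class 0
      for (std::size_t k = 1; k < c; ++k) {
        const float v = scores[base + k * plane];
        if (v > best) {
          best = v;
          labels[b * plane + p] = static_cast<int>(k);
        }
      }
    }
  }
  return labels;
}
```

Ties are resolved in favour of the lowest class index, since a later class only wins when its response is strictly larger.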

Example Results

Right now there are 8908 images in files_trainable, with 976 used for testing. The network performs acceptably after more than 20 epochs, but it struggles with fine detail. Training started at 4:53pm on March 13, 2022 and reached epoch 33 at 8:55pm (about 7 minutes per epoch) on a 1080Ti card. It would be interesting to perform evaluation only on "confident" network outputs. After 100 epochs, the average loss is 0.0694 on test and 0.0549 on training data. If dropout is used, the average loss after 100 epochs is 0.1060 on test and 0.0960 on training data.

Input picture (left), groundtruth (top right), and prediction (bottom right)

Confidence in order for: Road, Lane markings, Undrivable, Movable, My car.
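The idea of evaluating only on "confident" outputs could be sketched as below. Several things here are assumptions rather than the project's method: confidence is taken to be the softmax probability of the winning class, the per-pixel inputs are assumed to be raw logits (five per pixel, one for each class listed above), and the names `predict_pixel`, `confident_accuracy`, and `min_confidence` are hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Winning class and its softmax probability for one pixel.
struct PixelPrediction {
  int label;
  float confidence;
};

// Softmax over one pixel's class responses (assumed to be raw logits).
PixelPrediction predict_pixel(const std::vector<float>& logits) {
  assert(!logits.empty());
  const auto max_it = std::max_element(logits.begin(), logits.end());
  float denom = 0.0f;
  for (float v : logits) denom += std::exp(v - *max_it);  // stable softmax
  // The numerator of the winning class is exp(0) = 1.
  return {static_cast<int>(max_it - logits.begin()), 1.0f / denom};
}

// Fraction of correctly labelled pixels among those whose confidence is
// at least min_confidence. Returns -1 when no pixel passes the threshold.
double confident_accuracy(const std::vector<std::vector<float>>& pixel_logits,
                          const std::vector<int>& truth,
                          float min_confidence) {
  assert(pixel_logits.size() == truth.size());
  std::size_t kept = 0;
  std::size_t correct = 0;
  for (std::size_t i = 0; i < truth.size(); ++i) {
    const PixelPrediction p = predict_pixel(pixel_logits[i]);
    if (p.confidence < min_confidence) continue;  // skip unsure pixels
    ++kept;
    if (p.label == truth[i]) ++correct;
  }
  return kept == 0 ? -1.0 : static_cast<double>(correct) / kept;
}
```

Raising `min_confidence` trades coverage for reliability: fewer pixels are scored, but those that remain are the ones the network is most certain about.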

Dependencies

Linux only (tested on Ubuntu 20.04 LTS)
PyTorch LibTorch - https://pytorch.org/get-started/locally/#start-locally
- Extract into libtorch/ directory
- https://download.pytorch.org/libtorch/cu101/libtorch-cxx11-abi-shared-with-deps-1.3.0.zip
- Stable 1.3 Linux LibTorch C++
- CUDA version 10.1
- cxx11 ABI since we build with c++11
Install CUDA 10.1 (match your libtorch version) - https://developer.nvidia.com/cuda-10.1-download-archive-base
Install cuDNN 7.5 (match 10.1 cuda version!) - https://developer.nvidia.com/cudnn
Install OpenCV - sudo apt install libopencv-dev
Install Boost 1.68 - sudo apt install libboost-dev

Training Yourself

Clone the comma10k repo.
Update the path to its root directory in the src/net_seg_train.cpp file.
Build and run it.
A number of augmentations are applied; edit src/utils/augmentations.h if you wish to tune them.
If you wish to train on a different dataset, you will need to create your own data loader.
After training, you can use the src/net_seg_test.cpp file to see your loss on the validation subset.

Future Work / TODOs

Use the larger "trainable" image set link
Allow setting of the max return probability
See if dropout in network helps
Compare against baseline: commaai/comma10k#2000
ROS subscriber / publisher
ROS append to bag file

goldbattle/segnet

Folders and files

Latest commit

History

Repository files navigation

Segnet

Example Results

Dependencies

Training Yourself

Future Work / TODOs

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages