- For desktop: Conda is required to handle the package installation. Follow this link to view the Miniconda installer page.
- For Raspberry Pi: Follow the instructions to set up Fedora on the Raspberry Pi. Then, install Miniforge to handle the conda environment.
- Clone this repository:

  ```
  git clone --recurse-submodules https://github.com/lpcvai/21LPCVC-UAV_VIdeo_Track-Sample-Solution.git
  cd 21LPCVC-UAV_VIdeo_Track-Sample-Solution
  ```
- Create a virtual environment and install its dependencies:
  - For desktop with an Nvidia GPU:

    ```
    conda env create -f environment.yml
    ```

  - For desktop without an Nvidia GPU:

    ```
    conda env create -f environment_nocuda.yml
    ```

  - For Raspberry Pi 3B+ with Fedora & Miniforge:

    ```
    conda env create -f environment_rpi.yml
    ```
- Activate the conda environment:

  ```
  conda activate SampleSolution
  ```
- The trained weights are provided here. The weights file is called `best.pt` and should be placed under `yolov5/weights/`. We recommend using the `best.pt` weights, since they use the smallest YOLOv5 model available, which is the most suitable for the Raspberry Pi / mobile devices. There is another model, yolov5l, which uses a more complex architecture for higher accuracy but results in a lower FPS; use it if you want the best detections. The trained weights were created using a dataset containing over 10,000 images; more stats on the dataset can be found in `yolov5/weights/stats.txt`. Specific stats about the training session can be viewed here if you're interested. Our quantization implementation can be viewed here.
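  As a quick sanity check that `best.pt` is in place and loadable, you can pull it into the public Ultralytics YOLOv5 hub entry point. This is only a sketch, not how `main.py` itself loads the model, and the test image path is a placeholder:

  ```python
  # Load the custom weights through torch.hub and run one test image.
  # Assumes the public ultralytics/yolov5 hub entry point; the image path is a placeholder.
  import torch

  model = torch.hub.load("ultralytics/yolov5", "custom", path="yolov5/weights/best.pt")
  results = model("inputs/sample_frame.jpg")  # any still frame from a test video
  results.print()  # prints detected classes and confidences
  ```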
- The DeepSORT weights need to be downloaded; they can be found here. The file should be called `ckpt.t7` and placed under `deep_sort/deep_sort/deep/checkpoint/`.
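  Before running, it is worth confirming that both checkpoints landed where the pipeline expects them. A minimal check, using the paths from the two steps above:

  ```python
  # Verify both weight files sit at the locations given above.
  from pathlib import Path

  for weights in ("yolov5/weights/best.pt",
                  "deep_sort/deep_sort/deep/checkpoint/ckpt.t7"):
      print(weights, "->", "found" if Path(weights).is_file() else "MISSING")
  ```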
- The first input will be a video. The sample videos can be found here.
- The second input is a CSV file containing the first 10 frames, for the solution to acquire the correct labels. An additional 10 frames will be provided in the middle of the video to recalibrate the labels if some identity switching occurs. The format for the input file in `inputs/"videoname".csv` should be similar to the example below. NOTE: The delimiter between each value in the actual CSV file will be a comma (","); the | is just for visualization. The bounding box coordinate system is based on the YOLO annotation format (see the parsing sketch after the field definitions below).

  | Frame | Class | ID | X | Y | Width | Height |
  |---|---|---|---|---|---|---|
  | 0 | 0 | 1 | 0.41015 | 0.39583 | 0.02031 | 0.03425 |
  | 0 | 0 | 2 | 0.36835 | 0.61990 | 0.04557 | 0.18055 |
  | 0 | 1 | 3 | 0.41015 | 0.39583 | 0.03593 | 0.16296 |
  | 1 | 0 | 1 | 0.52942 | 0.39583 | 0.02031 | 0.03425 |
  | 1 | 0 | 2 | 0.36835 | 0.61990 | 0.04557 | 0.18055 |
  | 1 | 1 | 3 | 0.52942 | 0.39537 | 0.03593 | 0.16296 |
  - Frame: The frame number of the annotation
  - Class: 0 for person, 1 for sports ball
  - X = absolute_x / image_width
  - Y = absolute_y / image_height
  - Width = absolute_width / image_width
  - Height = absolute_height / image_height
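  For reference, a minimal sketch of reading one of these ground-truth files and converting the normalized YOLO boxes back to pixels. The file name and frame size are placeholders, and the sketch assumes the file carries the header row shown above; in the YOLO format, X/Y give the box center:

  ```python
  # Parse a ground-truth CSV and denormalize the YOLO-style boxes.
  # File name and frame dimensions are placeholders for your own video.
  import csv

  IMG_W, IMG_H = 1920, 1080  # assumed frame size of the input video

  with open("inputs/video.csv", newline="") as f:
      reader = csv.reader(f)
      next(reader)  # skip the header row, assuming the file carries one
      for frame, cls, track_id, x, y, w, h in reader:
          cx, cy = float(x) * IMG_W, float(y) * IMG_H  # box center in pixels
          bw, bh = float(w) * IMG_W, float(h) * IMG_H  # box size in pixels
          print(f"frame {frame}, id {track_id}, class {cls}: "
                f"center ({cx:.0f}, {cy:.0f}), size {bw:.0f}x{bh:.0f}")
  ```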
- The only output from the solution should be a text file. This text file will include the location of every ball each time a single ball is caught. The format for the output file in `outputs/"videoname"_out.csv` should be similar to the example below. NOTE: The delimiter between each value in the actual CSV file will be a comma (","); the | is just for visualization.
  | Frame | Yellow | Orange | Red | Purple | Blue | Green | |
  |---|---|---|---|---|---|---|---|
  | 5 | 0 | 1 | 5 | 2 | 4 | 0 | Person 4 catches blue |
  | 30 | 0 | 3 | 5 | 2 | 4 | 0 | Person 3 catches orange |
  | 49 | 0 | 3 | 1 | 2 | 4 | 0 | Person 1 catches red |
  | 60 | 0 | 3 | 1 | 2 | 5 | 0 | Person 5 catches blue |
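  A minimal sketch of producing rows in this format; each color column appears to hold the ID of the person holding that ball (0 when no one has caught it). Function and file names here are hypothetical:

  ```python
  # Append one row per catch event in the output format above.
  # write_catch and the file name are hypothetical sketches.
  import csv

  BALL_COLORS = ["Yellow", "Orange", "Red", "Purple", "Blue", "Green"]

  def write_catch(writer, frame, holders):
      """holders maps ball color -> ID of the person holding it (0 if none)."""
      writer.writerow([frame] + [holders.get(color, 0) for color in BALL_COLORS])

  with open("outputs/video_out.csv", "w", newline="") as f:
      out = csv.writer(f)
      out.writerow(["Frame"] + BALL_COLORS)  # header row, if the format expects one
      # e.g. the first example row above: person 4 has just caught the blue ball
      write_catch(out, 5, {"Orange": 1, "Red": 5, "Purple": 2, "Blue": 4})
  ```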
Run the solution:

```
python3 main.py --source VIDEOSOURCE --groundtruths PATHTOCSV --skip-frames NUMOFFRAMES
```
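For example, with a sample clip and its ground-truth file (the file names and frame count here are placeholders):

```
python3 main.py --source inputs/video.mp4 --groundtruths inputs/video.csv --skip-frames 30
```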