A PyTorch implementation of RetinaNet for object detection, as described in the paper Focal Loss for Dense Object Detection. The code is heavily influenced by Detectron2, the torchvision implementation of RCNN models, and the FastAI implementation.
- Create the AnchorGenerator.
- Create the ResNet based BackBone model.
- Create the FeaturePyramid architecture as described in https://arxiv.org/abs/1612.03144.
- Create the Focal Loss & the Regression and Classification heads for RetinaNet.
- Assemble RetinaNet using BackBone => FeaturePyramid => AnchorGenerator => Regression & Classification.
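A conceptual sketch of that assembly order (the class and argument names below are illustrative placeholders, not the repo's actual API):

import torch.nn as nn

class RetinaNetSketch(nn.Module):
    # Illustrative composition: each component is a plug-in nn.Module.
    def __init__(self, backbone, fpn, anchor_generator, head):
        super().__init__()
        self.backbone = backbone                  # ResNet feature extractor
        self.fpn = fpn                            # FeaturePyramid over the backbone outputs
        self.anchor_generator = anchor_generator  # anchors for every pyramid level
        self.head = head                          # shared Classification + Regression head

    def forward(self, images):
        features = self.fpn(self.backbone(images))         # multi-scale feature maps
        anchors = self.anchor_generator(images, features)  # one set of anchors per level
        cls_logits, bbox_deltas = self.head(features)      # per-anchor predictions
        return anchors, cls_logits, bbox_deltas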
Ensure that python>=3.6, torch>=1.6.0 and torchvision>=0.7.0 are installed.
$ git clone https://github.com/benihime91/pytorch_retinanet.git
$ cd pytorch_retinanet
$ pip install -r requirements.txt
Note: training will fail for pytorch-lightning versions >= 1.0.0.
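To stay on a compatible release, pin pytorch-lightning below 1.0.0 (the exact supported version should match the one in requirements.txt):

$ pip install "pytorch-lightning<1.0.0"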
Clone the GitHub repo:
$ git clone https://github.com/benihime91/pytorch_retinanet.git
For an easy training pipeline, we recommend using pytorch-lightning for training and testing. First, open the hparams.yaml file and modify it as needed; instructions for modifying it are present inside the file.
Create a Python script inside the retinanet repo. Name it whatever you want and then insert the following lines:
from omegaconf import OmegaConf
from pytorch_lightning import Trainer
from model import RetinaNetModel
# load in the hparams yaml file
hparams = OmegaConf.load("hparams.yaml")
# instantiate lightning module
model = RetinaNetModel(hparams)
# Instantiate Trainer
trainer = Trainer()
# start training
trainer.fit(model)
# to test model using COCO API
trainer.test(model)
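The Trainer accepts the usual pytorch-lightning flags, and a trained model can be restored through the standard LightningModule checkpoint API. The flag values and checkpoint path below are illustrative, not the repo's defaults:

from pytorch_lightning import Trainer
from model import RetinaNetModel

# illustrative flags: train on one GPU for 50 epochs
trainer = Trainer(gpus=1, max_epochs=50)

# restore a trained model from a saved checkpoint (path is hypothetical)
model = RetinaNetModel.load_from_checkpoint("checkpoints/retinanet.ckpt")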
The data can be loaded into the model in one of 3 ways. This is controlled via the dataset.kind parameter in hparams.yaml.
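Since hparams.yaml is loaded with OmegaConf, these values can also be overridden in code before instantiating the model; a minimal sketch (the root_dir value is illustrative):

from omegaconf import OmegaConf

hparams = OmegaConf.load("hparams.yaml")
hparams.dataset.kind = "coco"                 # pick one of the 3 loading modes
hparams.dataset.root_dir = "/Datasets/coco/"  # illustrative path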
- To load in the COCO-2017 dataset:
  - set dataset.kind = "coco"
  - set dataset.root_dir = {path to the coco dataset}

  dataset:
    kind: coco
    root_dir: /Datasets/coco/
- If the dataset is in Pascal-VOC format:
  - set dataset.kind = "pascal"
  - set dataset.trn_paths = [path_to_annotations, path_to_images]
  - similarly set the paths for the validation and test datasets

  dataset:
    kind: pascal
    trn_paths:
      - /content/data_train/Annotations/
      - /content/data_train/Images/
    test_paths:
      - /content/data_test/Annotations/
      - /content/data_test/Images/
    val_paths:
      - /content/data_validation/Annotations/
      - /content/data_validation/Images/
Note:
- the image and annotation folders can be the same folder.
- val_paths is optional; if there is no validation data, set val_paths: False
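As a quick sanity check before training, one can verify that every annotation file has a matching image. The sketch below assumes VOC-style .xml annotations and .jpg images; the paths are illustrative:

import os

ann_dir = "/content/data_train/Annotations/"  # illustrative paths
img_dir = "/content/data_train/Images/"

for fname in os.listdir(ann_dir):
    stem, _ = os.path.splitext(fname)
    image_path = os.path.join(img_dir, stem + ".jpg")
    assert os.path.exists(image_path), f"missing image for {fname}"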
- The datasets can also be loaded from a csv file. The csv file should be formatted as follows:
filename,width,height,class,xmin,ymin,xmax,ymax,labels
Images/007826.jpg,500,375,diningtable,80,217,320,273,11
Images/007826.jpg,500,375,chair,197,193,257,326,9
...
...
Images/006286.jpg,500,375,diningtable,402,219,500,375,11
Images/006286.jpg,500,375,diningtable,347,177,405,216,11
- filename: path to the image
- width: width of the image [Optional]
- height: height of the image [Optional]
- class: class label for the particular annotation
- labels: integer label for the particular annotation
- xmin, ymin, xmax, ymax: absolute bounding-box co-ordinates
dataset:
  kind: csv
  trn_paths: "train_data.csv"
  val_paths: "val_data.csv"  # this is optional
  test_paths: "test_data.csv"
Note:
- if the validation dataset is not present, set hparams.dataset.val_paths = False.
- the model computes the COCO-API evaluation metrics on the test dataset.
- for the csv dataset, each entry in the csv file should correspond to a unique bounding-box.
- labels should start from 1, as the 0th label is reserved for the "background" class (a sketch for generating such labels follows the example below).
- to generate a LABEL_MAP to be used for visualization purposes:
from utils.pascal import generate_pascal_category_names
import pandas as pd
path = ... # path to the csv file containing the annotations
df = pd.read_csv(path)
# Generate a label map
LABEL_MAP = generate_pascal_category_names(df)
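Relatedly, if the csv has class names but no integer labels column, one way to generate it is sketched below; labels start from 1 because 0 is reserved for the background class (the file name is illustrative):

import pandas as pd

df = pd.read_csv("train_data.csv")  # illustrative file name
# map each class name to an integer label starting from 1 (0 = background)
class_names = sorted(df["class"].unique())
df["labels"] = df["class"].map({name: i + 1 for i, name in enumerate(class_names)})
df.to_csv("train_data.csv", index=False)

To draw boxes and labels on an image, use the visualize_boxes_and_labels_on_image_array helper from utils: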
from utils import visualize_boxes_and_labels_on_image_array as vis_bbs
from PIL import Image
import cv2
import numpy as np
image = cv2.cvtColor(cv2.imread(image_path), cv2.COLOR_BGR2RGB)
# or :
# image = Image.open(image_path)
# image = np.array(image)
# normalize the image
image = image / 255.0
# LABEL_MAP should be a list containing the names
# of the categories. Each category should be at the index
# corresponding to its integer label.
# The 0th index is reserved for the background class.
LABEL_MAP = ...
# (N,4) dimensional array containing the absolute bounding-box
# co-ordinates in xmin, ymin, xmax, ymax format.
boxes = ...
# (N) dimensional array containing the integer labels
labels = ...
# (N) dimensional array containing the confidence probability for each box.
# This can also be None.
scores = ...
# draw bounding-box over the loaded image
im = vis_bbs(image, boxes, labels, scores, LABEL_MAP)
# this function returns a PIL image instance
# to view the image
im.show()
# or in jupyter-notebooks use : im
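Since vis_bbs returns a PIL Image, the result can also be written straight to disk (the file name is illustrative):

im.save("predictions.jpg")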
References:
- Focal Loss for Dense Object Detection: https://arxiv.org/abs/1708.02002
- Detectron2: https://github.com/facebookresearch/detectron2
- torchvision: https://github.com/pytorch/vision
- FastAI pascal notebook: https://github.com/fastai/course-v3/blob/master/nbs/dl2/pascal.ipynb
- TensorFlow Object Detection API: https://github.com/tensorflow/models/tree/master/research/object_detection
- PyTorchLightning wheat example: https://github.com/PyTorchLightning/wheat/tree/dee605b0bf5cf6b0ab08755c45e38dc07d338bb7