Annotation-converters

This Repo covers all formats of annotations for Object Detection and can easily convert from one form to another using attached scripts

All computer vision problems require annotated datasets and for training deep neural networks data needs to be annotated in defined form. For Object Detection, there are many available formats for preparing and annotating your dataset but the most popular and used formats are Pascal VOC and Microsoft COCO.

MS COCO

COCO is large scale images with Common Objects in Context (COCO) for object detection, segmentation, and captioning data set. COCO has 1.5 million object instances for 80 object categories. COCO stores annotations in a JSON file.
COCO Bounding box: (x-top left, y-top left, width, height)

Pascal VOC

Pascal Visual Object Classes(VOC) provides standardized image data sets for object detection. Pascal VOC is an XML file, unlike COCO which has a JSON file.
Pascal VOC Bounding box :(xmin-top left, ymin-top left, xmax-bottom right, ymax-bottom right)

Darknet YOLO

YOLO reads or predicts bounding boxes in different format compared to VOC or COCO.
YOLO Bounding box : (x_center, y_center, width, height) --> all these coordinates are normalized with respect to image width & height.

In Pascal VOC and YOLO we create a file for each of the image in the dataset. In COCO we have one file each, for entire dataset for training, testing and validation.

Usually, when working on custom datasets we end up wasting lot of time in converting annotations from one format to another suitable to object detection models or frameworks. This is really frustrating and I compiled few annotations converter scripts which covers most of the cases and saves you time! You can now focus more on productive tasks such as improving model performance or training more efficiently.

Scripts

JSON_to_txt.py
- It converts MS COCO JSON file to a text file for each image and the format is {class_id, x_min, y_min, width, height}
Yolo_to_Voc.py
- It converts YOLO text files to Pascal VOC format XML files
- (x_c_n, y_c_n, width_n, height_n) --> (x_min, y_min, x_max, y_max)
XML_to_JSON.py && voc2coco.py
- These python scripts convert all XML files of a dataset into MS COCO readable JSON file.
  python XML_to_JSON.py ./annotations_dir/ ./json_dest_dir/coco_output.json
gt_yolo2json.py && pred_yolo2json.py
- It converts all YOLO text files into MS COCO readable JSON file.
- For ground truth YOLO text files --> gt_yolo2json.py
- For YOLO predicted text files --> pred_yolo2json.py
cocoGT_to_Yolo.py
- It converts MS COCO ground truth text files to YOLO format. It also has a function to convert YOLO text files to VOC format. Feel free to change the code and switch between the functions.

JSON --> VOC XML files

Json2PascalVoc is a Python library for converting some special Json strings to PascalVOC format XML files.
pip install Json2PascalVoc

from Json2PascalVoc.Converter import Converter
   
myConverter = Converter()
#returns a Converter Object
myConverter.convertJsonToPascal("data.json")
#Converts Json to PascalVOC XML and saves the XML file to the related file path

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Annotation-converters

MS COCO

Pascal VOC

Darknet YOLO

Scripts

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
JSON_to_txt.py		JSON_to_txt.py
README.md		README.md
XML_to_JSON.py		XML_to_JSON.py
cocoGT_to_Yolo.py		cocoGT_to_Yolo.py
gt_yolo2json.py		gt_yolo2json.py
pred_yolo2json.py		pred_yolo2json.py
voc2coco.py		voc2coco.py
yolo_to_voc.py		yolo_to_voc.py

ngun7/Annotation-Converters

Folders and files

Latest commit

History

Repository files navigation

Annotation-converters

MS COCO

Pascal VOC

Darknet YOLO

Scripts

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages