Dataset Types

The DL Workbench supports the following dataset types: ImageNet, Pascal VOC, COCO, Common Semantic Segmentation, and unannotated datasets.

Your dataset does not have to contain images from the official databases that define these types, such as ImageNet or Pascal VOC, but it must follow one of the supported dataset formats.

To learn how to download and prepare datasets, refer to Download and Cut Datasets.

NOTE: For COCO datasets, only the object-detection and instance-segmentation tasks are supported.

ImageNet

ImageNet is a well-known dataset used to train classification models. It consists of an annotation file and images:

|-- annotation.txt
|-- 0001.jpg
|-- 0002.jpg
|...
|-- n.jpg

The annotation file is organized as follows:

0001.jpg <label ID>
0002.jpg <label ID>
...
n.jpg <label ID>
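
As a quick sanity check before upload, the annotation format above can be parsed with a few lines of Python. This is only a sketch: the function names are hypothetical, and it assumes that each label ID is a non-negative integer separated from the image name by whitespace.

```python
from pathlib import Path


def parse_annotation(text):
    """Parse '<image name> <label ID>' lines into a dict of name -> label ID."""
    labels = {}
    for line_no, line in enumerate(text.splitlines(), start=1):
        if not line.strip():
            continue  # ignore blank lines
        # Split on the last run of whitespace: the label ID is the final token.
        parts = line.rsplit(None, 1)
        # Assumption: label IDs are non-negative integers.
        if len(parts) != 2 or not parts[1].isdigit():
            raise ValueError(
                f"line {line_no}: expected '<image> <label ID>', got {line!r}"
            )
        labels[parts[0].strip()] = int(parts[1])
    return labels


def missing_images(dataset_dir, labels):
    """Return annotated image names that have no matching file in dataset_dir."""
    root = Path(dataset_dir)
    return sorted(name for name in labels if not (root / name).is_file())
```

Calling `parse_annotation(Path("annotation.txt").read_text())` and then `missing_images(".", labels)` from the dataset folder lists any annotated images that are absent from the archive contents.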

Pascal Visual Object Classes (Pascal VOC)

Pascal VOC is a well-known dataset used to train object-detection and semantic-segmentation models. Pascal VOC datasets consist of several folders containing annotation files and image indices.

A Pascal VOC dataset archive is organized as follows:

|-- VOCdevkit
    |-- VOC
        |-- Annotations
            |-- 0001.xml
            |-- 0002.xml
            ...
            |-- n.xml
        |-- ImageSets
            |-- Layout
                |-- test.txt
            |-- Main
                |-- 0001_test.txt
                |-- 0002_test.txt
                ...
                |-- n_test.txt
            |-- Segmentation
                |-- test.txt
        |-- Images
            |-- 0001.jpg
            |-- 0002.jpg
            ...
            |-- n.jpg
        |-- SegmentationClass
            |-- 0001.png
            |-- 0002.png
            ...
            |-- n.png
        |-- SegmentationObject
            |-- 0001.png
            |-- 0002.png
            ...
            |-- n.png
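
The folder layout above can be verified before packing the archive. The following is a minimal sketch, not part of the DL Workbench: the helper name is hypothetical, and the folder names are copied from the layout shown in this section, so adjust them if your archive differs.

```python
from pathlib import Path

# Folder names taken from the layout above, relative to the VOC folder.
EXPECTED_DIRS = [
    "Annotations",
    "ImageSets/Layout",
    "ImageSets/Main",
    "ImageSets/Segmentation",
    "Images",
    "SegmentationClass",
    "SegmentationObject",
]


def missing_voc_dirs(voc_root):
    """Return the expected subfolders that are absent under voc_root."""
    root = Path(voc_root)
    return [d for d in EXPECTED_DIRS if not (root / d).is_dir()]
```

For example, `missing_voc_dirs("VOCdevkit/VOC")` returns an empty list when every expected folder is present.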

Common Objects in Context (COCO)

The COCO dataset is used for object detection, instance segmentation, person keypoint detection, stuff segmentation, and caption generation.

A COCO dataset is downloaded as two separate archives. Combine them into a single archive as described in the Download COCO Dataset section of Download and Cut Datasets. Before you upload a COCO dataset to the DL Workbench, make sure the archive contains the following files:

|-- val
    |-- 0001.jpg
    |-- 0002.jpg
    ...
    |-- n.jpg
|-- annotations
    |-- instances_val.json

The JSON file with annotations is organized as follows:

{
  "info": <info>,
  "images": [<images>],
  "licenses": [<licenses>],
  "annotations": [<annotations>]
}
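
To confirm that an annotation file has the top-level sections listed above, a short check like the following may help. It is a sketch under the assumption that only the four keys shown in this section need to be verified; the function name is hypothetical.

```python
import json

# Top-level keys taken from the structure shown above.
REQUIRED_KEYS = ("info", "images", "licenses", "annotations")


def missing_coco_keys(json_text):
    """Return the required top-level keys that are absent from the JSON text."""
    data = json.loads(json_text)
    return [key for key in REQUIRED_KEYS if key not in data]
```

Passing the contents of `annotations/instances_val.json` to `missing_coco_keys` returns an empty list when all four sections are present.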

Common Semantic Segmentation

Common Semantic Segmentation is a custom dataset type for semantic-segmentation models. It is intended to simplify the structure used by Pascal VOC.

A Common Semantic Segmentation dataset archive consists of folders with images and masks, and a JSON file with metainformation:

|-- meta.json
|-- Images
    |-- 0001.jpg
    |-- 0002.jpg
    ...
    |-- n.jpg
|-- Masks
    |-- 0001.png
    |-- 0002.png
    ...
    |-- n.png
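
The numbering in the layout above suggests that each image is paired with the mask that shares its file name stem (0001.jpg with 0001.png). Under that assumption, which this section does not state explicitly, a hypothetical helper such as the one below can list files that have no counterpart.

```python
from pathlib import Path


def unpaired_files(dataset_root):
    """Return (images without a mask, masks without an image) as lists of stems.

    Assumption: an image and its mask share the same file name stem,
    with .jpg images in Images/ and .png masks in Masks/.
    """
    root = Path(dataset_root)
    images = {p.stem for p in (root / "Images").glob("*.jpg")}
    masks = {p.stem for p in (root / "Masks").glob("*.png")}
    return sorted(images - masks), sorted(masks - images)
```

Two empty lists mean that every image has a mask and every mask has an image.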

The JSON metainformation file is organized as follows:

{
  "label_map": {<map>},
  "background_label": "<label>",
  "segmentation_colors": [<colors>]
}
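
The shape of the metainformation file can be checked with a small script. This sketch infers the container types from the template above (an object for label_map, a string for background_label, an array for segmentation_colors); it does not validate the contents of those containers, and the function name is hypothetical.

```python
import json

# Container types inferred from the template above: {} -> dict, "" -> str, [] -> list.
EXPECTED_TYPES = {
    "label_map": dict,
    "background_label": str,
    "segmentation_colors": list,
}


def check_meta(json_text):
    """Return a list of problems found in the meta.json text (empty if none)."""
    meta = json.loads(json_text)
    problems = []
    for key, expected in EXPECTED_TYPES.items():
        if key not in meta:
            problems.append(f"missing key: {key}")
        elif not isinstance(meta[key], expected):
            actual = type(meta[key]).__name__
            problems.append(f"{key}: expected {expected.__name__}, got {actual}")
    return problems
```

An empty result means that all three keys are present and have the expected container types.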

Unannotated Dataset

Unannotated datasets contain no annotations. As a result, models in configurations that use an unannotated dataset can be calibrated only in the simplified mode of the Maximum Performance Calibration method, and accuracy cannot be measured on such datasets.

The archive is organized as follows:

|-- 0001.jpg
|-- 0002.jpg
|...
|-- n.jpg
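
Because the archive is flat, it can be assembled directly from a folder of images. The sketch below assumes that a .zip archive is acceptable and that all images use the .jpg extension; neither point is stated in this section, and the function name is hypothetical.

```python
import zipfile
from pathlib import Path


def pack_images(image_dir, archive_path):
    """Write every .jpg in image_dir to the root of a new .zip archive.

    Returns the list of file names stored in the archive.
    """
    images = sorted(Path(image_dir).glob("*.jpg"))
    with zipfile.ZipFile(archive_path, "w") as archive:
        for image in images:
            # arcname drops the folder part so files land at the archive root.
            archive.write(image, arcname=image.name)
    return [image.name for image in images]
```

For example, `pack_images("my_images", "dataset.zip")` stores the images without any enclosing folder, matching the layout above.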