We are aggregating high-quality public datasets along with datasets built by the Superb AI team. Use these to kick off a project or to enrich your current training datasets.

Want to publish your dataset? Contact us.
Bounding Box
Segmentation

COCO Dataset

COCO is a dataset created with the goal of advancing the state of the art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding.
COCO Consortium
Bounding Box
Polygon

RarePlanes Dataset

RarePlanes is a machine learning dataset that incorporates both real and synthetically generated satellite imagery.
CosmiQ Works
Bounding Box
Polygon

Food Recognition Challenge Dataset

This is a novel dataset of food images collected through the MyFoodRepo app where numerous volunteer Swiss users provide images of their daily food intake in the context of a digital cohort called Food & You.
AIcrowd
Bounding Box

WIDERFACE

A face detection benchmark dataset whose images are selected from the publicly available WIDER dataset.
The Chinese University of Hong Kong
Bounding Box
OCR

CCPD

A large and comprehensive license plate dataset. All images are taken manually by workers of a roadside parking management company and are annotated carefully.
University of Science and Technology of China
Bounding Box

AU-AIR

The AU-AIR dataset is the first multi-modal UAV dataset for object detection.
Aarhus University
Classification

Covid-19 Image Dataset

A chest X-ray image dataset intended to help deep learning and AI enthusiasts contribute to improving COVID-19 detection using chest X-rays alone.
The University of Montreal
Classification

Flower Image Dataset

An extensive flower image dataset with 10 different types of flowers.
Aksha Srivastava
Bounding Box
Classification

The Oxford-IIIT Pet Dataset

A 37-category pet dataset with roughly 200 images for each class. The images have large variations in scale, pose, and lighting. All images have an associated ground truth annotation of breed and head ROI.
UK India Education and Research Initiative (UKIERI) and ERC Grant VisRec.
Bounding Box
Segmentation

Pascal VOC

A dataset from the VOC challenges. It provides standardized image datasets for object class recognition.
Oxford University