Computer Vision · Common Datasets · Object Detection

Detection

PASCAL VOC 2009 dataset
Classification/Detection Competitions, Segmentation Competition, Person Layout Taster Competition datasets
LabelMe dataset
LabelMe is a web-based image annotation tool that allows researchers to label images and share the annotations with the rest of the community. If you use the database, we only ask that you contribute to it, from time to time, by using the labeling tool.
BioID Face Detection Database
1521 images with human faces, recorded under natural conditions, i.e. varying illumination and complex background. The eye positions have been set manually.
CMU/VASC & PIE Face dataset
Yale Face dataset
Caltech
Cars, Motorcycles, Airplanes, Faces, Leaves, Backgrounds
Caltech 101
Pictures of objects belonging to 101 categories
Caltech 256
Pictures of objects belonging to 256 categories
Daimler Pedestrian Detection Benchmark
15,560 pedestrian and non-pedestrian samples (image cut-outs) and 6,744 additional full images not containing pedestrians for bootstrapping. The test set contains more than 21,790 images with 56,492 pedestrian labels (fully visible or partially occluded), captured from a vehicle in urban traffic.
CVC Pedestrian Datasets
MIT Pedestrian dataset
CBCL Pedestrian Database
MIT Face dataset
CBCL Face Database
MIT Car dataset
CBCL Car Database
MIT Street dataset
CBCL Street Database
INRIA Person Data Set
A large set of marked up images of standing or walking people
INRIA car dataset
A set of car and non-car images taken in a parking lot near INRIA
INRIA horse dataset
A set of horse and non-horse images
H3D Dataset
3D skeletons and segmented regions for 1000 people in images
HRI RoadTraffic dataset
A large-scale vehicle detection dataset
BelgaLogos
10,000 images of natural scenes, with 37 different logos and 2,695 logo instances, each annotated with a bounding box.
FlickrBelgaLogos
10,000 images of natural scenes collected from Flickr, with 2,695 logo instances cut and pasted from the BelgaLogos dataset.
FlickrLogos-32
The dataset FlickrLogos-32 contains photos depicting logos and is meant for the evaluation of multi-class logo detection/recognition as well as logo retrieval methods on real-world images. It consists of 8240 images downloaded from Flickr.
TME Motorway Dataset
More than 30,000 frames with vehicle rear annotation and classification (cars and trucks) on motorway/highway sequences. Annotations were generated semi-automatically using laser-scanner data. Distance estimation and a consistent target ID over time are available.
PHOS (Color Image Database for illumination invariant feature selection)
PHOS is a color image database of 15 scenes captured under different illumination conditions. Specifically, every scene contains 15 different images: 9 captured under various strengths of uniform illumination and 6 under different degrees of non-uniform illumination. The images contain objects of different shape, color, and texture and can be used for illumination-invariant feature detection and selection.
CaliforniaND: An Annotated Dataset For Near-Duplicate Detection In Personal Photo Collections
California-ND contains 701 photos taken directly from a real user’s personal photo collection, including many challenging non-identical near-duplicate cases, without the use of artificial image transformations. The dataset is annotated by 10 different subjects, including the photographer, regarding near duplicates.
USPTO Algorithm Challenge, Detecting Figures and Part Labels in Patents
Contains drawing pages from US patents with manually labeled figure and part labels.
Abnormal Objects Dataset
Contains 6 object categories similar to object categories in Pascal VOC that are suitable for studying the abnormalities stemming from objects.
Human detection and tracking using RGB-D camera
Collected in a clothing store. Captured with a Kinect (640×480, about 30 fps).
Multi-Task Facial Landmark (MTFL) dataset
This dataset contains 12,995 face images collected from the Internet. The images are annotated with (1) five facial landmarks, (2) attributes of gender, smiling, wearing glasses, and head pose.
WIDER FACE: A Face Detection Benchmark
The WIDER FACE dataset is a face detection benchmark with images selected from the publicly available WIDER dataset. It contains 32,203 images and 393,703 face annotations.
PIROPO Database: People in Indoor ROoms with Perspective and Omnidirectional cameras
Multiple sequences recorded in two different indoor rooms, using both omnidirectional and perspective cameras, containing people in a variety of situations (people walking, standing, and sitting). Both annotated and non-annotated sequences are provided, where ground truth is point-based. In total, more than 100,000 annotated frames are available.
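
As a rough way to compare the entries above, the following Python sketch derives annotation density (labels per image) from the counts quoted in this list. The class name `DatasetStats` and the `CATALOG` list are hypothetical helpers introduced only for illustration; the numbers are taken from the descriptions above, and the Daimler figure is approximate because the list only says "more than 21,790 images".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DatasetStats:
    """Image and annotation counts as quoted in the list above (hypothetical helper)."""
    name: str
    images: int
    annotations: int

    @property
    def annotations_per_image(self) -> float:
        # Average number of labeled instances per image.
        return self.annotations / self.images


# Counts copied from the dataset descriptions in this list.
CATALOG = [
    DatasetStats("WIDER FACE", images=32_203, annotations=393_703),
    DatasetStats("BelgaLogos", images=10_000, annotations=2_695),
    # The list states "more than 21,790 images", so this density is an upper estimate.
    DatasetStats("Daimler test set", images=21_790, annotations=56_492),
]

# PHOS: 15 scenes, each with 9 uniform + 6 non-uniform illumination images.
PHOS_IMAGES = 15 * (9 + 6)

if __name__ == "__main__":
    for d in CATALOG:
        print(f"{d.name}: {d.annotations_per_image:.2f} annotations per image")
    print(f"PHOS: {PHOS_IMAGES} images in total")
```

Under these counts, WIDER FACE averages roughly 12 faces per image, while BelgaLogos has well under one logo instance per image, which suggests that many of its images contain no logo at all.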


Data source

Reposted from blog.csdn.net/u010189457/article/details/78472266