Caltech-UCSD Birds-200-2011 (CUB-200-2011) is an extended version of the CUB-200 dataset, with roughly double the number of images per class and new part location annotations.
- Number of categories: 200
- Number of images: 11,788
- Annotations per image: 15 Part Locations, 312 Binary Attributes, 1 Bounding Box
Link: http://www.vision.caltech.edu/visipedia/CUB-200-2011.html
The structure of the dataset is as of the follwing
- images/
- images.txt (
<image_id> <image_name>
) - train_test_split.txt (
<image_id> <is_training_image>
) - classes.txt (
<class_id> <class_name>
) - image_class_labels.txt (
<image_id> <class_id>
)
- images.txt (
- parts/
- parts.txt (
<part_id> <part_name>
) - part_locs.txt (
<image_id> <part_id> <x> <y> <visible>
) - part_click_locs.txt (
<image_id> <part_id> <x> <y> <visible> <time>
)
- parts.txt (
- attributes/
- attributes.txt (
<attribute_id> <attribute_name>
) - certainties.txt (
<certainty_id> <certainty_name>
) - image_attribute_labels.txt (
<image_id> <attribute_id> <is_present> <certainty_id> <time>
)
- attributes.txt (
- bounding_boxes.txt (
<image_id> <x> <y> <width> <height>
)