Skip to content

Instantly share code, notes, and snippets.

@vchrombie
Last active September 17, 2019 05:50
Show Gist options
  • Save vchrombie/128faba8b06e598a444fb8ac8a402e0b to your computer and use it in GitHub Desktop.
Save vchrombie/128faba8b06e598a444fb8ac8a402e0b to your computer and use it in GitHub Desktop.
GANBOYS: Final Year Project Dataset Description

Caltech-UCSD Birds-200-2011 (CUB-200-2011) is an extended version of the CUB-200 dataset, with roughly double the number of images per class and new part location annotations.

  • Number of categories: 200
  • Number of images: 11,788
  • Annotations per image: 15 Part Locations, 312 Binary Attributes, 1 Bounding Box

Link: http://www.vision.caltech.edu/visipedia/CUB-200-2011.html

The structure of the dataset is as of the follwing

  • images/
    • images.txt (<image_id> <image_name>)
    • train_test_split.txt (<image_id> <is_training_image>)
    • classes.txt (<class_id> <class_name>)
    • image_class_labels.txt (<image_id> <class_id>)
  • parts/
    • parts.txt (<part_id> <part_name>)
    • part_locs.txt (<image_id> <part_id> <x> <y> <visible>)
    • part_click_locs.txt (<image_id> <part_id> <x> <y> <visible> <time>)
  • attributes/
    • attributes.txt (<attribute_id> <attribute_name>)
    • certainties.txt (<certainty_id> <certainty_name>)
    • image_attribute_labels.txt (<image_id> <attribute_id> <is_present> <certainty_id> <time>)
  • bounding_boxes.txt (<image_id> <x> <y> <width> <height>)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment