Fine-grained image classification (Birds)

The goal of this task is to develop an algorithm to learn to classify images containing objects of the same category (e.g. birds, dogs) into specific sub-categories, i.e. specific species. The dataset is available:

Caltech-UCSD Birds-200-2011: 200 categories of birds http://www.vision.caltech.edu/visipedia/CUB-200-2011.html

Python workbooks

Colaboration

add this to import the data info for bounding_boxes, classes, image_class_labels, images, train_test_split, attributes, certainties, class_attribute_labels_continuous, image_attribute_labels, part_click_locs, part_locs & parts

git_fldr = '/content/MLBirds/'
  
  if os.path.exists(git_fldr):
    %cd '/content/MLBirds/'
    !git pull
  else:
    !git clone https://github.com/TeaWithLucas/MLBirds.git
    %cd '/content/MLBirds/
    
  import data_load as data

The Caltech-UCSD Birds-200-2011 Dataset

For more information about the dataset, visit the project website:

http://www.vision.caltech.edu/visipedia

If you use the dataset in a publication, please cite the dataset in the style described on the dataset website (see url above).

Directory Information

images/ The images organized in subdirectories based on species. See IMAGES AND CLASS LABELS section below for more info.
parts/ 15 part locations per image. See PART LOCATIONS section below for more info.
attributes/ 322 binary attribute labels from MTurk workers. See ATTRIBUTE LABELS section below for more info.

Images and Class Labels

Images are contained in the directory images/, with 200 subdirectories (one for each bird species)

List of image files (images.txt)

The list of image file names is contained in the file images.txt, with each line corresponding to one image:

<image_id> <image_name>

Train/test split (train_test_split.txt)

The suggested train/test split is contained in the file train_test_split.txt, with each line corresponding to one image:

<image_id> <is_training_image>

where <image_id> corresponds to the ID in images.txt, and a value of 1 or 0 for <is_training_image> denotes that the file is in the training or test set, respectively.

List of class names (classes.txt)

The list of class names (bird species) is contained in the file classes.txt, with each line corresponding to one class:

<class_id> <class_name>

Image class labels (image_class_labels.txt)

The ground truth class labels (bird species labels) for each image are contained in the file image_class_labels.txt, with each line corresponding to one image:

<image_id> <class_id>

where <image_id> and <class_id> correspond to the IDs in images.txt and classes.txt, respectively.

Bounding Boxes

Each image contains a single bounding box label. Bounding box labels are contained in the file bounding_boxes.txt, with each line corresponding to one image:

<image_id>

where <image_id> corresponds to the ID in images.txt, and , , , and are all measured in pixels

Part Locations

List of part names (parts/parts.txt)

The list of all part names is contained in the file parts/parts.txt, with each line corresponding to one part:

<part_id> <part_name>

Part locations (parts/part_locs.txt)

The set of all ground truth part locations is contained in the file parts/part_locs.txt, with each line corresponding to the annotation of a particular part in a particular image:

<image_id> <part_id>

where <image_id> and <part_id> correspond to the IDs in images.txt and parts/parts.txt, respectively. and denote the pixel location of the center of the part. is 0 if the part is not visible in the image and 1 otherwise.

MTurk part locations (parts/part_click_locs.txt)

A set of multiple part locations for each image and part, as perceived by multiple MTurk users is contained in parts/part_click_locs.txt, with each line corresponding to the annotation of a particular part in a particular image by a different MTurk worker:

<image_id> <part_id>

where <image_id>, <part_id>, , are in the same format as defined in parts/part_locs.txt, and is the time in seconds spent by the MTurk worker.

Attribute Labels

List of attribute names (attributes/attributes.txt)

The list of all attribute names is contained in the file attributes/attributes.txt, with each line corresponding to one attribute:

<attribute_id> <attribute_name>

List of certainty names (attributes/certainties.txt)

The list of all certainty names (used by workers to specify their certainty of an attribute response of is contained in the file attributes/certainties.txt, with each line corresponding to one certainty:

<certainty_id> <certainty_name>

MTurk image attribute labels (attributes/image_attribute_labels.txt)

The set of attribute labels as perceived by MTurkers for each image is contained in the file attributes/image_attribute_labels.txt, with each line corresponding to one image/attribute/worker triplet:

<image_id> <attribute_id> <is_present> <certainty_id>

where <image_id>, <attribute_id>, <certainty_id> correspond to the IDs in images.txt, attributes/attributes.txt, and attributes/certainties.txt respectively. <is_present> is 0 or 1 (1 denotes that the attribute is present). denotes the time spent by the MTurker in seconds.

Class attribute labels (attributes/class_attribute_labels_continuous.txt)

Attributes on a per-class level--in a similar format to the Animals With Attributes dataset--are contained in attributes/class_attribute_labels_continuous.txt. The file contains 200 lines and 312 space-separated columns. Each line corresponds to one class (in the same order as classes.txt) and each column contains one real-valued number corresponding to one attribute (in the same order as attributes.txt). The number is the percentage of the time (between 0 and 100) that a human thinks that the attribute is present for a given class

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
analysis_data		analysis_data
docs		docs
images		images
segmentations		segmentations
.gitignore		.gitignore
CMT316_Coursework2.pdf		CMT316_Coursework2.pdf
LICENSE		LICENSE
README.md		README.md
analysis.ipynb		analysis.ipynb
data_load.py		data_load.py
implementation.ipynb		implementation.ipynb
preprocessing.ipynb		preprocessing.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Fine-grained image classification (Birds)

Python workbooks

Colaboration

The Caltech-UCSD Birds-200-2011 Dataset

Directory Information

Images and Class Labels

List of image files (images.txt)

Train/test split (train_test_split.txt)

List of class names (classes.txt)

Image class labels (image_class_labels.txt)

Bounding Boxes

Part Locations

List of part names (parts/parts.txt)

Part locations (parts/part_locs.txt)

MTurk part locations (parts/part_click_locs.txt)

Attribute Labels

List of attribute names (attributes/attributes.txt)

List of certainty names (attributes/certainties.txt)

MTurk image attribute labels (attributes/image_attribute_labels.txt)

Class attribute labels (attributes/class_attribute_labels_continuous.txt)

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

TeaWithLucas/MLBirds

Folders and files

Latest commit

History

Repository files navigation

Fine-grained image classification (Birds)

Python workbooks

Colaboration

The Caltech-UCSD Birds-200-2011 Dataset

Directory Information

Images and Class Labels

List of image files (images.txt)

Train/test split (train_test_split.txt)

List of class names (classes.txt)

Image class labels (image_class_labels.txt)

Bounding Boxes

Part Locations

List of part names (parts/parts.txt)

Part locations (parts/part_locs.txt)

MTurk part locations (parts/part_click_locs.txt)

Attribute Labels

List of attribute names (attributes/attributes.txt)

List of certainty names (attributes/certainties.txt)

MTurk image attribute labels (attributes/image_attribute_labels.txt)

Class attribute labels (attributes/class_attribute_labels_continuous.txt)

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages