

This dataset consists of 4,502 images of healthy and unhealthy plant leaves, divided into 22 categories by species and state of health. The images are in high-resolution JPG format.

No files have the label prefix 0000, so the label encoding is shifted by one (e.g. a file with label prefix 0001 gets encoded label 0).
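The shift by one can be illustrated with a minimal sketch. It assumes that the label prefix is the first four characters of the file name and that valid prefixes run from 0001 to 0022; the helper `encode_label` and the example file name are hypothetical and are not taken from the dataset's own loading code.

```python
NUM_CLASSES = 22  # number of categories stated in the dataset description


def encode_label(filename: str) -> int:
    """Map a four-digit label prefix to a zero-based class label.

    Assumption: the prefix is the first four characters of the file name
    (e.g. "0001_0001.JPG"). This layout is illustrative only.
    """
    prefix = filename[:4]
    if len(prefix) != 4 or not prefix.isdigit():
        raise ValueError(f"no four-digit label prefix in {filename!r}")
    # Prefix 0000 is unused, so prefix 0001 becomes label 0.
    label = int(prefix) - 1
    if not 0 <= label < NUM_CLASSES:
        raise ValueError(f"label prefix {prefix} is outside 0001..0022")
    return label


print(encode_label("0001_0001.JPG"))  # prints 0
```

Under these assumptions, a file with prefix 0022 maps to label 21, and a prefix of 0000 is rejected because no class corresponds to it.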

Dataset URL: License:

Split    Examples
'train'  4,502
  • Feature structure:
    'image': Image(shape=(None, None, 3), dtype=uint8),
    'image/filename': Text(shape=(), dtype=string),
    'label': ClassLabel(shape=(), dtype=int64, num_classes=22),
  • Feature documentation:
Feature         Class       Shape            Dtype   Description
image           Image       (None, None, 3)  uint8
image/filename  Text                         string
label           ClassLabel                   int64


  • Citation:
  author={Siddharth Singh Chouhan, Ajay Kaul, Uday Pratap Singh, Sanjeev
  title={A Database of Leaf Images: Practice towards Plant Conservation with Plant Pathology},
  howpublished={Mendeley Data},