penguins
Stay organized with collections
Save and categorize content based on your preferences.
Measurements for three penguin species observed in the Palmer Archipelago,
Antarctica.
These data were collected from 2007 - 2009 by Dr. Kristen Gorman with the
Palmer Station Long Term Ecological Research Program,
part of the US Long Term Ecological Research Network.
The data were originally imported from the
Environmental Data Initiative (EDI)
Data Portal, and are available for use by CC0 license ("No Rights Reserved") in
accordance with the Palmer Station Data Policy. This copy was imported from
Allison Horst's GitHub repository.
@Manual{,
title = {palmerpenguins: Palmer Archipelago (Antarctica) penguin data},
author = {Allison Marie Horst and Alison Presmanes Hill and Kristen B Gorman},
year = {2020},
note = {R package version 0.1.0},
doi = {10.5281/zenodo.3960218},
url = {https://allisonhorst.github.io/palmerpenguins/},
}
penguins/processed (default config)
Split |
Examples |
'train' |
334 |
FeaturesDict({
'features': Tensor(shape=(4,), dtype=float32),
'species': ClassLabel(shape=(), dtype=int64, num_classes=3),
})
Feature |
Class |
Shape |
Dtype |
Description |
|
FeaturesDict |
|
|
|
features |
Tensor |
(4,) |
float32 |
|
species |
ClassLabel |
|
int64 |
|
penguins/simple
Config description: penguins/simple
has been processed from the raw
dataset, with simplified class labels derived from text fields, missing
values marked as NaN/NA and retains only 7 significant features (n = 344).
Download size: 13.20 KiB
Dataset size: 56.10 KiB
Splits:
Split |
Examples |
'train' |
344 |
FeaturesDict({
'body_mass_g': float32,
'culmen_depth_mm': float32,
'culmen_length_mm': float32,
'flipper_length_mm': float32,
'island': ClassLabel(shape=(), dtype=int64, num_classes=3),
'sex': ClassLabel(shape=(), dtype=int64, num_classes=3),
'species': ClassLabel(shape=(), dtype=int64, num_classes=3),
})
Feature |
Class |
Shape |
Dtype |
Description |
|
FeaturesDict |
|
|
|
body_mass_g |
Tensor |
|
float32 |
|
culmen_depth_mm |
Tensor |
|
float32 |
|
culmen_length_mm |
Tensor |
|
float32 |
|
flipper_length_mm |
Tensor |
|
float32 |
|
island |
ClassLabel |
|
int64 |
|
sex |
ClassLabel |
|
int64 |
|
species |
ClassLabel |
|
int64 |
|
Supervised keys (See
as_supervised
doc):
({'body_mass_g': 'body_mass_g', 'culmen_depth_mm': 'culmen_depth_mm',
'culmen_length_mm': 'culmen_length_mm', 'flipper_length_mm':
'flipper_length_mm', 'island': 'island', 'sex': 'sex', 'species':
'species'}, 'species')
Examples
(tfds.as_dataframe):
penguins/raw
Config description: penguins/raw
is the original, unprocessed copy
from @allisonhorst, containing all 17 features, presented either as numeric
types or as raw text (n = 344).
Download size: 49.72 KiB
Dataset size: 164.51 KiB
Splits:
Split |
Examples |
'train' |
344 |
FeaturesDict({
'Body Mass (g)': float32,
'Clutch Completion': Text(shape=(), dtype=string),
'Comments': Text(shape=(), dtype=string),
'Culmen Depth (mm)': float32,
'Culmen Length (mm)': float32,
'Date Egg': Text(shape=(), dtype=string),
'Delta 13 C (o/oo)': float32,
'Delta 15 N (o/oo)': float32,
'Flipper Length (mm)': float32,
'Individual ID': Text(shape=(), dtype=string),
'Island': Text(shape=(), dtype=string),
'Region': Text(shape=(), dtype=string),
'Sample Number': int32,
'Sex': Text(shape=(), dtype=string),
'Species': Text(shape=(), dtype=string),
'Stage': Text(shape=(), dtype=string),
'studyName': Text(shape=(), dtype=string),
})
Feature |
Class |
Shape |
Dtype |
Description |
|
FeaturesDict |
|
|
|
Body Mass (g) |
Tensor |
|
float32 |
|
Clutch Completion |
Text |
|
string |
|
Comments |
Text |
|
string |
|
Culmen Depth (mm) |
Tensor |
|
float32 |
|
Culmen Length (mm) |
Tensor |
|
float32 |
|
Date Egg |
Text |
|
string |
|
Delta 13 C (o/oo) |
Tensor |
|
float32 |
|
Delta 15 N (o/oo) |
Tensor |
|
float32 |
|
Flipper Length (mm) |
Tensor |
|
float32 |
|
Individual ID |
Text |
|
string |
|
Island |
Text |
|
string |
|
Region |
Text |
|
string |
|
Sample Number |
Tensor |
|
int32 |
|
Sex |
Text |
|
string |
|
Species |
Text |
|
string |
|
Stage |
Text |
|
string |
|
studyName |
Text |
|
string |
|
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2022-12-16 UTC.
[null,null,["Last updated 2022-12-16 UTC."],[],[],null,["# penguins\n\n\u003cbr /\u003e\n\n- **Description**:\n\nMeasurements for three penguin species observed in the Palmer Archipelago,\nAntarctica.\n\nThese data were collected from 2007 - 2009 by Dr. Kristen Gorman with the\n[Palmer Station Long Term Ecological Research Program](https://pal.lternet.edu/),\npart of the [US Long Term Ecological Research Network](https://lternet.edu/).\nThe data were originally imported from the\n[Environmental Data Initiative](https://environmentaldatainitiative.org/) (EDI)\nData Portal, and are available for use by CC0 license (\"No Rights Reserved\") in\naccordance with the Palmer Station Data Policy. This copy was imported from\n[Allison Horst's GitHub repository](https://allisonhorst.github.io/palmerpenguins/articles/intro.html).\n\n- **Homepage** :\n \u003chttps://allisonhorst.github.io/palmerpenguins/\u003e\n\n- **Source code** :\n [`tfds.datasets.penguins.Builder`](https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/datasets/penguins/penguins_dataset_builder.py)\n\n- **Versions**:\n\n - **`1.0.0`** (default): Initial release.\n- **Auto-cached**\n ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):\n Yes\n\n- **Figure**\n ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):\n Not supported.\n\n- **Citation**:\n\n @Manual{,\n title = {palmerpenguins: Palmer Archipelago (Antarctica) penguin data},\n author = {Allison Marie Horst and Alison Presmanes Hill and Kristen B Gorman},\n year = {2020},\n note = {R package version 0.1.0},\n doi = {10.5281/zenodo.3960218},\n url = {https://allisonhorst.github.io/palmerpenguins/},\n }\n\npenguins/processed (default config)\n-----------------------------------\n\n- **Config description** : `penguins/processed` is a drop-in replacement for\n the `iris` dataset. It contains 4 normalised numerical features presented as\n a single tensor, no missing values and the class label (species) is\n presented as an integer (n = 334).\n\n- **Download size** : `25.05 KiB`\n\n- **Dataset size** : `17.61 KiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'train'` | 334 |\n\n- **Feature structure**:\n\n FeaturesDict({\n 'features': Tensor(shape=(4,), dtype=float32),\n 'species': ClassLabel(shape=(), dtype=int64, num_classes=3),\n })\n\n- **Feature documentation**:\n\n| Feature | Class | Shape | Dtype | Description |\n|----------|--------------|-------|---------|-------------|\n| | FeaturesDict | | | |\n| features | Tensor | (4,) | float32 | |\n| species | ClassLabel | | int64 | |\n\n- **Supervised keys** (See\n [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):\n `('features', 'species')`\n\n- **Examples**\n ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\npenguins/simple\n---------------\n\n- **Config description** : `penguins/simple` has been processed from the raw\n dataset, with simplified class labels derived from text fields, missing\n values marked as NaN/NA and retains only 7 significant features (n = 344).\n\n- **Download size** : `13.20 KiB`\n\n- **Dataset size** : `56.10 KiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'train'` | 344 |\n\n- **Feature structure**:\n\n FeaturesDict({\n 'body_mass_g': float32,\n 'culmen_depth_mm': float32,\n 'culmen_length_mm': float32,\n 'flipper_length_mm': float32,\n 'island': ClassLabel(shape=(), dtype=int64, num_classes=3),\n 'sex': ClassLabel(shape=(), dtype=int64, num_classes=3),\n 'species': ClassLabel(shape=(), dtype=int64, num_classes=3),\n })\n\n- **Feature documentation**:\n\n| Feature | Class | Shape | Dtype | Description |\n|-------------------|--------------|-------|---------|-------------|\n| | FeaturesDict | | | |\n| body_mass_g | Tensor | | float32 | |\n| culmen_depth_mm | Tensor | | float32 | |\n| culmen_length_mm | Tensor | | float32 | |\n| flipper_length_mm | Tensor | | float32 | |\n| island | ClassLabel | | int64 | |\n| sex | ClassLabel | | int64 | |\n| species | ClassLabel | | int64 | |\n\n- **Supervised keys** (See\n [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):\n `({'body_mass_g': 'body_mass_g', 'culmen_depth_mm': 'culmen_depth_mm',\n 'culmen_length_mm': 'culmen_length_mm', 'flipper_length_mm':\n 'flipper_length_mm', 'island': 'island', 'sex': 'sex', 'species':\n 'species'}, 'species')`\n\n- **Examples**\n ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\npenguins/raw\n------------\n\n- **Config description** : `penguins/raw` is the original, unprocessed copy\n from @allisonhorst, containing all 17 features, presented either as numeric\n types or as raw text (n = 344).\n\n- **Download size** : `49.72 KiB`\n\n- **Dataset size** : `164.51 KiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'train'` | 344 |\n\n- **Feature structure**:\n\n FeaturesDict({\n 'Body Mass (g)': float32,\n 'Clutch Completion': Text(shape=(), dtype=string),\n 'Comments': Text(shape=(), dtype=string),\n 'Culmen Depth (mm)': float32,\n 'Culmen Length (mm)': float32,\n 'Date Egg': Text(shape=(), dtype=string),\n 'Delta 13 C (o/oo)': float32,\n 'Delta 15 N (o/oo)': float32,\n 'Flipper Length (mm)': float32,\n 'Individual ID': Text(shape=(), dtype=string),\n 'Island': Text(shape=(), dtype=string),\n 'Region': Text(shape=(), dtype=string),\n 'Sample Number': int32,\n 'Sex': Text(shape=(), dtype=string),\n 'Species': Text(shape=(), dtype=string),\n 'Stage': Text(shape=(), dtype=string),\n 'studyName': Text(shape=(), dtype=string),\n })\n\n- **Feature documentation**:\n\n| Feature | Class | Shape | Dtype | Description |\n|---------------------|--------------|-------|---------|-------------|\n| | FeaturesDict | | | |\n| Body Mass (g) | Tensor | | float32 | |\n| Clutch Completion | Text | | string | |\n| Comments | Text | | string | |\n| Culmen Depth (mm) | Tensor | | float32 | |\n| Culmen Length (mm) | Tensor | | float32 | |\n| Date Egg | Text | | string | |\n| Delta 13 C (o/oo) | Tensor | | float32 | |\n| Delta 15 N (o/oo) | Tensor | | float32 | |\n| Flipper Length (mm) | Tensor | | float32 | |\n| Individual ID | Text | | string | |\n| Island | Text | | string | |\n| Region | Text | | string | |\n| Sample Number | Tensor | | int32 | |\n| Sex | Text | | string | |\n| Species | Text | | string | |\n| Stage | Text | | string | |\n| studyName | Text | | string | |\n\n- **Supervised keys** (See\n [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):\n `None`\n\n- **Examples**\n ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples..."]]