go_emotions

参考:

raw

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:go_emotions/raw')
  • 说明
The GoEmotions dataset contains 58k carefully curated Reddit comments labeled for 27 emotion categories or Neutral.
The emotion categories are admiration, amusement, anger, annoyance, approval, caring, confusion, curiosity, desire,
disappointment, disapproval, disgust, embarrassment, excitement, fear, gratitude, grief, joy, love, nervousness,
optimism, pride, realization, relief, remorse, sadness, surprise.
  • 许可:无已知许可
  • 版本:0.0.0
  • 拆分
拆分 样本
'train' 211225
  • 特征
{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "author": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "subreddit": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "link_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "parent_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "created_utc": {
        "dtype": "float32",
        "id": null,
        "_type": "Value"
    },
    "rater_id": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "example_very_unclear": {
        "dtype": "bool",
        "id": null,
        "_type": "Value"
    },
    "admiration": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "amusement": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "anger": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "annoyance": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "approval": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "caring": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "confusion": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "curiosity": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "desire": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "disappointment": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "disapproval": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "disgust": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "embarrassment": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "excitement": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "fear": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "gratitude": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "grief": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "joy": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "love": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "nervousness": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "optimism": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "pride": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "realization": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "relief": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "remorse": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "sadness": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "surprise": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "neutral": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    }
}

simplified

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:go_emotions/simplified')
  • 说明
The GoEmotions dataset contains 58k carefully curated Reddit comments labeled for 27 emotion categories or Neutral.
The emotion categories are admiration, amusement, anger, annoyance, approval, caring, confusion, curiosity, desire,
disappointment, disapproval, disgust, embarrassment, excitement, fear, gratitude, grief, joy, love, nervousness,
optimism, pride, realization, relief, remorse, sadness, surprise.
  • 许可:无已知许可
  • 版本:0.0.0
  • 拆分
拆分 样本
'test' 5427
'train' 43410
'validation' 5426
  • 特征
{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "labels": {
        "feature": {
            "num_classes": 28,
            "names": [
                "admiration",
                "amusement",
                "anger",
                "annoyance",
                "approval",
                "caring",
                "confusion",
                "curiosity",
                "desire",
                "disappointment",
                "disapproval",
                "disgust",
                "embarrassment",
                "excitement",
                "fear",
                "gratitude",
                "grief",
                "joy",
                "love",
                "nervousness",
                "optimism",
                "pride",
                "realization",
                "relief",
                "remorse",
                "sadness",
                "surprise",
                "neutral"
            ],
            "names_file": null,
            "id": null,
            "_type": "ClassLabel"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    },
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}