eduge

参考:

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:eduge')
  • 说明
Eduge news classification dataset is provided by Bolorsoft LLC. It is used for training the Eduge.mn production news classifier
75K news articles in 9 categories: урлаг соёл, эдийн засаг, эрүүл мэнд, хууль, улс төр, спорт, технологи, боловсрол and байгал орчин
  • 许可:无已知许可
  • 版本:0.0.0
  • 拆分
拆分 样本
'test' 15133
'train' 60528
  • 特征
{
    "news": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "label": {
        "num_classes": 9,
        "names": [
            "\u0443\u0440\u043b\u0430\u0433 \u0441\u043e\u0451\u043b",
            "\u044d\u0434\u0438\u0439\u043d \u0437\u0430\u0441\u0430\u0433",
            "\u044d\u0440\u04af\u04af\u043b \u043c\u044d\u043d\u0434",
            "\u0445\u0443\u0443\u043b\u044c",
            "\u0443\u043b\u0441 \u0442\u04e9\u0440",
            "\u0441\u043f\u043e\u0440\u0442",
            "\u0442\u0435\u0445\u043d\u043e\u043b\u043e\u0433\u0438",
            "\u0431\u043e\u043b\u043e\u0432\u0441\u0440\u043e\u043b",
            "\u0431\u0430\u0439\u0433\u0430\u043b \u043e\u0440\u0447\u0438\u043d"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}