Telugu News

References:

Use the following command to load this dataset in TFDS:

import tensorflow_datasets as tfds
ds = tfds.load('huggingface:telugu_news')
  • Description:
This dataset contains Telugu-language news articles with their respective
topic labels (business, editorial, entertainment, nation, sports), extracted from
the daily Andhra Jyoti. It can be used to build classification and language models.
  • License: Data files © Original Authors
  • Version: 1.1.0
  • Splits:
Split Examples
'test' 4329
'train' 17312
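The split sizes above work out to roughly an 80/20 train/test partition. A minimal sketch of that arithmetic, using only the counts listed in the table; the `SPLIT_SIZES` name and the `split_fractions` helper are illustrative and not part of TFDS:

```python
# Split sizes copied from the table above.
SPLIT_SIZES = {"test": 4329, "train": 17312}


def split_fractions(sizes):
    """Return each split's share of the total number of examples."""
    total = sum(sizes.values())
    return {name: count / total for name, count in sizes.items()}


fractions = split_fractions(SPLIT_SIZES)
for name, share in fractions.items():
    # Print each share as a percentage with one decimal place.
    print(f"{name}: {share:.1%}")
```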
  • Features:
{
    "sno": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "heading": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "topic": {
        "num_classes": 5,
        "names": [
            "business",
            "editorial",
            "entertainment",
            "nation",
            "sports"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}
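
Because `topic` is declared as a `ClassLabel`, examples presumably carry it as an integer index into the `names` list rather than as a string. The sketch below shows that mapping in plain Python, assuming indices follow the order of `names` in the schema above. `TOPIC_NAMES`, `topic_name`, and `topic_index` are hypothetical helpers written for this illustration, not calls into TFDS or Hugging Face Datasets.

```python
# Label names in the order given by the "topic" feature above.
# Assumption: the integer label is the position of the name in this list.
TOPIC_NAMES = ["business", "editorial", "entertainment", "nation", "sports"]


def topic_name(index):
    """Map an integer topic label to its string name."""
    if not 0 <= index < len(TOPIC_NAMES):
        raise ValueError(f"topic index out of range: {index}")
    return TOPIC_NAMES[index]


def topic_index(name):
    """Map a topic name back to its integer label."""
    try:
        return TOPIC_NAMES.index(name)
    except ValueError:
        raise ValueError(f"unknown topic name: {name!r}") from None


print(topic_name(0))
print(topic_index("nation"))
```

Keeping the names in a single ordered list means both directions of the mapping stay consistent with `num_classes` (5) in the schema.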