autshumato

参考:

autshumato-en-tn

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:autshumato/autshumato-en-tn')
  • 说明
Multilingual information access is stipulated in the South African constitution. In practise, this
is hampered by a lack of resources and capacity to perform the large volumes of translation
work required to realise multilingual information access. One of the aims of the Autshumato
project is to develop machine translation systems for three South African languages pairs.
  • 许可:无已知许可
  • 版本:1.0.0
  • 拆分
拆分 样本
'train' 159000
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "tn"
        ],
        "id": null,
        "_type": "Translation"
    }
}

autshumato-en-zu

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:autshumato/autshumato-en-zu')
  • 说明
Multilingual information access is stipulated in the South African constitution. In practise, this
is hampered by a lack of resources and capacity to perform the large volumes of translation
work required to realise multilingual information access. One of the aims of the Autshumato
project is to develop machine translation systems for three South African languages pairs.
  • 许可:无已知许可
  • 版本:1.0.0
  • 拆分
拆分 样本
'train' 35489
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "zu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

autshumato-en-ts

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:autshumato/autshumato-en-ts')
  • 说明
Multilingual information access is stipulated in the South African constitution. In practise, this
is hampered by a lack of resources and capacity to perform the large volumes of translation
work required to realise multilingual information access. One of the aims of the Autshumato
project is to develop machine translation systems for three South African languages pairs.
  • 许可:无已知许可
  • 版本:1.0.0
  • 拆分
拆分 样本
'train' 450000
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "ts"
        ],
        "id": null,
        "_type": "Translation"
    }
}

autshumato-en-ts-manual

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:autshumato/autshumato-en-ts-manual')
  • 说明
Multilingual information access is stipulated in the South African constitution. In practise, this
is hampered by a lack of resources and capacity to perform the large volumes of translation
work required to realise multilingual information access. One of the aims of the Autshumato
project is to develop machine translation systems for three South African languages pairs.
  • 许可:无已知许可
  • 版本:1.0.0
  • 拆分
拆分 样本
'train' 92396
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "ts"
        ],
        "id": null,
        "_type": "Translation"
    }
}

autshumato-tn

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:autshumato/autshumato-tn')
  • 说明
Multilingual information access is stipulated in the South African constitution. In practise, this
is hampered by a lack of resources and capacity to perform the large volumes of translation
work required to realise multilingual information access. One of the aims of the Autshumato
project is to develop machine translation systems for three South African languages pairs.
  • 许可:无已知许可
  • 版本:1.0.0
  • 拆分
拆分 样本
'train' 38206
  • 特征
{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

autshumato-ts

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:autshumato/autshumato-ts')
  • 说明
Multilingual information access is stipulated in the South African constitution. In practise, this
is hampered by a lack of resources and capacity to perform the large volumes of translation
work required to realise multilingual information access. One of the aims of the Autshumato
project is to develop machine translation systems for three South African languages pairs.
  • 许可:无已知许可
  • 版本:1.0.0
  • 拆分
拆分 样本
'train' 58398
  • 特征
{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}