europarl_bilingual

参考:

bg-cs

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-cs')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 402657
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "cs"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-da

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-da')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 393449
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "da"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-de

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-de')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 393298
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "de"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-el

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-el')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 377341
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "el"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-en

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-en')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 408290
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-es

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-es')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 388226
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-et

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-et')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 400712
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-fi

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-fi')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 396624
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-fr

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-fr')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 393644
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-hu

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-hu')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 382773
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-it

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-it')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 377822
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-lt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-lt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 392554
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-lv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-lv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 398355
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-nl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-nl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 388273
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-pl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-pl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 395269
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 388972
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 389381
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 393815
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 380231
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/bg-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 398236
  • 特征
{
    "translation": {
        "languages": [
            "bg",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-da

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-da')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 618055
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "da"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-de

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-de')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 568589
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "de"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-el

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-el')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 599489
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "el"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-en

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-en')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 647095
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-es

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-es')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 619774
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-et

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-et')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 636512
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-fi

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-fi')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 619320
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-fr

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-fr')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 628200
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-hu

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-hu')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 616160
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-it

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-it')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 607017
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-lt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-lt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 624292
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-lv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-lv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 627873
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-nl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-nl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 618414
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-pl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-pl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 621387
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 609729
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 392085
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 636128
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 611624
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/cs-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 631544
  • 特征
{
    "translation": {
        "languages": [
            "cs",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-de

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-de')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1928414
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "de"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-el

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-el')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1280579
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "el"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-en

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-en')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1991647
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-es

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-es')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1943931
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-et

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-et')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 635018
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-fi

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-fi')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1917260
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-fr

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-fr')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1992590
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-hu

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-hu')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 617519
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-it

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-it')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1876703
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-lt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-lt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 614923
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-lv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-lv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 627809
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-nl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-nl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1987498
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-pl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-pl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 642544
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1930454
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 388156
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 621907
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 595944
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/da-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1871171
  • 特征
{
    "translation": {
        "languages": [
            "da",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-el

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-el')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1223026
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "el"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-en

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-en')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1961119
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-es

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-es')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1887879
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-et

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-et')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 578248
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-fi

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-fi')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1871185
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-fr

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-fr')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1942666
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-hu

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-hu')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 563571
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-it

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-it')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1832989
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-lt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-lt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 565892
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-lv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-lv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 573226
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-nl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-nl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1934111
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-pl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-pl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 579166
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1884176
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 385663
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 569381
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 546212
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/de-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1842026
  • 特征
{
    "translation": {
        "languages": [
            "de",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-en

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-en')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1292180
  • 特征
{
    "translation": {
        "languages": [
            "el",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-es

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-es')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1272383
  • 特征
{
    "translation": {
        "languages": [
            "el",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-et

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-et')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 599915
  • 特征
{
    "translation": {
        "languages": [
            "el",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-fi

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-fi')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1227612
  • 特征
{
    "translation": {
        "languages": [
            "el",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-fr

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-fr')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1290796
  • 特征
{
    "translation": {
        "languages": [
            "el",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-hu

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-hu')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 586250
  • 特征
{
    "translation": {
        "languages": [
            "el",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-it

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-it')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1231222
  • 特征
{
    "translation": {
        "languages": [
            "el",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-lt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-lt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 590850
  • 特征
{
    "translation": {
        "languages": [
            "el",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-lv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-lv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 596929
  • 特征
{
    "translation": {
        "languages": [
            "el",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-nl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-nl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1277297
  • 特征
{
    "translation": {
        "languages": [
            "el",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-pl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-pl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 591069
  • 特征
{
    "translation": {
        "languages": [
            "el",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1261188
  • 特征
{
    "translation": {
        "languages": [
            "el",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 372839
  • 特征
{
    "translation": {
        "languages": [
            "el",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 600684
  • 版本:8.0.0
{
    "translation": {
        "languages": [
            "el",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 579109
  • 特征
{
    "translation": {
        "languages": [
            "el",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/el-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1273743
  • 特征
{
    "translation": {
        "languages": [
            "el",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-es

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/en-es')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 2009073
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-et

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/en-et')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 651236
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-fi

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/en-fi')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1969624
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-fr

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/en-fr')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 2051014
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-hu

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/en-hu')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 625178
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-it

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/en-it')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1946253
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-lt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/en-lt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 634284
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-lv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/en-lv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 639318
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-nl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/en-nl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 2027447
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-pl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/en-pl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 631160
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/en-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 2002943
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/en-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 400356
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/en-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 639958
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/en-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 624803
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/en-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1892723
  • 特征
{
    "translation": {
        "languages": [
            "en",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-et

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/es-et')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 618350
  • 特征
{
    "translation": {
        "languages": [
            "es",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-fi

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/es-fi')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1901596
  • 特征
{
    "translation": {
        "languages": [
            "es",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-fr

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/es-fr')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1982990
  • 特征
{
    "translation": {
        "languages": [
            "es",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-hu

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/es-hu')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 604007
  • 特征
{
    "translation": {
        "languages": [
            "es",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-it

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/es-it')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1880982
  • 特征
{
    "translation": {
        "languages": [
            "es",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-lt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/es-lt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 611082
  • 特征
{
    "translation": {
        "languages": [
            "es",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-lv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/es-lv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 615496
  • 特征
{
    "translation": {
        "languages": [
            "es",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-nl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/es-nl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1954351
  • 特征
{
    "translation": {
        "languages": [
            "es",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-pl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/es-pl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 609297
  • 特征
{
    "translation": {
        "languages": [
            "es",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/es-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1933321
  • 特征
{
    "translation": {
        "languages": [
            "es",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/es-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 387653
  • 特征
{
    "translation": {
        "languages": [
            "es",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/es-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 619027
  • 特征
{
    "translation": {
        "languages": [
            "es",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/es-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 599168
  • 特征
{
    "translation": {
        "languages": [
            "es",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/es-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1826855
  • 特征
{
    "translation": {
        "languages": [
            "es",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-fi

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/et-fi')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 620939
  • 特征
{
    "translation": {
        "languages": [
            "et",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-fr

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/et-fr')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 630126
  • 特征
{
    "translation": {
        "languages": [
            "et",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-hu

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/et-hu')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 628044
  • 特征
{
    "translation": {
        "languages": [
            "et",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-it

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/et-it')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 607088
  • 特征
{
    "translation": {
        "languages": [
            "et",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-lt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/et-lt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 622003
  • 特征
{
    "translation": {
        "languages": [
            "et",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-lv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/et-lv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 637468
  • 特征
{
    "translation": {
        "languages": [
            "et",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-nl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/et-nl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 621150
  • 特征
{
    "translation": {
        "languages": [
            "et",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-pl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/et-pl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 639046
  • 特征
{
    "translation": {
        "languages": [
            "et",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/et-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 616238
  • 特征
{
    "translation": {
        "languages": [
            "et",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/et-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 389087
  • 特征
{
    "translation": {
        "languages": [
            "et",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/et-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 634168
  • 特征
{
    "translation": {
        "languages": [
            "et",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/et-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 609731
  • 特征
{
    "translation": {
        "languages": [
            "et",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/et-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 656646
  • 特征
{
    "translation": {
        "languages": [
            "et",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-fr

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fi-fr')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1964126
  • 特征
{
    "translation": {
        "languages": [
            "fi",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-hu

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fi-hu')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 606348
  • 特征
{
    "translation": {
        "languages": [
            "fi",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-it

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fi-it')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1845203
  • 特征
{
    "translation": {
        "languages": [
            "fi",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-lt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fi-lt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 613113
  • 特征
{
    "translation": {
        "languages": [
            "fi",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-lv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fi-lv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 616816
  • 特征
{
    "translation": {
        "languages": [
            "fi",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-nl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fi-nl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1940808
  • 特征
{
    "translation": {
        "languages": [
            "fi",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-pl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fi-pl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 612689
  • 特征
{
    "translation": {
        "languages": [
            "fi",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fi-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1885062
  • 特征
{
    "translation": {
        "languages": [
            "fi",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fi-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 391430
  • 特征
{
    "translation": {
        "languages": [
            "fi",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fi-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 623686
  • 特征
{
    "translation": {
        "languages": [
            "fi",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fi-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 596661
  • 特征
{
    "translation": {
        "languages": [
            "fi",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fi-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1883314
  • 特征
{
    "translation": {
        "languages": [
            "fi",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-hu

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fr-hu')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 615791
  • 特征
{
    "translation": {
        "languages": [
            "fr",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-it

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fr-it')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1943673
  • 特征
{
    "translation": {
        "languages": [
            "fr",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-lt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fr-lt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 620660
  • 特征
{
    "translation": {
        "languages": [
            "fr",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-lv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fr-lv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 626280
  • 特征
{
    "translation": {
        "languages": [
            "fr",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-nl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fr-nl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 2029551
  • 特征
{
    "translation": {
        "languages": [
            "fr",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-pl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fr-pl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 621402
  • 特征
{
    "translation": {
        "languages": [
            "fr",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fr-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1980132
  • 特征
{
    "translation": {
        "languages": [
            "fr",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fr-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 387846
  • 特征
{
    "translation": {
        "languages": [
            "fr",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fr-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 631846
  • 特征
{
    "translation": {
        "languages": [
            "fr",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fr-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 606897
  • 特征
{
    "translation": {
        "languages": [
            "fr",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/fr-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1880390
  • 特征
{
    "translation": {
        "languages": [
            "fr",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-it

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/hu-it')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 589563
  • 特征
{
    "translation": {
        "languages": [
            "hu",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-lt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/hu-lt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 610298
  • 特征
{
    "translation": {
        "languages": [
            "hu",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-lv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/hu-lv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 621101
  • 特征
{
    "translation": {
        "languages": [
            "hu",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-nl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/hu-nl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 605806
  • 特征
{
    "translation": {
        "languages": [
            "hu",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-pl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/hu-pl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 621820
  • 特征
{
    "translation": {
        "languages": [
            "hu",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/hu-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 599639
  • 特征
{
    "translation": {
        "languages": [
            "hu",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/hu-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 377239
  • 特征
{
    "translation": {
        "languages": [
            "hu",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/hu-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 618247
  • 特征
{
    "translation": {
        "languages": [
            "hu",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/hu-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 601671
  • 特征
{
    "translation": {
        "languages": [
            "hu",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/hu-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 631872
  • 特征
{
    "translation": {
        "languages": [
            "hu",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-lt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/it-lt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 593003
  • 特征
{
    "translation": {
        "languages": [
            "it",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-lv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/it-lv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 599394
  • 特征
{
    "translation": {
        "languages": [
            "it",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-nl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/it-nl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1919855
  • 特征
{
    "translation": {
        "languages": [
            "it",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-pl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/it-pl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 594472
  • 特征
{
    "translation": {
        "languages": [
            "it",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/it-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1877432
  • 特征
{
    "translation": {
        "languages": [
            "it",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/it-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 367904
  • 特征
{
    "translation": {
        "languages": [
            "it",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/it-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 603467
  • 特征
{
    "translation": {
        "languages": [
            "it",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/it-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 579968
  • 特征
{
    "translation": {
        "languages": [
            "it",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/it-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1766096
  • 特征
{
    "translation": {
        "languages": [
            "it",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-lv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/lt-lv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 621857
  • 特征
{
    "translation": {
        "languages": [
            "lt",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-nl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/lt-nl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 613308
  • 特征
{
    "translation": {
        "languages": [
            "lt",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-pl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/lt-pl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 617296
  • 特征
{
    "translation": {
        "languages": [
            "lt",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/lt-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 603223
  • 特征
{
    "translation": {
        "languages": [
            "lt",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/lt-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 384679
  • 特征
{
    "translation": {
        "languages": [
            "lt",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/lt-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 622997
  • 特征
{
    "translation": {
        "languages": [
            "lt",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/lt-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 602442
  • 特征
{
    "translation": {
        "languages": [
            "lt",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/lt-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 628817
  • 特征
{
    "translation": {
        "languages": [
            "lt",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-nl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/lv-nl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 618352
  • 特征
{
    "translation": {
        "languages": [
            "lv",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-pl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/lv-pl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 638453
  • 特征
{
    "translation": {
        "languages": [
            "lv",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/lv-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 615580
  • 特征
{
    "translation": {
        "languages": [
            "lv",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/lv-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 390857
  • 特征
{
    "translation": {
        "languages": [
            "lv",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/lv-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 629803
  • 特征
{
    "translation": {
        "languages": [
            "lv",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/lv-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 607381
  • 特征
{
    "translation": {
        "languages": [
            "lv",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/lv-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 643600
  • 特征
{
    "translation": {
        "languages": [
            "lv",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-pl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/nl-pl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 612797
  • 特征
{
    "translation": {
        "languages": [
            "nl",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/nl-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1957189
  • 特征
{
    "translation": {
        "languages": [
            "nl",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/nl-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 380736
  • 特征
{
    "translation": {
        "languages": [
            "nl",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/nl-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 622650
  • 特征
{
    "translation": {
        "languages": [
            "nl",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/nl-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 600023
  • 特征
{
    "translation": {
        "languages": [
            "nl",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/nl-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1870685
  • 特征
{
    "translation": {
        "languages": [
            "nl",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-pt

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/pl-pt')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 608181
  • 特征
{
    "translation": {
        "languages": [
            "pl",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/pl-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 389341
  • 特征
{
    "translation": {
        "languages": [
            "pl",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/pl-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 624330
  • 特征
{
    "translation": {
        "languages": [
            "pl",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/pl-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 600511
  • 特征
{
    "translation": {
        "languages": [
            "pl",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/pl-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 657951
  • 特征
{
    "translation": {
        "languages": [
            "pl",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pt-ro

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/pt-ro')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 381404
  • 特征
{
    "translation": {
        "languages": [
            "pt",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pt-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/pt-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 611895
  • 特征
{
    "translation": {
        "languages": [
            "pt",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pt-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/pt-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 593455
  • 特征
{
    "translation": {
        "languages": [
            "pt",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pt-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/pt-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 1823402
  • 特征
{
    "translation": {
        "languages": [
            "pt",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ro-sk

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/ro-sk')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 387839
  • 特征
{
    "translation": {
        "languages": [
            "ro",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ro-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/ro-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 374859
  • 特征
{
    "translation": {
        "languages": [
            "ro",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ro-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/ro-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 390133
  • 特征
{
    "translation": {
        "languages": [
            "ro",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

sk-sl

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/sk-sl')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 609698
  • 特征
{
    "translation": {
        "languages": [
            "sk",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

sk-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/sk-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 636353
  • 特征
{
    "translation": {
        "languages": [
            "sk",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

sl-sv

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:europarl_bilingual/sl-sv')
  • 说明
A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.
拆分 样本
'train' 608740
  • 特征
{
    "translation": {
        "languages": [
            "sl",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}