TFDS now supports the Croissant 🥐 format! Read the documentation to know more.

europarl_bilingual

参考：

bg-cs

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-cs')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	402657

特征：

{
    "translation": {
        "languages": [
            "bg",
            "cs"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-da

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-da')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	393449

特征：

{
    "translation": {
        "languages": [
            "bg",
            "da"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-de

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-de')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	393298

特征：

{
    "translation": {
        "languages": [
            "bg",
            "de"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-el

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-el')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	377341

特征：

{
    "translation": {
        "languages": [
            "bg",
            "el"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-en

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-en')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	408290

特征：

{
    "translation": {
        "languages": [
            "bg",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-es

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-es')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	388226

特征：

{
    "translation": {
        "languages": [
            "bg",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-et

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-et')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	400712

特征：

{
    "translation": {
        "languages": [
            "bg",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-fi

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-fi')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	396624

特征：

{
    "translation": {
        "languages": [
            "bg",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-fr

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-fr')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	393644

特征：

{
    "translation": {
        "languages": [
            "bg",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-hu

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-hu')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	382773

特征：

{
    "translation": {
        "languages": [
            "bg",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-it

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-it')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	377822

特征：

{
    "translation": {
        "languages": [
            "bg",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-lt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-lt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	392554

特征：

{
    "translation": {
        "languages": [
            "bg",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-lv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-lv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	398355

特征：

{
    "translation": {
        "languages": [
            "bg",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-nl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-nl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	388273

特征：

{
    "translation": {
        "languages": [
            "bg",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-pl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-pl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	395269

特征：

{
    "translation": {
        "languages": [
            "bg",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	388972

特征：

{
    "translation": {
        "languages": [
            "bg",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	389381

特征：

{
    "translation": {
        "languages": [
            "bg",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	393815

特征：

{
    "translation": {
        "languages": [
            "bg",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	380231

特征：

{
    "translation": {
        "languages": [
            "bg",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

bg-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/bg-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	398236

特征：

{
    "translation": {
        "languages": [
            "bg",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-da

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-da')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	618055

特征：

{
    "translation": {
        "languages": [
            "cs",
            "da"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-de

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-de')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	568589

特征：

{
    "translation": {
        "languages": [
            "cs",
            "de"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-el

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-el')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	599489

特征：

{
    "translation": {
        "languages": [
            "cs",
            "el"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-en

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-en')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	647095

特征：

{
    "translation": {
        "languages": [
            "cs",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-es

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-es')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	619774

特征：

{
    "translation": {
        "languages": [
            "cs",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-et

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-et')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	636512

特征：

{
    "translation": {
        "languages": [
            "cs",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-fi

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-fi')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	619320

特征：

{
    "translation": {
        "languages": [
            "cs",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-fr

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-fr')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	628200

特征：

{
    "translation": {
        "languages": [
            "cs",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-hu

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-hu')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	616160

特征：

{
    "translation": {
        "languages": [
            "cs",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-it

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-it')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	607017

特征：

{
    "translation": {
        "languages": [
            "cs",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-lt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-lt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	624292

特征：

{
    "translation": {
        "languages": [
            "cs",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-lv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-lv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	627873

特征：

{
    "translation": {
        "languages": [
            "cs",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-nl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-nl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	618414

特征：

{
    "translation": {
        "languages": [
            "cs",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-pl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-pl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	621387

特征：

{
    "translation": {
        "languages": [
            "cs",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	609729

特征：

{
    "translation": {
        "languages": [
            "cs",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	392085

特征：

{
    "translation": {
        "languages": [
            "cs",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	636128

特征：

{
    "translation": {
        "languages": [
            "cs",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	611624

特征：

{
    "translation": {
        "languages": [
            "cs",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

cs-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/cs-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	631544

特征：

{
    "translation": {
        "languages": [
            "cs",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-de

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-de')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1928414

特征：

{
    "translation": {
        "languages": [
            "da",
            "de"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-el

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-el')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1280579

特征：

{
    "translation": {
        "languages": [
            "da",
            "el"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-en

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-en')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1991647

特征：

{
    "translation": {
        "languages": [
            "da",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-es

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-es')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1943931

特征：

{
    "translation": {
        "languages": [
            "da",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-et

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-et')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	635018

特征：

{
    "translation": {
        "languages": [
            "da",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-fi

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-fi')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1917260

特征：

{
    "translation": {
        "languages": [
            "da",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-fr

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-fr')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1992590

特征：

{
    "translation": {
        "languages": [
            "da",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-hu

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-hu')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	617519

特征：

{
    "translation": {
        "languages": [
            "da",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-it

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-it')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1876703

特征：

{
    "translation": {
        "languages": [
            "da",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-lt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-lt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	614923

特征：

{
    "translation": {
        "languages": [
            "da",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-lv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-lv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	627809

特征：

{
    "translation": {
        "languages": [
            "da",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-nl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-nl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1987498

特征：

{
    "translation": {
        "languages": [
            "da",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-pl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-pl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	642544

特征：

{
    "translation": {
        "languages": [
            "da",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1930454

特征：

{
    "translation": {
        "languages": [
            "da",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	388156

特征：

{
    "translation": {
        "languages": [
            "da",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	621907

特征：

{
    "translation": {
        "languages": [
            "da",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	595944

特征：

{
    "translation": {
        "languages": [
            "da",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

da-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/da-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1871171

特征：

{
    "translation": {
        "languages": [
            "da",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-el

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-el')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1223026

特征：

{
    "translation": {
        "languages": [
            "de",
            "el"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-en

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-en')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1961119

特征：

{
    "translation": {
        "languages": [
            "de",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-es

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-es')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1887879

特征：

{
    "translation": {
        "languages": [
            "de",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-et

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-et')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	578248

特征：

{
    "translation": {
        "languages": [
            "de",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-fi

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-fi')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1871185

特征：

{
    "translation": {
        "languages": [
            "de",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-fr

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-fr')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1942666

特征：

{
    "translation": {
        "languages": [
            "de",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-hu

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-hu')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	563571

特征：

{
    "translation": {
        "languages": [
            "de",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-it

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-it')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1832989

特征：

{
    "translation": {
        "languages": [
            "de",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-lt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-lt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	565892

特征：

{
    "translation": {
        "languages": [
            "de",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-lv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-lv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	573226

特征：

{
    "translation": {
        "languages": [
            "de",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-nl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-nl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1934111

特征：

{
    "translation": {
        "languages": [
            "de",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-pl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-pl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	579166

特征：

{
    "translation": {
        "languages": [
            "de",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1884176

特征：

{
    "translation": {
        "languages": [
            "de",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	385663

特征：

{
    "translation": {
        "languages": [
            "de",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	569381

特征：

{
    "translation": {
        "languages": [
            "de",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	546212

特征：

{
    "translation": {
        "languages": [
            "de",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

de-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/de-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1842026

特征：

{
    "translation": {
        "languages": [
            "de",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-en

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-en')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1292180

特征：

{
    "translation": {
        "languages": [
            "el",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-es

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-es')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1272383

特征：

{
    "translation": {
        "languages": [
            "el",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-et

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-et')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	599915

特征：

{
    "translation": {
        "languages": [
            "el",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-fi

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-fi')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1227612

特征：

{
    "translation": {
        "languages": [
            "el",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-fr

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-fr')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1290796

特征：

{
    "translation": {
        "languages": [
            "el",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-hu

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-hu')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	586250

特征：

{
    "translation": {
        "languages": [
            "el",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-it

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-it')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1231222

特征：

{
    "translation": {
        "languages": [
            "el",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-lt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-lt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	590850

特征：

{
    "translation": {
        "languages": [
            "el",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-lv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-lv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	596929

特征：

{
    "translation": {
        "languages": [
            "el",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-nl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-nl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1277297

特征：

{
    "translation": {
        "languages": [
            "el",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-pl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-pl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	591069

特征：

{
    "translation": {
        "languages": [
            "el",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1261188

特征：

{
    "translation": {
        "languages": [
            "el",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	372839

特征：

{
    "translation": {
        "languages": [
            "el",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	600684

版本：8.0.0

{
    "translation": {
        "languages": [
            "el",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	579109

特征：

{
    "translation": {
        "languages": [
            "el",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

el-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/el-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1273743

特征：

{
    "translation": {
        "languages": [
            "el",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-es

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/en-es')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	2009073

特征：

{
    "translation": {
        "languages": [
            "en",
            "es"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-et

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/en-et')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	651236

特征：

{
    "translation": {
        "languages": [
            "en",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-fi

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/en-fi')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1969624

特征：

{
    "translation": {
        "languages": [
            "en",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-fr

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/en-fr')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	2051014

特征：

{
    "translation": {
        "languages": [
            "en",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-hu

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/en-hu')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	625178

特征：

{
    "translation": {
        "languages": [
            "en",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-it

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/en-it')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1946253

特征：

{
    "translation": {
        "languages": [
            "en",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-lt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/en-lt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	634284

特征：

{
    "translation": {
        "languages": [
            "en",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-lv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/en-lv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	639318

特征：

{
    "translation": {
        "languages": [
            "en",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-nl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/en-nl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	2027447

特征：

{
    "translation": {
        "languages": [
            "en",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-pl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/en-pl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	631160

特征：

{
    "translation": {
        "languages": [
            "en",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/en-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	2002943

特征：

{
    "translation": {
        "languages": [
            "en",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/en-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	400356

特征：

{
    "translation": {
        "languages": [
            "en",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/en-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	639958

特征：

{
    "translation": {
        "languages": [
            "en",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/en-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	624803

特征：

{
    "translation": {
        "languages": [
            "en",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

en-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/en-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1892723

特征：

{
    "translation": {
        "languages": [
            "en",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-et

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/es-et')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	618350

特征：

{
    "translation": {
        "languages": [
            "es",
            "et"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-fi

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/es-fi')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1901596

特征：

{
    "translation": {
        "languages": [
            "es",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-fr

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/es-fr')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1982990

特征：

{
    "translation": {
        "languages": [
            "es",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-hu

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/es-hu')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	604007

特征：

{
    "translation": {
        "languages": [
            "es",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-it

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/es-it')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1880982

特征：

{
    "translation": {
        "languages": [
            "es",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-lt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/es-lt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	611082

特征：

{
    "translation": {
        "languages": [
            "es",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-lv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/es-lv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	615496

特征：

{
    "translation": {
        "languages": [
            "es",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-nl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/es-nl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1954351

特征：

{
    "translation": {
        "languages": [
            "es",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-pl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/es-pl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	609297

特征：

{
    "translation": {
        "languages": [
            "es",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/es-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1933321

特征：

{
    "translation": {
        "languages": [
            "es",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/es-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	387653

特征：

{
    "translation": {
        "languages": [
            "es",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/es-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	619027

特征：

{
    "translation": {
        "languages": [
            "es",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/es-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	599168

特征：

{
    "translation": {
        "languages": [
            "es",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

es-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/es-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1826855

特征：

{
    "translation": {
        "languages": [
            "es",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-fi

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/et-fi')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	620939

特征：

{
    "translation": {
        "languages": [
            "et",
            "fi"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-fr

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/et-fr')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	630126

特征：

{
    "translation": {
        "languages": [
            "et",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-hu

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/et-hu')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	628044

特征：

{
    "translation": {
        "languages": [
            "et",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-it

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/et-it')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	607088

特征：

{
    "translation": {
        "languages": [
            "et",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-lt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/et-lt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	622003

特征：

{
    "translation": {
        "languages": [
            "et",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-lv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/et-lv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	637468

特征：

{
    "translation": {
        "languages": [
            "et",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-nl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/et-nl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	621150

特征：

{
    "translation": {
        "languages": [
            "et",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-pl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/et-pl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	639046

特征：

{
    "translation": {
        "languages": [
            "et",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/et-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	616238

特征：

{
    "translation": {
        "languages": [
            "et",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/et-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	389087

特征：

{
    "translation": {
        "languages": [
            "et",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/et-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	634168

特征：

{
    "translation": {
        "languages": [
            "et",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/et-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	609731

特征：

{
    "translation": {
        "languages": [
            "et",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

et-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/et-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	656646

特征：

{
    "translation": {
        "languages": [
            "et",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-fr

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fi-fr')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1964126

特征：

{
    "translation": {
        "languages": [
            "fi",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-hu

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fi-hu')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	606348

特征：

{
    "translation": {
        "languages": [
            "fi",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-it

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fi-it')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1845203

特征：

{
    "translation": {
        "languages": [
            "fi",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-lt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fi-lt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	613113

特征：

{
    "translation": {
        "languages": [
            "fi",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-lv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fi-lv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	616816

特征：

{
    "translation": {
        "languages": [
            "fi",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-nl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fi-nl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1940808

特征：

{
    "translation": {
        "languages": [
            "fi",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-pl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fi-pl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	612689

特征：

{
    "translation": {
        "languages": [
            "fi",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fi-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1885062

特征：

{
    "translation": {
        "languages": [
            "fi",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fi-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	391430

特征：

{
    "translation": {
        "languages": [
            "fi",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fi-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	623686

特征：

{
    "translation": {
        "languages": [
            "fi",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fi-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	596661

特征：

{
    "translation": {
        "languages": [
            "fi",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fi-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fi-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1883314

特征：

{
    "translation": {
        "languages": [
            "fi",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-hu

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fr-hu')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	615791

特征：

{
    "translation": {
        "languages": [
            "fr",
            "hu"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-it

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fr-it')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1943673

特征：

{
    "translation": {
        "languages": [
            "fr",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-lt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fr-lt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	620660

特征：

{
    "translation": {
        "languages": [
            "fr",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-lv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fr-lv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	626280

特征：

{
    "translation": {
        "languages": [
            "fr",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-nl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fr-nl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	2029551

特征：

{
    "translation": {
        "languages": [
            "fr",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-pl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fr-pl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	621402

特征：

{
    "translation": {
        "languages": [
            "fr",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fr-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1980132

特征：

{
    "translation": {
        "languages": [
            "fr",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fr-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	387846

特征：

{
    "translation": {
        "languages": [
            "fr",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fr-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	631846

特征：

{
    "translation": {
        "languages": [
            "fr",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fr-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	606897

特征：

{
    "translation": {
        "languages": [
            "fr",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

fr-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/fr-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1880390

特征：

{
    "translation": {
        "languages": [
            "fr",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-it

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/hu-it')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	589563

特征：

{
    "translation": {
        "languages": [
            "hu",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-lt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/hu-lt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	610298

特征：

{
    "translation": {
        "languages": [
            "hu",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-lv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/hu-lv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	621101

特征：

{
    "translation": {
        "languages": [
            "hu",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-nl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/hu-nl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	605806

特征：

{
    "translation": {
        "languages": [
            "hu",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-pl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/hu-pl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	621820

特征：

{
    "translation": {
        "languages": [
            "hu",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/hu-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	599639

特征：

{
    "translation": {
        "languages": [
            "hu",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/hu-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	377239

特征：

{
    "translation": {
        "languages": [
            "hu",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/hu-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	618247

特征：

{
    "translation": {
        "languages": [
            "hu",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/hu-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	601671

特征：

{
    "translation": {
        "languages": [
            "hu",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

hu-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/hu-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	631872

特征：

{
    "translation": {
        "languages": [
            "hu",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-lt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/it-lt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	593003

特征：

{
    "translation": {
        "languages": [
            "it",
            "lt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-lv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/it-lv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	599394

特征：

{
    "translation": {
        "languages": [
            "it",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-nl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/it-nl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1919855

特征：

{
    "translation": {
        "languages": [
            "it",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-pl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/it-pl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	594472

特征：

{
    "translation": {
        "languages": [
            "it",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/it-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1877432

特征：

{
    "translation": {
        "languages": [
            "it",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/it-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	367904

特征：

{
    "translation": {
        "languages": [
            "it",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/it-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	603467

特征：

{
    "translation": {
        "languages": [
            "it",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/it-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	579968

特征：

{
    "translation": {
        "languages": [
            "it",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

it-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/it-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1766096

特征：

{
    "translation": {
        "languages": [
            "it",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-lv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/lt-lv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	621857

特征：

{
    "translation": {
        "languages": [
            "lt",
            "lv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-nl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/lt-nl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	613308

特征：

{
    "translation": {
        "languages": [
            "lt",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-pl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/lt-pl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	617296

特征：

{
    "translation": {
        "languages": [
            "lt",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/lt-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	603223

特征：

{
    "translation": {
        "languages": [
            "lt",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/lt-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	384679

特征：

{
    "translation": {
        "languages": [
            "lt",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/lt-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	622997

特征：

{
    "translation": {
        "languages": [
            "lt",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/lt-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	602442

特征：

{
    "translation": {
        "languages": [
            "lt",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lt-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/lt-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	628817

特征：

{
    "translation": {
        "languages": [
            "lt",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-nl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/lv-nl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	618352

特征：

{
    "translation": {
        "languages": [
            "lv",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-pl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/lv-pl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	638453

特征：

{
    "translation": {
        "languages": [
            "lv",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/lv-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	615580

特征：

{
    "translation": {
        "languages": [
            "lv",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/lv-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	390857

特征：

{
    "translation": {
        "languages": [
            "lv",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/lv-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	629803

特征：

{
    "translation": {
        "languages": [
            "lv",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/lv-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	607381

特征：

{
    "translation": {
        "languages": [
            "lv",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

lv-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/lv-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	643600

特征：

{
    "translation": {
        "languages": [
            "lv",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-pl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/nl-pl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	612797

特征：

{
    "translation": {
        "languages": [
            "nl",
            "pl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/nl-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1957189

特征：

{
    "translation": {
        "languages": [
            "nl",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/nl-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	380736

特征：

{
    "translation": {
        "languages": [
            "nl",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/nl-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	622650

特征：

{
    "translation": {
        "languages": [
            "nl",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/nl-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	600023

特征：

{
    "translation": {
        "languages": [
            "nl",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

nl-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/nl-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1870685

特征：

{
    "translation": {
        "languages": [
            "nl",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-pt

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/pl-pt')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	608181

特征：

{
    "translation": {
        "languages": [
            "pl",
            "pt"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/pl-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	389341

特征：

{
    "translation": {
        "languages": [
            "pl",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/pl-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	624330

特征：

{
    "translation": {
        "languages": [
            "pl",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/pl-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	600511

特征：

{
    "translation": {
        "languages": [
            "pl",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pl-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/pl-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	657951

特征：

{
    "translation": {
        "languages": [
            "pl",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pt-ro

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/pt-ro')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	381404

特征：

{
    "translation": {
        "languages": [
            "pt",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pt-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/pt-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	611895

特征：

{
    "translation": {
        "languages": [
            "pt",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pt-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/pt-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	593455

特征：

{
    "translation": {
        "languages": [
            "pt",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

pt-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/pt-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	1823402

特征：

{
    "translation": {
        "languages": [
            "pt",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ro-sk

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/ro-sk')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	387839

特征：

{
    "translation": {
        "languages": [
            "ro",
            "sk"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ro-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/ro-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	374859

特征：

{
    "translation": {
        "languages": [
            "ro",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

ro-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/ro-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	390133

特征：

{
    "translation": {
        "languages": [
            "ro",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

sk-sl

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/sk-sl')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	609698

特征：

{
    "translation": {
        "languages": [
            "sk",
            "sl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

sk-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/sk-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	636353

特征：

{
    "translation": {
        "languages": [
            "sk",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}

sl-sv

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:europarl_bilingual/sl-sv')

说明：

A parallel corpus extracted from the European Parliament web site by Philipp Koehn (University of Edinburgh). The main intended use is to aid statistical machine translation research.

许可：数据集与原始来源具有相同的许可。请检查 http://opus.nlpl.eu/Europarl-v8.php 上提供的来源信息
版本：8.0.0
拆分：

拆分	样本
`'train'`	608740

特征：

{
    "translation": {
        "languages": [
            "sl",
            "sv"
        ],
        "id": null,
        "_type": "Translation"
    }
}