iwslt2017

References:

iwslt2017-en-it

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-it')
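Every configuration name on this page follows the pattern `iwslt2017-<source>-<target>`. As a minimal sketch, the helper below builds the string passed to `tfds.load` and rejects pairs that are not listed here. `tfds_name` and `PAIRS` are hypothetical names introduced for illustration, not part of TFDS, and the `tfds.load` call is left commented out because it needs `tensorflow_datasets` and network access.

```python
# Language pairs listed on this page, stored once per unordered pair;
# each pair appears on the page in both directions.
PAIRS = {
    frozenset(p) for p in [
        ("en", "it"), ("en", "nl"), ("en", "ro"), ("it", "nl"),
        ("it", "ro"), ("nl", "ro"), ("ar", "en"), ("de", "en"),
        ("en", "fr"), ("en", "ja"), ("en", "ko"), ("en", "zh"),
    ]
}


def tfds_name(source: str, target: str) -> str:
    """Build the tfds.load name for a pair (hypothetical helper)."""
    if frozenset((source, target)) not in PAIRS:
        raise ValueError(f"no iwslt2017 configuration for {source}-{target}")
    return f"huggingface:iwslt2017/iwslt2017-{source}-{target}"


print(tfds_name("en", "it"))
# import tensorflow_datasets as tfds
# ds = tfds.load(tfds_name("en", "it"))
```

Storing each pair as a `frozenset` keeps the table at twelve entries while still accepting both directions.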

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	1566
`'train'`	231619
`'validation'`	929

Features:

{
    "translation": {
        "languages": [
            "en",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}
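
The feature spec above declares a single `translation` field whose `languages` list holds the two language codes of the configuration. The sketch below parses that JSON and pulls both sentences out of one example. It assumes each example is a dict keyed by language code under `"translation"`, and it treats the listed order as source then target, which matches the configuration name but is an assumption. The sample sentences are invented placeholders, not dataset content, and both function names are hypothetical.

```python
import json
from typing import Tuple

FEATURES_JSON = """
{
    "translation": {
        "languages": ["en", "it"],
        "id": null,
        "_type": "Translation"
    }
}
"""


def language_pair(features_json: str) -> Tuple[str, str]:
    """Return the two language codes in the order they are listed."""
    spec = json.loads(features_json)["translation"]
    first, second = spec["languages"]
    return first, second


def split_example(example: dict, features_json: str) -> Tuple[str, str]:
    """Return the two sentences of one example, in listed language order."""
    first, second = language_pair(features_json)
    pair = example["translation"]
    return pair[first], pair[second]


# Placeholder example, not taken from the dataset.
example = {"translation": {"en": "Thank you.", "it": "Grazie."}}
print(split_example(example, FEATURES_JSON))
```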

iwslt2017-en-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-nl')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	1777
`'train'`	237240
`'validation'`	1003

Features:

{
    "translation": {
        "languages": [
            "en",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-en-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-ro')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	1678
`'train'`	220538
`'validation'`	914

Features:

{
    "translation": {
        "languages": [
            "en",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-it-en

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-it-en')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	1566
`'train'`	231619
`'validation'`	929

Features:

{
    "translation": {
        "languages": [
            "it",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-it-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-it-nl')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	1669
`'train'`	233415
`'validation'`	1001

Features:

{
    "translation": {
        "languages": [
            "it",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-it-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-it-ro')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	1643
`'train'`	217551
`'validation'`	914

Features:

{
    "translation": {
        "languages": [
            "it",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-nl-en

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-nl-en')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	1777
`'train'`	237240
`'validation'`	1003

Features:

{
    "translation": {
        "languages": [
            "nl",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-nl-it

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-nl-it')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	1669
`'train'`	233415
`'validation'`	1001

Features:

{
    "translation": {
        "languages": [
            "nl",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-nl-ro

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-nl-ro')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	1680
`'train'`	206920
`'validation'`	913

Features:

{
    "translation": {
        "languages": [
            "nl",
            "ro"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-ro-en

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-ro-en')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	1678
`'train'`	220538
`'validation'`	914

Features:

{
    "translation": {
        "languages": [
            "ro",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-ro-it

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-ro-it')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	1643
`'train'`	217551
`'validation'`	914

Features:

{
    "translation": {
        "languages": [
            "ro",
            "it"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-ro-nl

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-ro-nl')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	1680
`'train'`	206920
`'validation'`	913

Features:

{
    "translation": {
        "languages": [
            "ro",
            "nl"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-ar-en

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-ar-en')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	8583
`'train'`	231713
`'validation'`	888

Features:

{
    "translation": {
        "languages": [
            "ar",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-de-en

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-de-en')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	8079
`'train'`	206112
`'validation'`	888

Features:

{
    "translation": {
        "languages": [
            "de",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-en-ar

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-ar')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	8583
`'train'`	231713
`'validation'`	888

Features:

{
    "translation": {
        "languages": [
            "en",
            "ar"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-en-de

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-de')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	8079
`'train'`	206112
`'validation'`	888

Features:

{
    "translation": {
        "languages": [
            "en",
            "de"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-en-fr

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-fr')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	8597
`'train'`	232825
`'validation'`	890

Features:

{
    "translation": {
        "languages": [
            "en",
            "fr"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-en-ja

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-ja')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	8469
`'train'`	223108
`'validation'`	871

Features:

{
    "translation": {
        "languages": [
            "en",
            "ja"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-en-ko

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-ko')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	8514
`'train'`	230240
`'validation'`	879

Features:

{
    "translation": {
        "languages": [
            "en",
            "ko"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-en-zh

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-zh')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	8549
`'train'`	231266
`'validation'`	879

Features:

{
    "translation": {
        "languages": [
            "en",
            "zh"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-fr-en

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-fr-en')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	8597
`'train'`	232825
`'validation'`	890

Features:

{
    "translation": {
        "languages": [
            "fr",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-ja-en

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-ja-en')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	8469
`'train'`	223108
`'validation'`	871

Features:

{
    "translation": {
        "languages": [
            "ja",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-ko-en

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-ko-en')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	8514
`'train'`	230240
`'validation'`	879

Features:

{
    "translation": {
        "languages": [
            "ko",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}

iwslt2017-zh-en

Use the following command to load this dataset in TFDS:

ds = tfds.load('huggingface:iwslt2017/iwslt2017-zh-en')

Description:

The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:

  German, English, Italian, Dutch, Romanian.

For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'test'`	8549
`'train'`	231266
`'validation'`	879

Features:

{
    "translation": {
        "languages": [
            "zh",
            "en"
        ],
        "id": null,
        "_type": "Translation"
    }
}
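
The split tables on this page report identical example counts for the two directions of each language pair. The sketch below records the counts once per unordered pair and looks them up for either direction. `SPLIT_SIZES` and `split_sizes` are illustrative names, not part of TFDS, and the numbers are copied from the tables above rather than read from the dataset.

```python
# Example counts (test, train, validation) copied from the split tables on
# this page, stored once per unordered language pair.
SPLIT_SIZES = {
    frozenset(("en", "it")): (1566, 231619, 929),
    frozenset(("en", "nl")): (1777, 237240, 1003),
    frozenset(("en", "ro")): (1678, 220538, 914),
    frozenset(("it", "nl")): (1669, 233415, 1001),
    frozenset(("it", "ro")): (1643, 217551, 914),
    frozenset(("nl", "ro")): (1680, 206920, 913),
    frozenset(("ar", "en")): (8583, 231713, 888),
    frozenset(("de", "en")): (8079, 206112, 888),
    frozenset(("en", "fr")): (8597, 232825, 890),
    frozenset(("en", "ja")): (8469, 223108, 871),
    frozenset(("en", "ko")): (8514, 230240, 879),
    frozenset(("en", "zh")): (8549, 231266, 879),
}


def split_sizes(source, target):
    """Return the split sizes for a pair, in either direction."""
    test, train, validation = SPLIT_SIZES[frozenset((source, target))]
    return {"test": test, "train": train, "validation": validation}


print(split_sizes("it", "en"))
print(split_sizes("en", "it") == split_sizes("it", "en"))
```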