References:
iwslt2017-en-it
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-it')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 1566 |
'train' | 231619 |
'validation' | 929 |
- Features:
{
"translation": {
"languages": [
"en",
"it"
],
"id": null,
"_type": "Translation"
}
}
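The feature specification above suggests that each example carries a single `translation` entry holding one string per language code. A minimal sketch of unpacking such an example into a (source, target) pair follows. It assumes examples behave like nested dictionaries keyed by language code and that strings may arrive as UTF-8 bytes. The helper names `to_text` and `split_pair` and the sample sentences are hypothetical, not part of the dataset or of TFDS.

```python
from typing import Any, Mapping, Tuple


def to_text(value: Any) -> str:
    """Return a str, decoding UTF-8 bytes if that is what was received."""
    if isinstance(value, bytes):
        return value.decode("utf-8")
    return str(value)


def split_pair(example: Mapping[str, Any],
               source: str = "en",
               target: str = "it") -> Tuple[str, str]:
    """Extract (source_text, target_text) from one translation example.

    Assumption: example["translation"] maps language codes to sentences.
    """
    translation = example["translation"]
    return to_text(translation[source]), to_text(translation[target])


# Hypothetical sample shaped like the feature specification (not real data).
sample = {"translation": {"en": "Thank you very much.", "it": "Grazie mille."}}
src, tgt = split_pair(sample)
print(src, "->", tgt)
```

Checking one real example from the loaded dataset before relying on this layout is advisable, since the TFDS wrapper may deliver tensors rather than plain Python values.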
iwslt2017-en-nl
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-nl')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 1777 |
'train' | 237240 |
'validation' | 1003 |
- Features:
{
"translation": {
"languages": [
"en",
"nl"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-en-ro
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-ro')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 1678 |
'train' | 220538 |
'validation' | 914 |
- Features:
{
"translation": {
"languages": [
"en",
"ro"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-it-en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-it-en')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 1566 |
'train' | 231619 |
'validation' | 929 |
- Features:
{
"translation": {
"languages": [
"it",
"en"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-it-nl
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-it-nl')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 1669 |
'train' | 233415 |
'validation' | 1001 |
- Features:
{
"translation": {
"languages": [
"it",
"nl"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-it-ro
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-it-ro')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 1643 |
'train' | 217551 |
'validation' | 914 |
- Features:
{
"translation": {
"languages": [
"it",
"ro"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-nl-en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-nl-en')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 1777 |
'train' | 237240 |
'validation' | 1003 |
- Features:
{
"translation": {
"languages": [
"nl",
"en"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-nl-it
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-nl-it')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 1669 |
'train' | 233415 |
'validation' | 1001 |
- Features:
{
"translation": {
"languages": [
"nl",
"it"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-nl-ro
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-nl-ro')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 1680 |
'train' | 206920 |
'validation' | 913 |
- Features:
{
"translation": {
"languages": [
"nl",
"ro"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-ro-en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-ro-en')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 1678 |
'train' | 220538 |
'validation' | 914 |
- Features:
{
"translation": {
"languages": [
"ro",
"en"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-ro-it
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-ro-it')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 1643 |
'train' | 217551 |
'validation' | 914 |
- Features:
{
"translation": {
"languages": [
"ro",
"it"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-ro-nl
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-ro-nl')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 1680 |
'train' | 206920 |
'validation' | 913 |
- Features:
{
"translation": {
"languages": [
"ro",
"nl"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-ar-en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-ar-en')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 8583 |
'train' | 231713 |
'validation' | 888 |
- Features:
{
"translation": {
"languages": [
"ar",
"en"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-de-en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-de-en')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 8079 |
'train' | 206112 |
'validation' | 888 |
- Features:
{
"translation": {
"languages": [
"de",
"en"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-en-ar
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-ar')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 8583 |
'train' | 231713 |
'validation' | 888 |
- Features:
{
"translation": {
"languages": [
"en",
"ar"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-en-de
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-de')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 8079 |
'train' | 206112 |
'validation' | 888 |
- Features:
{
"translation": {
"languages": [
"en",
"de"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-en-fr
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-fr')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 8597 |
'train' | 232825 |
'validation' | 890 |
- Features:
{
"translation": {
"languages": [
"en",
"fr"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-en-ja
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-ja')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 8469 |
'train' | 223108 |
'validation' | 871 |
- Features:
{
"translation": {
"languages": [
"en",
"ja"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-en-ko
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-ko')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 8514 |
'train' | 230240 |
'validation' | 879 |
- Features:
{
"translation": {
"languages": [
"en",
"ko"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-en-zh
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-en-zh')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 8549 |
'train' | 231266 |
'validation' | 879 |
- Features:
{
"translation": {
"languages": [
"en",
"zh"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-fr-en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-fr-en')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 8597 |
'train' | 232825 |
'validation' | 890 |
- Features:
{
"translation": {
"languages": [
"fr",
"en"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-ja-en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-ja-en')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 8469 |
'train' | 223108 |
'validation' | 871 |
- Features:
{
"translation": {
"languages": [
"ja",
"en"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-ko-en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-ko-en')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 8514 |
'train' | 230240 |
'validation' | 879 |
- Features:
{
"translation": {
"languages": [
"ko",
"en"
],
"id": null,
"_type": "Translation"
}
}
iwslt2017-zh-en
Use the following command to load this dataset in TFDS:
ds = tfds.load('huggingface:iwslt2017/iwslt2017-zh-en')
- Description:
The IWSLT 2017 Evaluation Campaign includes a multilingual TED Talks MT task. The languages involved are five:
German, English, Italian, Dutch, Romanian.
For each language pair, training and development sets are available through the entry of the table below: by clicking, an archive will be downloaded which contains the sets and a README file. Numbers in the table refer to millions of units (untokenized words) of the target side of all parallel training sets.
- License: No known license
- Version: 1.0.0
- Splits:
Split | Examples |
---|---|
'test' | 8549 |
'train' | 231266 |
'validation' | 879 |
- Features:
{
"translation": {
"languages": [
"zh",
"en"
],
"id": null,
"_type": "Translation"
}
}
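Every configuration on this page follows the naming pattern `iwslt2017-<source>-<target>`. The sketch below builds the TFDS name for a language pair and rejects pairs that this page does not list. `PAIRS` and `tfds_name` are hypothetical helper names, and the pair list is transcribed from the sections above rather than queried from TFDS.

```python
from typing import Set, Tuple

# Languages paired with each other in both directions on this page.
CORE = ("en", "it", "nl", "ro")
# Languages that appear only paired with English on this page.
ENGLISH_ONLY = ("ar", "de", "fr", "ja", "ko", "zh")

PAIRS: Set[Tuple[str, str]] = {(s, t) for s in CORE for t in CORE if s != t}
PAIRS |= {(lang, "en") for lang in ENGLISH_ONLY}
PAIRS |= {("en", lang) for lang in ENGLISH_ONLY}


def tfds_name(source: str, target: str) -> str:
    """Build the name string to pass to tfds.load for a listed pair."""
    if (source, target) not in PAIRS:
        raise ValueError(f"pair {source}-{target} is not listed on this page")
    return f"huggingface:iwslt2017/iwslt2017-{source}-{target}"


print(tfds_name("en", "ro"))
```

The returned string can be passed to `tfds.load` in place of the literal names shown in each section.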