flores
Evaluation datasets for low-resource machine translation: Nepali-English and Sinhala-English.
Additional documentation: Explore on Papers With Code (https://paperswithcode.com/dataset/flores)
Homepage: https://github.com/facebookresearch/flores/
Source code: tfds.translate.Flores (https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/translate/flores.py)
Versions: 1.2.0 (default): No release notes.
Download size: 1.47 MiB
Auto-cached: Yes
@misc{guzmn2019new,
title={Two New Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English},
author={Francisco Guzman and Peng-Jen Chen and Myle Ott and Juan Pino and Guillaume Lample and Philipp Koehn and Vishrav Chaudhary and Marc'Aurelio Ranzato},
year={2019},
eprint={1902.01382},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
flores/neen (default config)
Config description: Translation dataset from ne to en.
Dataset size: 1.89 MiB
Supervised keys (see the as_supervised doc): ('ne', 'en')
| Split | Examples |
|---|---|
| 'test' | 2,835 |
| 'validation' | 2,559 |
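The config ships only 'test' and 'validation' splits (there is no 'train' split), so checking the downloaded data against the listed counts is cheap. The following is a minimal sketch, assuming the `tensorflow_datasets` package is installed and the dataset can be downloaded. `EXPECTED_COUNTS`, `count_mismatches`, and `load_neen_counts` are hypothetical helper names introduced here for illustration; they are not part of TFDS.

```python
# Example counts copied from the flores/neen split table.
EXPECTED_COUNTS = {"test": 2835, "validation": 2559}


def count_mismatches(actual, expected=EXPECTED_COUNTS):
    """Return {split: (expected, actual)} for each split whose count differs."""
    return {
        split: (n, actual.get(split))
        for split, n in expected.items()
        if actual.get(split) != n
    }


def load_neen_counts():
    """Download/prepare flores/neen and return {split_name: num_examples}."""
    # Imported lazily so count_mismatches works without TFDS installed.
    import tensorflow_datasets as tfds

    # With split=None, tfds.load returns all splits plus the DatasetInfo.
    _, info = tfds.load("flores/neen", with_info=True)
    return {name: split.num_examples for name, split in info.splits.items()}
```

Calling `count_mismatches(load_neen_counts())` should return an empty dict when the prepared dataset agrees with the table; any entry in the result names a split whose size differs.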
Translation({
'en': Text(shape=(), dtype=string),
'ne': Text(shape=(), dtype=string),
})
| Feature | Class | Shape | Dtype | Description |
|---|---|---|---|---|
| | Translation | | | |
| en | Text | | string | |
| ne | Text | | string | |
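Because both features are scalar strings, examples read through `tfds.as_numpy` arrive as UTF-8 encoded bytes and usually need decoding before use. The sketch below shows one way to do that, assuming `tensorflow_datasets` is installed. `decode_pair` and `iter_pairs` are hypothetical helper names, not TFDS functions.

```python
def decode_pair(example, src="ne", tgt="en"):
    """Turn one example dict into a (source, target) tuple of Python strings."""

    def to_str(value):
        # as_numpy yields bytes for string features; pass str through unchanged.
        return value.decode("utf-8") if isinstance(value, bytes) else str(value)

    return to_str(example[src]), to_str(example[tgt])


def iter_pairs(config="neen", split="validation", src="ne", tgt="en"):
    """Yield decoded (source, target) pairs from one flores config and split."""
    # Imported lazily so decode_pair can be used without TFDS installed.
    import tensorflow_datasets as tfds

    ds = tfds.load(f"flores/{config}", split=split)
    for example in tfds.as_numpy(ds):
        yield decode_pair(example, src, tgt)
```

For the Sinhala config, the same helper would be called as `iter_pairs("sien", "validation", src="si")`.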
flores/sien
Config description: Translation dataset from si to en.
Dataset size: 2.05 MiB
Supervised keys (see the as_supervised doc): ('si', 'en')
| Split | Examples |
|---|---|
| 'test' | 2,766 |
| 'validation' | 2,898 |
Translation({
'en': Text(shape=(), dtype=string),
'si': Text(shape=(), dtype=string),
})
| Feature | Class | Shape | Dtype | Description |
|---|---|---|---|---|
| | Translation | | | |
| en | Text | | string | |
| si | Text | | string | |
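The TFDS catalog lists the supervised keys for this config as `('si', 'en')`, so loading with `as_supervised=True` yields (Sinhala, English) tuples. Since the dataset is meant for evaluation, one plausible use is exporting a split to line-aligned source and reference files for a scoring tool. The sketch below assumes `tensorflow_datasets` is installed. `write_parallel` and `sien_pairs` are hypothetical helper names, and the output file names are up to the caller.

```python
def write_parallel(pairs, src_path, ref_path):
    """Write (source, reference) pairs to two line-aligned UTF-8 text files.

    Returns the number of pairs written.
    """
    count = 0
    with open(src_path, "w", encoding="utf-8") as f_src, \
            open(ref_path, "w", encoding="utf-8") as f_ref:
        for src, ref in pairs:
            # Collapse internal whitespace so an embedded newline cannot
            # break the one-sentence-per-line alignment.
            f_src.write(" ".join(src.split()) + "\n")
            f_ref.write(" ".join(ref.split()) + "\n")
            count += 1
    return count


def sien_pairs(split="test"):
    """Yield decoded (si, en) string pairs from flores/sien."""
    # Imported lazily so write_parallel can be used without TFDS installed.
    import tensorflow_datasets as tfds

    ds = tfds.load("flores/sien", split=split, as_supervised=True)
    for si, en in tfds.as_numpy(ds):
        yield si.decode("utf-8"), en.decode("utf-8")
```

A call such as `write_parallel(sien_pairs("test"), "test.si", "test.en")` would then produce one file per language, with line i of each file holding the same sentence pair.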
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2022-12-06 UTC.