参考文献:
るーぷる
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:poleval2019_mt/ru-pl')
- 説明:
PolEval is a SemEval-inspired evaluation campaign for natural language processing tools for Polish.Submitted solutions compete against one another within certain tasks selected by organizers, using available data and are evaluated according topre-established procedures. One of the tasks in PolEval-2019 was Machine Translation (Task-4).
The task is to train as good as possible machine translation system, using any technology,with limited textual resources.The competition will be done for 2 language pairs, more popular English-Polish (into Polish direction) and pair that can be called low resourcedRussian-Polish (in both directions).
Here, Polish-English is also made available to allow for training in both directions. However, the test data is ONLY available for English-Polish.
- ライセンス: 不明なライセンス
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'test' | 2969 |
'train' | 20001 |
'validation' | 3001 |
- 特徴:
{
"translation": {
"languages": [
"ru",
"pl"
],
"id": null,
"_type": "Translation"
}
}
英語
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:poleval2019_mt/en-pl')
- 説明:
PolEval is a SemEval-inspired evaluation campaign for natural language processing tools for Polish.Submitted solutions compete against one another within certain tasks selected by organizers, using available data and are evaluated according topre-established procedures. One of the tasks in PolEval-2019 was Machine Translation (Task-4).
The task is to train as good as possible machine translation system, using any technology,with limited textual resources.The competition will be done for 2 language pairs, more popular English-Polish (into Polish direction) and pair that can be called low resourcedRussian-Polish (in both directions).
Here, Polish-English is also made available to allow for training in both directions. However, the test data is ONLY available for English-Polish.
- ライセンス: 不明なライセンス
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'test' | 9845 |
'train' | 129255 |
'validation' | 10001 |
- 特徴:
{
"translation": {
"languages": [
"en",
"pl"
],
"id": null,
"_type": "Translation"
}
}
プルル
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:poleval2019_mt/pl-ru')
- 説明:
PolEval is a SemEval-inspired evaluation campaign for natural language processing tools for Polish.Submitted solutions compete against one another within certain tasks selected by organizers, using available data and are evaluated according topre-established procedures. One of the tasks in PolEval-2019 was Machine Translation (Task-4).
The task is to train as good as possible machine translation system, using any technology,with limited textual resources.The competition will be done for 2 language pairs, more popular English-Polish (into Polish direction) and pair that can be called low resourcedRussian-Polish (in both directions).
Here, Polish-English is also made available to allow for training in both directions. However, the test data is ONLY available for English-Polish.
- ライセンス: 不明なライセンス
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'test' | 2967 |
'train' | 20001 |
'validation' | 3001 |
- 特徴:
{
"translation": {
"languages": [
"pl",
"ru"
],
"id": null,
"_type": "Translation"
}
}
PL-en
次のコマンドを使用して、このデータセットを TFDS にロードします。
ds = tfds.load('huggingface:poleval2019_mt/pl-en')
- 説明:
PolEval is a SemEval-inspired evaluation campaign for natural language processing tools for Polish.Submitted solutions compete against one another within certain tasks selected by organizers, using available data and are evaluated according topre-established procedures. One of the tasks in PolEval-2019 was Machine Translation (Task-4).
The task is to train as good as possible machine translation system, using any technology,with limited textual resources.The competition will be done for 2 language pairs, more popular English-Polish (into Polish direction) and pair that can be called low resourcedRussian-Polish (in both directions).
Here, Polish-English is also made available to allow for training in both directions. However, the test data is ONLY available for English-Polish.
- ライセンス: 不明なライセンス
- バージョン: 1.0.0
- 分割:
スプリット | 例 |
---|---|
'test' | 1 |
'train' | 129255 |
'validation' | 10001 |
- 特徴:
{
"translation": {
"languages": [
"pl",
"en"
],
"id": null,
"_type": "Translation"
}
}