อ้างอิง:
สร้าง_บทวิจารณ์_th
ใช้คำสั่งต่อไปนี้เพื่อโหลดชุดข้อมูลนี้ใน TFDS:
ds = tfds.load('huggingface:generated_reviews_enth/generated_reviews_enth')
- คำอธิบาย :
`generated_reviews_enth`
Generated product reviews dataset for machine translation quality prediction, part of [scb-mt-en-th-2020](https://arxiv.org/pdf/2007.03541.pdf)
`generated_reviews_enth` is created as part of [scb-mt-en-th-2020](https://arxiv.org/pdf/2007.03541.pdf) for machine translation task.
This dataset (referred to as `generated_reviews_yn` in [scb-mt-en-th-2020](https://arxiv.org/pdf/2007.03541.pdf)) are English product reviews
generated by [CTRL](https://arxiv.org/abs/1909.05858), translated by Google Translate API and annotated as accepted or rejected (`correct`)
based on fluency and adequacy of the translation by human annotators.
This allows it to be used for English-to-Thai translation quality esitmation (binary label), machine translation, and sentiment analysis.
- ใบอนุญาต : ไม่มีใบอนุญาตที่รู้จัก
- เวอร์ชัน : 1.0.0
- แยก :
แยก | ตัวอย่าง |
---|---|
'test' | 17453 |
'train' | 141369 |
'validation' | 15708 |
- คุณสมบัติ :
{
"translation": {
"languages": [
"en",
"th"
],
"id": null,
"_type": "Translation"
},
"review_star": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"correct": {
"num_classes": 2,
"names": [
"neg",
"pos"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}