参考:
ehealth_kd
使用以下命令在 TFDS 中加载此数据集:
ds = tfds.load('huggingface:ehealth_kd/ehealth_kd')
- 说明:
Dataset of the eHealth Knowledge Discovery Challenge at IberLEF 2020. It is designed for
the identification of semantic entities and relations in Spanish health documents.
- 许可:https://creativecommons.org/licenses/by-nc-sa/4.0/
- 版本:1.0.0
- 拆分:
拆分 | 样本 |
---|---|
'test' |
100 |
'train' |
800 |
'validation' |
199 |
- 特征:
{
"sentence": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"entities": [
{
"ent_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"ent_text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"ent_label": {
"num_classes": 4,
"names": [
"Concept",
"Action",
"Predicate",
"Reference"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"start_character": {
"dtype": "int32",
"id": null,
"_type": "Value"
},
"end_character": {
"dtype": "int32",
"id": null,
"_type": "Value"
}
}
],
"relations": [
{
"rel_id": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"rel_label": {
"num_classes": 13,
"names": [
"is-a",
"same-as",
"has-property",
"part-of",
"causes",
"entails",
"in-time",
"in-place",
"in-context",
"subject",
"target",
"domain",
"arg"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
},
"arg1": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"arg2": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
]
}