参考:
canonical
使用以下命令在 TFDS 中加载此数据集:
ds = tfds.load('huggingface:ascent_kb/canonical')
- 说明:
This dataset contains 8.9M commonsense assertions extracted by the Ascent pipeline (https://ascent.mpi-inf.mpg.de/).
- 许可:The Creative Commons Attribution 4.0 International License. https://creativecommons.org/licenses/by/4.0/
- 版本:1.0.0
- 拆分:
拆分 | 样本 |
---|---|
'train' |
8904060 |
- 特征:
{
"arg1": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"rel": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"arg2": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"support": {
"dtype": "int64",
"id": null,
"_type": "Value"
},
"facets": [
{
"value": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"support": {
"dtype": "int64",
"id": null,
"_type": "Value"
}
}
],
"source_sentences": [
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"source": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
]
}
open
使用以下命令在 TFDS 中加载此数据集:
ds = tfds.load('huggingface:ascent_kb/open')
- 说明:
This dataset contains 8.9M commonsense assertions extracted by the Ascent pipeline (https://ascent.mpi-inf.mpg.de/).
- 许可:The Creative Commons Attribution 4.0 International License. https://creativecommons.org/licenses/by/4.0/
- 版本:1.0.0
- 拆分:
拆分 | 样本 |
---|---|
'train' |
8904060 |
- 特征:
{
"subject": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"predicate": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"object": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"support": {
"dtype": "int64",
"id": null,
"_type": "Value"
},
"facets": [
{
"value": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"type": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"support": {
"dtype": "int64",
"id": null,
"_type": "Value"
}
}
],
"source_sentences": [
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"source": {
"dtype": "string",
"id": null,
"_type": "Value"
}
}
]
}