ascent_kb

参考:

canonical

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:ascent_kb/canonical')
  • 说明
This dataset contains 8.9M commonsense assertions extracted by the Ascent pipeline (https://ascent.mpi-inf.mpg.de/).
拆分 样本
'train' 8904060
  • 特征
{
    "arg1": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "rel": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "arg2": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "support": {
        "dtype": "int64",
        "id": null,
        "_type": "Value"
    },
    "facets": [
        {
            "value": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "type": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "support": {
                "dtype": "int64",
                "id": null,
                "_type": "Value"
            }
        }
    ],
    "source_sentences": [
        {
            "text": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "source": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            }
        }
    ]
}

open

使用以下命令在 TFDS 中加载此数据集:

ds = tfds.load('huggingface:ascent_kb/open')
  • 说明
This dataset contains 8.9M commonsense assertions extracted by the Ascent pipeline (https://ascent.mpi-inf.mpg.de/).
拆分 样本
'train' 8904060
  • 特征
{
    "subject": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "predicate": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "object": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "support": {
        "dtype": "int64",
        "id": null,
        "_type": "Value"
    },
    "facets": [
        {
            "value": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "type": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "support": {
                "dtype": "int64",
                "id": null,
                "_type": "Value"
            }
        }
    ],
    "source_sentences": [
        {
            "text": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "source": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            }
        }
    ]
}