TFDS now supports the Croissant 🥐 format! Read the documentation to know more.

biosses

参考：

使用以下命令在 TFDS 中加载此数据集：

ds = tfds.load('huggingface:biosses')

说明：

BIOSSES is a benchmark dataset for biomedical sentence similarity estimation. The dataset comprises 100 sentence pairs, in which each sentence was selected from the TAC (Text Analysis Conference) Biomedical Summarization Track Training Dataset containing articles from the biomedical domain. The sentence pairs were evaluated by five different human experts that judged their similarity and gave scores ranging from 0 (no relation) to 4 (equivalent).

许可：BIOSSES 根据 The GNU Common Public License v.3.0 的条款提供。
版本：0.0.0
拆分：

拆分	样本
`'train'`	100

特征：

{
    "sentence1": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sentence2": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "score": {
        "dtype": "float32",
        "id": null,
        "_type": "Value"
    }
}

biosses 使用集合让一切井井有条 根据您的偏好保存内容并对其进行分类。

biosses