adv_glue

References:

adv_sst2

Use the following command to load this dataset in TFDS:

import tensorflow_datasets as tfds
ds = tfds.load('huggingface:adv_glue/adv_sst2')
  • Description:
Adversarial GLUE Benchmark (AdvGLUE) is a comprehensive robustness evaluation benchmark
that focuses on the adversarial robustness evaluation of language models. It covers five
natural language understanding tasks from the famous GLUE tasks and is an adversarial
version of GLUE benchmark.
  • License: No known license
  • Version: 1.0.0
  • Splits:
Split         Examples
'validation'  148
  • Features:
{
    "sentence": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "idx": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    }
}
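
The feature listing above is plain JSON, so an example can be checked against it mechanically. The following is a minimal sketch and not part of TFDS or the Hugging Face API: `validate_example` and `PY_TYPES` are hypothetical names, and the range check is an assumption about how one might interpret `"dtype": "int32"`.

```python
import json

# Feature spec copied from the adv_sst2 listing above.
FEATURES = json.loads("""
{
    "sentence": {"dtype": "string", "id": null, "_type": "Value"},
    "idx": {"dtype": "int32", "id": null, "_type": "Value"}
}
""")

# Hypothetical mapping from the listed dtypes to Python types.
PY_TYPES = {"string": str, "int32": int}


def validate_example(example, features=FEATURES):
    """Return a list of problems found in `example`; empty means it matches."""
    problems = []
    for name, spec in features.items():
        if name not in example:
            problems.append(f"missing field: {name}")
            continue
        value = example[name]
        expected = PY_TYPES[spec["dtype"]]
        # bool is a subclass of int in Python, so reject it explicitly.
        if not isinstance(value, expected) or isinstance(value, bool):
            problems.append(f"{name}: expected {spec['dtype']}")
        elif spec["dtype"] == "int32" and not -2**31 <= value < 2**31:
            problems.append(f"{name}: out of int32 range")
    for name in example:
        if name not in features:
            problems.append(f"unexpected field: {name}")
    return problems
```

The same function can be reused for the other configs by passing their feature listings as `features`.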

adv_qqp

Use the following command to load this dataset in TFDS:

import tensorflow_datasets as tfds
ds = tfds.load('huggingface:adv_glue/adv_qqp')
  • Description:
Adversarial GLUE Benchmark (AdvGLUE) is a comprehensive robustness evaluation benchmark
that focuses on the adversarial robustness evaluation of language models. It covers five
natural language understanding tasks from the famous GLUE tasks and is an adversarial
version of GLUE benchmark.
  • License: No known license
  • Version: 1.0.0
  • Splits:
Split         Examples
'validation'  78
  • Features:
{
    "question1": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question2": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "idx": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    }
}
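
When a TFDS dataset is iterated through `tfds.as_numpy`, string features typically arrive as `bytes` rather than `str`. The following is a minimal sketch of decoding an example shaped like the adv_qqp features above. `decode_example` is a hypothetical helper, the record is made up for illustration, and UTF-8 encoding is an assumption.

```python
def decode_example(example, encoding="utf-8"):
    """Return a copy of `example` with bytes values decoded to str."""
    decoded = {}
    for name, value in example.items():
        if isinstance(value, bytes):
            decoded[name] = value.decode(encoding)
        else:
            # Non-string features such as `idx` are passed through unchanged.
            decoded[name] = value
    return decoded


# A made-up record with the field names listed for adv_qqp.
raw = {
    "question1": b"How do I learn Python?",
    "question2": b"What is the best way to learn Python?",
    "idx": 0,
}
print(decode_example(raw)["question1"])
```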

adv_mnli

Use the following command to load this dataset in TFDS:

import tensorflow_datasets as tfds
ds = tfds.load('huggingface:adv_glue/adv_mnli')
  • Description:
Adversarial GLUE Benchmark (AdvGLUE) is a comprehensive robustness evaluation benchmark
that focuses on the adversarial robustness evaluation of language models. It covers five
natural language understanding tasks from the famous GLUE tasks and is an adversarial
version of GLUE benchmark.
  • License: No known license
  • Version: 1.0.0
  • Splits:
Split         Examples
'validation'  121
  • Features:
{
    "premise": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "hypothesis": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "idx": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    }
}

adv_mnli_mismatched

Use the following command to load this dataset in TFDS:

import tensorflow_datasets as tfds
ds = tfds.load('huggingface:adv_glue/adv_mnli_mismatched')
  • Description:
Adversarial GLUE Benchmark (AdvGLUE) is a comprehensive robustness evaluation benchmark
that focuses on the adversarial robustness evaluation of language models. It covers five
natural language understanding tasks from the famous GLUE tasks and is an adversarial
version of GLUE benchmark.
  • License: No known license
  • Version: 1.0.0
  • Splits:
Split         Examples
'validation'  162
  • Features:
{
    "premise": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "hypothesis": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "idx": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    }
}

adv_qnli

Use the following command to load this dataset in TFDS:

import tensorflow_datasets as tfds
ds = tfds.load('huggingface:adv_glue/adv_qnli')
  • Description:
Adversarial GLUE Benchmark (AdvGLUE) is a comprehensive robustness evaluation benchmark
that focuses on the adversarial robustness evaluation of language models. It covers five
natural language understanding tasks from the famous GLUE tasks and is an adversarial
version of GLUE benchmark.
  • License: No known license
  • Version: 1.0.0
  • Splits:
Split         Examples
'validation'  148
  • Features:
{
    "question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sentence": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "idx": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    }
}

adv_rte

Use the following command to load this dataset in TFDS:

import tensorflow_datasets as tfds
ds = tfds.load('huggingface:adv_glue/adv_rte')
  • Description:
Adversarial GLUE Benchmark (AdvGLUE) is a comprehensive robustness evaluation benchmark
that focuses on the adversarial robustness evaluation of language models. It covers five
natural language understanding tasks from the famous GLUE tasks and is an adversarial
version of GLUE benchmark.
  • License: No known license
  • Version: 1.0.0
  • Splits:
Split         Examples
'validation'  81
  • Features:
{
    "sentence1": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sentence2": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "idx": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    }
}
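
All six configs on this page follow the same naming pattern and expose only a 'validation' split, so they can be handled with one small helper. The following is a minimal sketch: `VALIDATION_SIZES`, `tfds_name`, and `load_validation` are hypothetical names, the split sizes are copied from the listings above, and `load_validation` assumes `tensorflow_datasets` is installed and the dataset can be downloaded.

```python
# Validation split sizes as listed on this page.
VALIDATION_SIZES = {
    "adv_sst2": 148,
    "adv_qqp": 78,
    "adv_mnli": 121,
    "adv_mnli_mismatched": 162,
    "adv_qnli": 148,
    "adv_rte": 81,
}


def tfds_name(config):
    """Build the TFDS name used in the load commands on this page."""
    if config not in VALIDATION_SIZES:
        raise ValueError(f"unknown adv_glue config: {config}")
    return f"huggingface:adv_glue/{config}"


def load_validation(config):
    """Load the 'validation' split of one config (needs tensorflow_datasets)."""
    import tensorflow_datasets as tfds  # imported lazily; a heavy dependency
    return tfds.load(tfds_name(config), split="validation")


total = sum(VALIDATION_SIZES.values())
print(f"{len(VALIDATION_SIZES)} configs, {total} validation examples in total")
```

The page lists six configs for the five tasks named in the description, since MNLI appears in both a matched and a mismatched form.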