sharc_modified

Tài liệu tham khảo:

mod

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:sharc_modified/mod')
  • Sự miêu tả :
ShARC, a conversational QA task, requires a system to answer user questions based on rules expressed in natural language text. However, it is found that in the ShARC dataset there are multiple spurious patterns that could be exploited by neural models. SharcModified is a new dataset which reduces the patterns identified in the original dataset. To reduce the sensitivity of neural models, for each occurence of an instance conforming to any of the patterns, we automatically construct alternatives where we choose to either replace the current instance with an alternative instance which does not exhibit the pattern; or retain the original instance. The modified ShARC has two versions sharc-mod and history-shuffled. For morre details refer to Appendix A.3 .
  • Giấy phép : Không có giấy phép được biết đến
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 21890
'validation' 2270
  • Đặc trưng :
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "utterance_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "source_url": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "snippet": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "scenario": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "history": [
        {
            "follow_up_question": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "follow_up_answer": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            }
        }
    ],
    "evidence": [
        {
            "follow_up_question": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "follow_up_answer": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            }
        }
    ],
    "answer": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

mod_dev_multi

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:sharc_modified/mod_dev_multi')
  • Sự miêu tả :
ShARC, a conversational QA task, requires a system to answer user questions based on rules expressed in natural language text. However, it is found that in the ShARC dataset there are multiple spurious patterns that could be exploited by neural models. SharcModified is a new dataset which reduces the patterns identified in the original dataset. To reduce the sensitivity of neural models, for each occurence of an instance conforming to any of the patterns, we automatically construct alternatives where we choose to either replace the current instance with an alternative instance which does not exhibit the pattern; or retain the original instance. The modified ShARC has two versions sharc-mod and history-shuffled. For morre details refer to Appendix A.3 .
  • Giấy phép : Không có giấy phép được biết đến
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'validation' 2270
  • Đặc trưng :
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "utterance_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "source_url": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "snippet": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "scenario": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "history": [
        {
            "follow_up_question": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "follow_up_answer": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            }
        }
    ],
    "evidence": [
        {
            "follow_up_question": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "follow_up_answer": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            }
        }
    ],
    "answer": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "all_answers": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}

lịch sử

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:sharc_modified/history')
  • Sự miêu tả :
ShARC, a conversational QA task, requires a system to answer user questions based on rules expressed in natural language text. However, it is found that in the ShARC dataset there are multiple spurious patterns that could be exploited by neural models. SharcModified is a new dataset which reduces the patterns identified in the original dataset. To reduce the sensitivity of neural models, for each occurence of an instance conforming to any of the patterns, we automatically construct alternatives where we choose to either replace the current instance with an alternative instance which does not exhibit the pattern; or retain the original instance. The modified ShARC has two versions sharc-mod and history-shuffled. For morre details refer to Appendix A.3 .
  • Giấy phép : Không có giấy phép được biết đến
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'train' 21890
'validation' 2270
  • Đặc trưng :
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "utterance_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "source_url": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "snippet": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "scenario": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "history": [
        {
            "follow_up_question": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "follow_up_answer": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            }
        }
    ],
    "evidence": [
        {
            "follow_up_question": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "follow_up_answer": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            }
        }
    ],
    "answer": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

history_dev_multi

Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:

ds = tfds.load('huggingface:sharc_modified/history_dev_multi')
  • Sự miêu tả :
ShARC, a conversational QA task, requires a system to answer user questions based on rules expressed in natural language text. However, it is found that in the ShARC dataset there are multiple spurious patterns that could be exploited by neural models. SharcModified is a new dataset which reduces the patterns identified in the original dataset. To reduce the sensitivity of neural models, for each occurence of an instance conforming to any of the patterns, we automatically construct alternatives where we choose to either replace the current instance with an alternative instance which does not exhibit the pattern; or retain the original instance. The modified ShARC has two versions sharc-mod and history-shuffled. For morre details refer to Appendix A.3 .
  • Giấy phép : Không có giấy phép được biết đến
  • Phiên bản : 1.0.0
  • Chia tách :
Tách ra Ví dụ
'validation' 2270
  • Đặc trưng :
{
    "id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "utterance_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "source_url": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "snippet": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "question": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "scenario": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "history": [
        {
            "follow_up_question": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "follow_up_answer": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            }
        }
    ],
    "evidence": [
        {
            "follow_up_question": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            },
            "follow_up_answer": {
                "dtype": "string",
                "id": null,
                "_type": "Value"
            }
        }
    ],
    "answer": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "all_answers": {
        "feature": {
            "dtype": "string",
            "id": null,
            "_type": "Value"
        },
        "length": -1,
        "id": null,
        "_type": "Sequence"
    }
}