eraser_multi_rc

Description:

Eraser Multi RC is a dataset for queries over multi-line passages, along with answers and rationales. Each example in this dataset has the following five parts:

1. A multi-line passage
2. A query about the passage
3. An answer to the query
4. A classification of whether the answer is right or wrong
5. An explanation justifying the classification

Additional documentation: Explore on Papers With Code
Homepage: https://cogcomp.seas.upenn.edu/multirc/
Source code: tfds.text.EraserMultiRc
Versions:
- 0.1.1 (default): No release notes.
Download size: 1.59 MiB
Dataset size: 62.59 MiB
Auto-cached (documentation): Yes
Splits:

Split	Examples
`'test'`	4,848
`'train'`	24,029
`'validation'`	3,214
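
As a quick check on the split sizes listed above, the following sketch computes the total number of examples and each split's share of it. The counts are copied from the table; the variable names are illustrative only and are not part of the dataset API.

```python
# Split sizes copied from the splits table in this entry.
SPLIT_SIZES = {"test": 4848, "train": 24029, "validation": 3214}

# Total number of examples across all three splits.
total = sum(SPLIT_SIZES.values())

# Fraction of the whole dataset contributed by each split.
fractions = {name: count / total for name, count in SPLIT_SIZES.items()}

for name, share in fractions.items():
    print(f"{name}: {SPLIT_SIZES[name]} examples ({share:.1%})")
print(f"total: {total} examples")
```

The split is roughly 75% train, 15% test, and 10% validation.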

Feature structure:

FeaturesDict({
    'evidences': Sequence(Text(shape=(), dtype=string)),
    'label': ClassLabel(shape=(), dtype=int64, num_classes=2),
    'passage': Text(shape=(), dtype=string),
    'query_and_answer': Text(shape=(), dtype=string),
})

Feature documentation:

Feature	Class	Shape	Dtype
	FeaturesDict
evidences	Sequence(Text)	(None,)	string
label	ClassLabel		int64
passage	Text		string
query_and_answer	Text		string
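
The feature structure above can be read with the standard `tfds.load` and `tfds.as_numpy` calls. The sketch below is one possible way to turn the returned examples into plain Python values, assuming `tensorflow_datasets` is installed and the dataset can be downloaded. The helper names `decode_example` and `load_decoded` are hypothetical, and the toy example at the bottom is invented for illustration; it is not taken from the dataset.

```python
def decode_example(example):
    """Convert one example dict (bytes / NumPy values) into plain Python types."""

    def to_str(value):
        # String features come back as bytes when iterating with tfds.as_numpy.
        return value.decode("utf-8") if isinstance(value, bytes) else str(value)

    return {
        "passage": to_str(example["passage"]),
        "query_and_answer": to_str(example["query_and_answer"]),
        "label": int(example["label"]),  # ClassLabel with 2 classes, stored as int64
        "evidences": [to_str(e) for e in example["evidences"]],
    }


def load_decoded(split="validation", limit=3):
    """Load a few examples from one split and decode them (hypothetical helper)."""
    # Imported lazily so decode_example can be used without TensorFlow installed.
    import tensorflow_datasets as tfds

    ds = tfds.load("eraser_multi_rc", split=split)
    return [decode_example(ex) for ex in tfds.as_numpy(ds.take(limit))]


# Invented toy example with the same keys as the FeaturesDict above.
toy = {
    "passage": b"Sentence one. Sentence two.",
    "query_and_answer": b"A toy question with a toy answer",
    "label": 1,
    "evidences": [b"Sentence two."],
}
decoded_toy = decode_example(toy)
print(decoded_toy)
```

Calling `load_decoded("train", limit=5)` would download the data on first use and return five decoded dictionaries.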

Supervised keys (see as_supervised doc): None
Figure (tfds.show_examples): Not supported.
Examples (tfds.as_dataframe):

Citation:

@unpublished{eraser2019,
    title = {ERASER: A Benchmark to Evaluate Rationalized NLP Models},
    author = {Jay DeYoung and Sarthak Jain and Nazneen Fatema Rajani and Eric Lehman and Caiming Xiong and Richard Socher and Byron C. Wallace}
}
@inproceedings{MultiRC2018,
    author = {Daniel Khashabi and Snigdha Chaturvedi and Michael Roth and Shyam Upadhyay and Dan Roth},
    title = {Looking Beyond the Surface: A Challenge Set for Reading Comprehension over Multiple Sentences},
    booktitle = {NAACL},
    year = {2018}
}