- Description:
The Opinosis Opinion Dataset consists of sentences extracted from reviews for 51 topics. Topics and opinions are obtained from Tripadvisor, Edmunds.com and Amazon.com.
Additional Documentation: Explore on Papers With Code
Homepage: http://kavita-ganesan.com/opinosis/
Source code:
tfds.datasets.opinosis.Builder
Versions:
1.0.0
(default): No release notes.
Download size:
739.65 KiB
Dataset size:
725.45 KiB
Auto-cached (documentation): Yes
Splits:
Split | Examples |
---|---|
'train' |
51 |
- Feature structure:
FeaturesDict({
'review_sents': Text(shape=(), dtype=string),
'summaries': Sequence(Text(shape=(), dtype=string)),
})
- Feature documentation:
Feature | Class | Shape | Dtype | Description |
---|---|---|---|---|
FeaturesDict | ||||
review_sents | Text | string | ||
summaries | Sequence(Text) | (None,) | string |
Supervised keys (See
as_supervised
doc):('review_sents', 'summaries')
Figure (tfds.show_examples): Not supported.
Examples (tfds.as_dataframe):
- Citation:
@inproceedings{ganesan2010opinosis,
title={Opinosis: a graph-based approach to abstractive summarization of highly redundant opinions},
author={Ganesan, Kavita and Zhai, ChengXiang and Han, Jiawei},
booktitle={Proceedings of the 23rd International Conference on Computational Linguistics},
pages={340--348},
year={2010},
organization={Association for Computational Linguistics}
}