Tài liệu tham khảo:
Sử dụng lệnh sau để tải tập dữ liệu này trong TFDS:
ds = tfds.load('huggingface:kan_hope')
- Sự miêu tả :
Numerous methods have been developed to monitor the spread of negativity in modern years by
eliminating vulgar, offensive, and fierce comments from social media platforms. However, there are relatively
lesser amounts of study that converges on embracing positivity, reinforcing supportive and reassuring content in online forums.
Consequently, we propose creating an English Kannada Hope speech dataset, KanHope and comparing several experiments to benchmark the dataset.
The dataset consists of 6,176 user generated comments in code mixed Kannada scraped from YouTube and manually annotated as bearing hope
speech or Not-hope speech.
This dataset was prepared for hope-speech text classification benchmark on code-mixed Kannada, an under-resourced language.
- Giấy phép : Giấy phép quốc tế Creative Commons Ghi công 4.0
- Phiên bản : 0.0.0
- Chia tách :
Tách ra | Ví dụ |
---|---|
'test' | 618 |
'train' | 4940 |
- Đặc trưng :
{
"text": {
"dtype": "string",
"id": null,
"_type": "Value"
},
"label": {
"num_classes": 2,
"names": [
"Not-Hope",
"Hope"
],
"names_file": null,
"id": null,
"_type": "ClassLabel"
}
}