urdu_sentiment_corpus

Use the following command to load this dataset in TFDS:

import tensorflow_datasets as tfds
ds = tfds.load('huggingface:urdu_sentiment_corpus')

Description:

The “Urdu Sentiment Corpus” (USC) provides Urdu tweets for sentiment analysis and polarity detection.
The dataset consists of tweets that together comprise over 17,185 tokens,
with 52% of records labeled positive and 48% labeled negative.

License: No known license
Version: 1.0.0
Splits:

Split	Examples
`'train'`	1000

Features:

{
    "sentence": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "sentiment": {
        "num_classes": 3,
        "names": [
            "P",
            "N",
            "O"
        ],
        "names_file": null,
        "id": null,
        "_type": "ClassLabel"
    }
}
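
The "sentiment" feature above is a ClassLabel, which stores each label as an integer index into the "names" list. As a minimal sketch of how those indices map back to the codes, the helpers below use only the list shown in the feature spec. The function names `decode_sentiment` and `encode_sentiment` are hypothetical and are not part of TFDS. "P" and "N" presumably correspond to the positive and negative classes mentioned in the description; the meaning of "O" is not stated on this page.

```python
# Label names copied from the "sentiment" ClassLabel feature spec.
# The list position of each name is its integer class index.
SENTIMENT_NAMES = ["P", "N", "O"]


def decode_sentiment(index: int) -> str:
    """Return the label name for an integer class index (hypothetical helper)."""
    if not 0 <= index < len(SENTIMENT_NAMES):
        raise ValueError(f"sentiment index out of range: {index}")
    return SENTIMENT_NAMES[index]


def encode_sentiment(name: str) -> int:
    """Return the integer class index for a label name (hypothetical helper)."""
    if name not in SENTIMENT_NAMES:
        raise ValueError(f"unknown sentiment name: {name!r}")
    return SENTIMENT_NAMES.index(name)


if __name__ == "__main__":
    # Print the full index-to-name mapping.
    for i in range(len(SENTIMENT_NAMES)):
        print(i, decode_sentiment(i))
```

Assuming a loaded example exposes "sentiment" as an integer, passing that value to `decode_sentiment` would recover the label code.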

Last updated 2022-06-28 UTC.