sentiment140
Stay organized with collections
Save and categorize content based on your preferences.
Sentiment140 allows you to discover the sentiment of a brand, product, or topic
on Twitter.
The data is a CSV with emoticons removed. Data file format has 6 fields:
- the polarity of the tweet (0 = negative, 2 = neutral, 4 = positive)
- the id of the tweet (2087)
- the date of the tweet (Sat May 16 23:58:44 UTC 2009)
- the query (lyx). If there is no query, then this value is NO_QUERY.
- the user that tweeted (robotickilldozr)
- the text of the tweet (Lyx is cool)
For more information, refer to the paper Twitter Sentiment Classification with
Distant Supervision at
https://cs.stanford.edu/people/alecmgo/papers/TwitterDistantSupervision09.pdf
Split |
Examples |
'test' |
498 |
'train' |
1,600,000 |
FeaturesDict({
'date': Text(shape=(), dtype=string),
'polarity': int32,
'query': Text(shape=(), dtype=string),
'text': Text(shape=(), dtype=string),
'user': Text(shape=(), dtype=string),
})
Feature |
Class |
Shape |
Dtype |
Description |
|
FeaturesDict |
|
|
|
date |
Text |
|
string |
|
polarity |
Tensor |
|
int32 |
|
query |
Text |
|
string |
|
text |
Text |
|
string |
|
user |
Text |
|
string |
|
@ONLINE {Sentiment140,
author = "Go, Alec and Bhayani, Richa and Huang, Lei",
title = "Twitter Sentiment Classification using Distant Supervision",
year = "2009",
url = "http://help.sentiment140.com/home"
}
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2022-12-23 UTC.
[null,null,["Last updated 2022-12-23 UTC."],[],[],null,["# sentiment140\n\n\u003cbr /\u003e\n\n- **Description**:\n\nSentiment140 allows you to discover the sentiment of a brand, product, or topic\non Twitter.\n\nThe data is a CSV with emoticons removed. Data file format has 6 fields:\n\n1. the polarity of the tweet (0 = negative, 2 = neutral, 4 = positive)\n2. the id of the tweet (2087)\n3. the date of the tweet (Sat May 16 23:58:44 UTC 2009)\n4. the query (lyx). If there is no query, then this value is NO_QUERY.\n5. the user that tweeted (robotickilldozr)\n6. the text of the tweet (Lyx is cool)\n\nFor more information, refer to the paper Twitter Sentiment Classification with\nDistant Supervision at\n\u003chttps://cs.stanford.edu/people/alecmgo/papers/TwitterDistantSupervision09.pdf\u003e\n\n- **Additional Documentation** :\n [Explore on Papers With Code\n north_east](https://paperswithcode.com/dataset/sentiment140)\n\n- **Homepage** :\n \u003chttp://help.sentiment140.com/home\u003e\n\n- **Source code** :\n [`tfds.datasets.sentiment140.Builder`](https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/datasets/sentiment140/sentiment140_dataset_builder.py)\n\n- **Versions**:\n\n - **`1.0.0`** (default): No release notes.\n- **Download size** : `77.59 MiB`\n\n- **Dataset size** : `305.13 MiB`\n\n- **Auto-cached**\n ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):\n No\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|-----------|\n| `'test'` | 498 |\n| `'train'` | 1,600,000 |\n\n- **Feature structure**:\n\n FeaturesDict({\n 'date': Text(shape=(), dtype=string),\n 'polarity': int32,\n 'query': Text(shape=(), dtype=string),\n 'text': Text(shape=(), dtype=string),\n 'user': Text(shape=(), dtype=string),\n })\n\n- **Feature documentation**:\n\n| Feature | Class | Shape | Dtype | Description |\n|----------|--------------|-------|--------|-------------|\n| | FeaturesDict | | | |\n| date | Text | | string | |\n| polarity | Tensor | | int32 | |\n| query | Text | | string | |\n| text | Text | | string | |\n| user | Text | | string | |\n\n- **Supervised keys** (See\n [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):\n `('text', 'polarity')`\n\n- **Figure**\n ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):\n Not supported.\n\n- **Examples**\n ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\n- **Citation**:\n\n @ONLINE {Sentiment140,\n author = \"Go, Alec and Bhayani, Richa and Huang, Lei\",\n title = \"Twitter Sentiment Classification using Distant Supervision\",\n year = \"2009\",\n url = \"http://help.sentiment140.com/home\"\n }"]]