amazon_us_reviews

Referencias:

Libros_v1_01

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Books_v1_01')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 6106719
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Relojes_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Watches_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 960872
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Electrodomésticos_de_cuidado_personal_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Personal_Care_Appliances_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 85981
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Electrónica_móvil_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Mobile_Electronics_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 104975
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Videojuegos_digitales_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Digital_Video_Games_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 145431
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Software_digital_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Digital_Software_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 102084
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Electrodomésticos_principales_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Major_Appliances_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 96901
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Tarjeta_regalo_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Gift_Card_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 149086
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Vídeo_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Video_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 380604
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Equipaje_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Luggage_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 348657
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Software_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Software_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 341931
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Video_Juegos_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Video_Games_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 1785997
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Muebles_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Furniture_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 792113
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Instrumentos_musicales_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Musical_Instruments_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 904765
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Compra_de_música_digital_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Digital_Music_Purchase_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 1688884
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Libros_v1_02

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Books_v1_02')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 3105520
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Inicio_Entretenimiento_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Home_Entertainment_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 705889
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Tienda de comestibles_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Grocery_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 2402458
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Al aire libre_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Outdoors_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 2302401
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Productos_para_mascotas_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Pet_Products_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 2643619
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Vídeo_DVD_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Video_DVD_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 5069140
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Ropa_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Apparel_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 5906333
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

PC_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/PC_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 6908554
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Herramientas_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Tools_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 1741100
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Joyería_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Jewelry_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 1767753
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Bebé_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Baby_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 1752932
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Inicio_Mejora_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Home_Improvement_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 2634781
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Cámara_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Camera_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 1801974
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Césped_y_jardín_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Lawn_and_Garden_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 2557288
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Productos_de_oficina_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Office_Products_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 2642434
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Electrónica_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Electronics_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 3093869
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Automotriz_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Automotive_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 3514942
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Descarga_vídeo_digital_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Digital_Video_Download_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 4057147
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Aplicaciones_móviles_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Mobile_Apps_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 5033376
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Zapatos_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Shoes_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 4366916
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Juguetes_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Toys_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 4864249
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Deportes_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Sports_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 4850360
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Cocina_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Kitchen_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 4880466
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Belleza_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Beauty_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 5115666
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Música_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Music_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 4751577
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Salud_Cuidado_Personal_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Health_Personal_Care_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 5331449
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Compra_de_libros_digitales_v1_01

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Digital_Ebook_Purchase_v1_01')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 5101693
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Inicio_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Home_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 6221559
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Inalámbrico_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Wireless_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 9002021
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Libros_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Books_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 10319090
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Compra_ebook_digital_v1_00

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:amazon_us_reviews/Digital_Ebook_Purchase_v1_00')
  • Descripción :
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.

Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).

Each Dataset contains the following columns:

- marketplace: 2 letter country code of the marketplace where the review was written.
- customer_id: Random identifier that can be used to aggregate reviews written by a single author.
- review_id: The unique ID of the review.
- product_id: The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id.
- product_parent: Random identifier that can be used to aggregate reviews for the same product.
- product_title: Title of the product.
- product_category: Broad product category that can be used to group reviews (also used to group the dataset into coherent parts).
- star_rating: The 1-5 star rating of the review.
- helpful_votes: Number of helpful votes.
- total_votes: Number of total votes the review received.
- vine: Review was written as part of the Vine program.
- verified_purchase: The review is on a verified purchase.
- review_headline: The title of the review.
- review_body: The review text.
- review_date: The date the review was written.
  • Licencia : Ninguna licencia conocida
  • Versión : 0.1.0
  • Divisiones :
Dividir Ejemplos
'train' 12520722
  • Características :
{
    "marketplace": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "customer_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_id": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_parent": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "product_category": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "star_rating": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "helpful_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "total_votes": {
        "dtype": "int32",
        "id": null,
        "_type": "Value"
    },
    "vine": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "verified_purchase": {
        "num_classes": 2,
        "names": [
            "N",
            "Y"
        ],
        "id": null,
        "_type": "ClassLabel"
    },
    "review_headline": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_body": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "review_date": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}