reddit_tifu

सन्दर्भ:

छोटा

इस डेटासेट को TFDS में लोड करने के लिए निम्नलिखित कमांड का उपयोग करें:

ds = tfds.load('huggingface:reddit_tifu/short')
  • विवरण :
Reddit dataset, where TIFU denotes the name of subbreddit /r/tifu.
As defined in the publication, styel "short" uses title as summary and
"long" uses tldr as summary.

Features includes:
  - document: post text without tldr.
  - tldr: tldr line.
  - title: trimmed title without tldr.
  - ups: upvotes.
  - score: score.
  - num_comments: number of comments.
  - upvote_ratio: upvote ratio.
  • लाइसेंस : कोई ज्ञात लाइसेंस नहीं
  • संस्करण : 1.1.0
  • विभाजन :
विभाजित करना उदाहरण
'train' 79740
  • विशेषताएँ :
{
    "ups": {
        "dtype": "float32",
        "id": null,
        "_type": "Value"
    },
    "num_comments": {
        "dtype": "float32",
        "id": null,
        "_type": "Value"
    },
    "upvote_ratio": {
        "dtype": "float32",
        "id": null,
        "_type": "Value"
    },
    "score": {
        "dtype": "float32",
        "id": null,
        "_type": "Value"
    },
    "documents": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "tldr": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

लंबा

इस डेटासेट को TFDS में लोड करने के लिए निम्नलिखित कमांड का उपयोग करें:

ds = tfds.load('huggingface:reddit_tifu/long')
  • विवरण :
Reddit dataset, where TIFU denotes the name of subbreddit /r/tifu.
As defined in the publication, styel "short" uses title as summary and
"long" uses tldr as summary.

Features includes:
  - document: post text without tldr.
  - tldr: tldr line.
  - title: trimmed title without tldr.
  - ups: upvotes.
  - score: score.
  - num_comments: number of comments.
  - upvote_ratio: upvote ratio.
  • लाइसेंस : कोई ज्ञात लाइसेंस नहीं
  • संस्करण : 1.1.0
  • विभाजन :
विभाजित करना उदाहरण
'train' 42139
  • विशेषताएँ :
{
    "ups": {
        "dtype": "float32",
        "id": null,
        "_type": "Value"
    },
    "num_comments": {
        "dtype": "float32",
        "id": null,
        "_type": "Value"
    },
    "upvote_ratio": {
        "dtype": "float32",
        "id": null,
        "_type": "Value"
    },
    "score": {
        "dtype": "float32",
        "id": null,
        "_type": "Value"
    },
    "documents": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "tldr": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    },
    "title": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}