Conozca lo último en aprendizaje automático, IA generativa y más en el Simposio WiML 2023.

Se usó la API de Cloud Translation para traducir esta página.

lm1b

Referencias:

Texto sin formato

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:lm1b/plain_text')

A benchmark corpus to be used for measuring progress in statistical language modeling. This has almost one billion words in the training data.

Separar	Ejemplos
`'test'`	306688
`'train'`	30301028

{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}

Referencias:

Utilice el siguiente comando para cargar este conjunto de datos en TFDS:

ds = tfds.load('huggingface:lm1b/plain_text')

A benchmark corpus to be used for measuring progress in statistical language modeling. This has almost one billion words in the training data.

Separar	Ejemplos
`'test'`	306688
`'train'`	30301028

{
    "text": {
        "dtype": "string",
        "id": null,
        "_type": "Value"
    }
}