tf.keras.preprocessing.text.Tokenizer

DEPRECATED.

tf.keras.preprocessing.text.Tokenizer(
    num_words=None,
    filters='!"#$%&()*+,-./:;<=>?@[\\]^_`{|}~\t\n',
    lower=True,
    split=' ',
    char_level=False,
    oov_token=None,
    analyzer=None,
    **kwargs
)

Used in the notebooks

Used in the tutorials
Wiki Talk Comments Toxicity Prediction

Methods

`fit_on_sequences`

View source

fit_on_sequences(
    sequences
)

`fit_on_texts`

View source

fit_on_texts(
    texts
)

`get_config`

View source

get_config()

`sequences_to_matrix`

View source

sequences_to_matrix(
    sequences, mode='binary'
)

`sequences_to_texts`

View source

sequences_to_texts(
    sequences
)

`sequences_to_texts_generator`

View source

sequences_to_texts_generator(
    sequences
)

`texts_to_matrix`

View source

texts_to_matrix(
    texts, mode='binary'
)

`texts_to_sequences`

View source

texts_to_sequences(
    texts
)

`texts_to_sequences_generator`

View source

texts_to_sequences_generator(
    texts
)

`to_json`

View source

to_json(
    **kwargs
)

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates. Some content is licensed under the numpy license.

Last updated 2024-06-07 UTC.