Warning: This API is deprecated and will be removed in a future version of TensorFlow after the replacement is stable.

UnicodeEncode

public final class UnicodeEncode

Encode a tensor of ints into unicode strings.

Returns a vector of strings, where `output[i]` is constructed by encoding the Unicode codepoints in `input_values[input_splits[i]:input_splits[i+1]]` using `output_encoding`.

---

Example:

input_values = [72, 101, 108, 108, 111, 87, 111, 114, 108, 100]
 input_splits = [0, 5, 10]
 output_encoding = 'UTF-8'
 
 output = ['Hello', 'World']

Nested Classes

class UnicodeEncode.Options Optional attributes for UnicodeEncode

Public Methods

Output<String>	asOutput() Returns the symbolic handle of a tensor.
static <T extends Number> UnicodeEncode	create(Scope scope, Operand<Integer> inputValues, Operand<T> inputSplits, String outputEncoding, Options... options) Factory method to create a class wrapping a new UnicodeEncode operation.
static UnicodeEncode.Options	errors(String errors)
Output<String>	output() The 1-D Tensor of strings encoded from the provided unicode codepoints.
static UnicodeEncode.Options	replacementChar(Long replacementChar)

Inherited Methods

From class org.tensorflow.op.PrimitiveOp

final boolean	equals(Object obj)
final int	hashCode()
Operation	op() Returns the underlying `Operation`
final String	toString()

From class java.lang.Object

boolean	equals(Object arg0)
final Class<?>	getClass()
int	hashCode()
final void	notify()
final void	notifyAll()
String	toString()
final void	wait(long arg0, int arg1)
final void	wait(long arg0)
final void	wait()

From interface org.tensorflow.Operand

abstract Output<String>

asOutput()

Returns the symbolic handle of a tensor.

Public Methods

public Output<String> asOutput ()

Returns the symbolic handle of a tensor.

Inputs to TensorFlow operations are outputs of another TensorFlow operation. This method is used to obtain a symbolic handle that represents the computation of the input.

public static UnicodeEncode create (Scope scope, Operand<Integer> inputValues, Operand<T> inputSplits, String outputEncoding, Options... options)

Factory method to create a class wrapping a new UnicodeEncode operation.

Parameters

scope	current scope
inputValues	A 1D tensor containing the unicode codepoints that should be encoded.
inputSplits	A 1D tensor specifying how the unicode codepoints should be split into strings. In particular, `output[i]` is constructed by encoding the codepoints in the slice `input_values[input_splits[i]:input_splits[i+1]]`.
outputEncoding	Unicode encoding of the output strings. Valid encodings are: `"UTF-8", "UTF-16-BE", and "UTF-32-BE"`.
options	carries optional attributes values

Returns

a new instance of UnicodeEncode

public static UnicodeEncode.Options errors (String errors)

Parameters

errors	Error handling policy when there is invalid formatting found in the input. The value of 'strict' will cause the operation to produce a InvalidArgument error on any invalid input formatting. A value of 'replace' (the default) will cause the operation to replace any invalid formatting in the input with the `replacement_char` codepoint. A value of 'ignore' will cause the operation to skip any invalid formatting in the input and produce no corresponding output character.

errors

Error handling policy when there is invalid formatting found in the input. The value of 'strict' will cause the operation to produce a InvalidArgument error on any invalid input formatting. A value of 'replace' (the default) will cause the operation to replace any invalid formatting in the input with the `replacement_char` codepoint. A value of 'ignore' will cause the operation to skip any invalid formatting in the input and produce no corresponding output character.

public Output<String> output ()

The 1-D Tensor of strings encoded from the provided unicode codepoints.

public static UnicodeEncode.Options replacementChar (Long replacementChar)

Parameters

replacementChar	The replacement character codepoint to be used in place of any invalid formatting in the input when `errors='replace'`. Any valid unicode codepoint may be used. The default value is the default unicode replacement character is 0xFFFD (U+65533).