Warning: This project is deprecated. TensorFlow Addons has stopped development,
The project will only be providing minimal maintenance releases until May 2024. See the full
announcement here or on
github.
tfa.losses.SigmoidFocalCrossEntropy
Stay organized with collections
Save and categorize content based on your preferences.
Implements the focal loss function.
tfa.losses.SigmoidFocalCrossEntropy(
from_logits: bool = False,
alpha: tfa.types.FloatTensorLike
= 0.25,
gamma: tfa.types.FloatTensorLike
= 2.0,
reduction: str = tf.keras.losses.Reduction.NONE,
name: str = 'sigmoid_focal_crossentropy'
)
Focal loss was first introduced in the RetinaNet paper
(https://arxiv.org/pdf/1708.02002.pdf). Focal loss is extremely useful for
classification when you have highly imbalanced classes. It down-weights
well-classified examples and focuses on hard examples. The loss value is
much higher for a sample which is misclassified by the classifier as compared
to the loss value corresponding to a well-classified example. One of the
best use-cases of focal loss is its usage in object detection where the
imbalance between the background class and other classes is extremely high.
Usage:
fl = tfa.losses.SigmoidFocalCrossEntropy()
loss = fl(
y_true = [[1.0], [1.0], [0.0]],y_pred = [[0.97], [0.91], [0.03]])
loss
<tf.Tensor: shape=(3,), dtype=float32, numpy=array([6.8532745e-06, 1.9097870e-04, 2.0559824e-05],
dtype=float32)>
Usage with tf.keras
API:
model = tf.keras.Model()
model.compile('sgd', loss=tfa.losses.SigmoidFocalCrossEntropy())
Args |
alpha
|
balancing factor, default value is 0.25.
|
gamma
|
modulating factor, default value is 2.0.
|
Returns |
Weighted loss float Tensor . If reduction is NONE , this has the same
shape as y_true ; otherwise, it is scalar.
|
Raises |
ValueError
|
If the shape of sample_weight is invalid or value of
gamma is less than zero.
|
Methods
from_config
@classmethod
from_config(
config
)
Instantiates a Loss
from its config (output of get_config()
).
Args |
config
|
Output of get_config() .
|
get_config
View source
get_config()
Returns the config dictionary for a Loss
instance.
__call__
__call__(
y_true, y_pred, sample_weight=None
)
Invokes the Loss
instance.
Args |
y_true
|
Ground truth values. shape = [batch_size, d0, .. dN] , except
sparse loss functions such as sparse categorical crossentropy where
shape = [batch_size, d0, .. dN-1]
|
y_pred
|
The predicted values. shape = [batch_size, d0, .. dN]
|
sample_weight
|
Optional sample_weight acts as a coefficient for the
loss. If a scalar is provided, then the loss is simply scaled by the
given value. If sample_weight is a tensor of size [batch_size] ,
then the total loss for each sample of the batch is rescaled by the
corresponding element in the sample_weight vector. If the shape of
sample_weight is [batch_size, d0, .. dN-1] (or can be
broadcasted to this shape), then each loss element of y_pred is
scaled by the corresponding value of sample_weight . (Note
ondN-1 : all loss functions reduce by 1 dimension, usually
axis=-1.)
|
Returns |
Weighted loss float Tensor . If reduction is NONE , this has
shape [batch_size, d0, .. dN-1] ; otherwise, it is scalar. (Note
dN-1 because all loss functions reduce by 1 dimension, usually
axis=-1.)
|
Raises |
ValueError
|
If the shape of sample_weight is invalid.
|
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2023-07-12 UTC.
[null,null,["Last updated 2023-07-12 UTC."],[],[],null,["# tfa.losses.SigmoidFocalCrossEntropy\n\n\u003cbr /\u003e\n\n**Note:** `tensorflow-addons` is deprecated, use [`keras-cv`](https://keras.io/guides/keras_cv/)'s [`keras_cv.losses.FocalLoss`](https://github.com/keras-team/keras-cv/blob/master/keras_cv/losses/focal.py) instead. \n\n|---------------------------------------------------------------------------------------------------------------------------|\n| [View source on GitHub](https://github.com/tensorflow/addons/blob/v0.20.0/tensorflow_addons/losses/focal_loss.py#L25-L81) |\n\nImplements the focal loss function. \n\n tfa.losses.SigmoidFocalCrossEntropy(\n from_logits: bool = False,\n alpha: ../../tfa/types/FloatTensorLike = 0.25,\n gamma: ../../tfa/types/FloatTensorLike = 2.0,\n reduction: str = tf.keras.losses.Reduction.NONE,\n name: str = 'sigmoid_focal_crossentropy'\n )\n\nFocal loss was first introduced in the RetinaNet paper\n(\u003chttps://arxiv.org/pdf/1708.02002.pdf\u003e). Focal loss is extremely useful for\nclassification when you have highly imbalanced classes. It down-weights\nwell-classified examples and focuses on hard examples. The loss value is\nmuch higher for a sample which is misclassified by the classifier as compared\nto the loss value corresponding to a well-classified example. One of the\nbest use-cases of focal loss is its usage in object detection where the\nimbalance between the background class and other classes is extremely high.\n\n#### Usage:\n\n fl = tfa.losses.SigmoidFocalCrossEntropy()\n loss = fl(\n y_true = [[1.0], [1.0], [0.0]],y_pred = [[0.97], [0.91], [0.03]])\n loss\n \u003ctf.Tensor: shape=(3,), dtype=float32, numpy=array([6.8532745e-06, 1.9097870e-04, 2.0559824e-05],\n dtype=float32)\u003e\n\nUsage with [`tf.keras`](https://www.tensorflow.org/api_docs/python/tf/keras) API: \n\n model = tf.keras.Model()\n model.compile('sgd', loss=tfa.losses.SigmoidFocalCrossEntropy())\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n| Args ---- ||\n|---------|------------------------------------------|\n| `alpha` | balancing factor, default value is 0.25. |\n| `gamma` | modulating factor, default value is 2.0. |\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n| Returns ------- ||\n|---|---|\n| Weighted loss float `Tensor`. If `reduction` is `NONE`, this has the same shape as `y_true`; otherwise, it is scalar. ||\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n| Raises ------ ||\n|--------------|-----------------------------------------------------------------------------------|\n| `ValueError` | If the shape of `sample_weight` is invalid or value of `gamma` is less than zero. |\n\n\u003cbr /\u003e\n\nMethods\n-------\n\n### `from_config`\n\n @classmethod\n from_config(\n config\n )\n\nInstantiates a `Loss` from its config (output of `get_config()`).\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n| Args ||\n|----------|---------------------------|\n| `config` | Output of `get_config()`. |\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n| Returns ||\n|---|---|\n| A `Loss` instance. ||\n\n\u003cbr /\u003e\n\n### `get_config`\n\n[View source](https://github.com/tensorflow/addons/blob/v0.20.0/tensorflow_addons/utils/keras_utils.py#L63-L68) \n\n get_config()\n\nReturns the config dictionary for a `Loss` instance.\n\n### `__call__`\n\n __call__(\n y_true, y_pred, sample_weight=None\n )\n\nInvokes the `Loss` instance.\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n| Args ||\n|-----------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|\n| `y_true` | Ground truth values. shape = `[batch_size, d0, .. dN]`, except sparse loss functions such as sparse categorical crossentropy where shape = `[batch_size, d0, .. dN-1]` |\n| `y_pred` | The predicted values. shape = `[batch_size, d0, .. dN]` |\n| `sample_weight` | Optional `sample_weight` acts as a coefficient for the loss. If a scalar is provided, then the loss is simply scaled by the given value. If `sample_weight` is a tensor of size `[batch_size]`, then the total loss for each sample of the batch is rescaled by the corresponding element in the `sample_weight` vector. If the shape of `sample_weight` is `[batch_size, d0, .. dN-1]` (or can be broadcasted to this shape), then each loss element of `y_pred` is scaled by the corresponding value of `sample_weight`. (Note on`dN-1`: all loss functions reduce by 1 dimension, usually axis=-1.) |\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n| Returns ||\n|---|---|\n| Weighted loss float `Tensor`. If `reduction` is `NONE`, this has shape `[batch_size, d0, .. dN-1]`; otherwise, it is scalar. (Note `dN-1` because all loss functions reduce by 1 dimension, usually axis=-1.) ||\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n\u003cbr /\u003e\n\n| Raises ||\n|--------------|---------------------------------------------|\n| `ValueError` | If the shape of `sample_weight` is invalid. |\n\n\u003cbr /\u003e"]]