Input images are passed through backbone first. Decoder network is then
applied, and finally, segmentation head is applied on the output of the
decoder network. Layers such as ASPP should be part of decoder. Any feature
fusion is done as part of the segmentation head (i.e. deeplabv3+ feature
fusion is not part of the decoder, instead it is part of the segmentation
head). This way, different feature fusion techniques can be combined with
different backbones, and decoders.
Args
backbone
a backbone network.
decoder
a decoder network. E.g. FPN.
head
segmentation head.
mask_scoring_head
mask scoring head.
**kwargs
keyword arguments to be passed.
Attributes
checkpoint_items
Returns a dictionary of items to be additionally checkpointed.