Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation

@article{Chen2018EncoderDecoderWA,
  title={Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation},
  author={Liang-Chieh Chen and Yukun Zhu and George Papandreou and Florian Schroff and Hartwig Adam},
  journal={ArXiv},
  year={2018},
  volume={abs/1802.02611}
}
Spatial pyramid pooling module or encode-decoder structure are used in deep neural networks for semantic segmentation task. [...] Key Method Specifically, our proposed model, DeepLabv3+, extends DeepLabv3 by adding a simple yet effective decoder module to refine the segmentation results especially along object boundaries.Expand
DUAL PYRAMIDS ENCODER-DECODER NETWORK FOR SEMANTIC SEGMENTATION IN GROUND AND AERIAL VIEW IMAGES
TLDR
This work proposed a dual pyramids encoder-decoder deep neural network (DPEDNet) to tackle semantic segmentation, which has achieved state-of-the-art performances on two challenging benchmark image datasets for both ground and aerial view scenes. Expand
Encoder–decoder with double spatial pyramid for semantic segmentation
TLDR
A network with an encoder–decoder architecture based on two proposed modules: global pyramid attention module (GPAM) and pyramid decoder module (PDM) that achieves a mean intersection over union score of 83.4% on PASCAL VOC 2012 dataset and 78.5% on Cityscapes dataset. Expand
Mixed spatial pyramid pooling for semantic segmentation
TLDR
This paper proposes a Mixed Spatial Pyramid Pooling (MSPP) module based on region-based average pooling and dilated convolution to obtain dense multi-level contextual priors and proposes a Global-Attention Fusion module to provide global context as guidance for low-level features. Expand
A Novel Upsampling and Context Convolution for Image Semantic Segmentation
TLDR
A novel dense upsampling convolution method based on a guided filter to effectively preserve the spatial information of the image in the network and a novel local context Convolution method that not only covers larger-scale objects in the scene but covers them densely for precise object boundary delineation. Expand
Multilevel feature fusion dilated convolutional network for semantic segmentation
Recently, convolutional neural network (CNN) has led to significant improvement in the field of computer vision, especially the improvement of the accuracy and speed of semantic segmentation tasks,Expand
DXNet: An Encoder-Decoder Architecture with XSPP for Semantic Image Segmentation in Street Scenes
TLDR
This work proposes a Xiphoid Spatial Pyramid Pooling method integrated with detailed information that performs better in the case of occlusion, and the mean intersection-over-union score of the model outperforms some representative works. Expand
Deeplab_v3_plus-net for Image Semantic Segmentation with Channel Compression
TLDR
This paper introduces how do the authors compress channel information and channel information fusion and presents a Channel Compression-Encoder-Decoder with Atrous Separable Convolution Net (CC-Deeplab_v3_plus). Expand
Dense Semantic Labeling with Atrous Spatial Pyramid Pooling and Decoder for High-Resolution Remote Sensing Imagery
TLDR
A more efficient fully convolutional network is proposed by combining the advantages from both atrous spatial pyramid pooling and encoder-decoder by utilizing the deep residual network followed by ASPP as the encoder and combines two scales of high-level features with corresponding low- level features as the decoder at the upsampling stage. Expand
Semantic segmentation using stride spatial pyramid pooling and dual attention decoder
TLDR
This paper proposes a structure called stride spatial pyramid pooling (SSPP) to capture multiscale semantic information from the high-level feature map and proposes a dual attention decoder consisting of a channel attention branch and a spatial attention branch that can result in a more “semantic” low-levelfeature map and a high- level feature map with more accurate spatial information. Expand
Attention-Guided Network for Semantic Video Segmentation
TLDR
An Attention-Guided Network (AGNet) is proposed to adaptively strengthen inter-frame and intra-frame features for more precise segmentation predictions, and sum the outputs of the two attention modules to further improve feature representations which contribute to more precise segmentsation predictions across temporal and spatial dimensions simultaneously. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 92 REFERENCES
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
TLDR
This work addresses the task of semantic image segmentation with Deep Learning and proposes atrous spatial pyramid pooling (ASPP), which is proposed to robustly segment objects at multiple scales, and improves the localization of object boundaries by combining methods from DCNNs and probabilistic graphical models. Expand
Rethinking Atrous Convolution for Semantic Image Segmentation
TLDR
The proposed `DeepLabv3' system significantly improves over the previous DeepLab versions without DenseCRF post-processing and attains comparable performance with other state-of-art models on the PASCAL VOC 2012 semantic image segmentation benchmark. Expand
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
TLDR
Quantitative assessments show that SegNet provides good performance with competitive inference time and most efficient inference memory-wise as compared to other architectures, including FCN and DeconvNet. Expand
Understanding Convolution for Semantic Segmentation
TLDR
DUC is designed to generate pixel-level prediction, which is able to capture and decode more detailed information that is generally missing in bilinear upsampling, and a hybrid dilated convolution (HDC) framework in the encoding phase is proposed. Expand
Densely connected deconvolutional network for semantic segmentation
TLDR
This work proposes a Densely Connected Deconvolutional Network (DCDN) for semantic segmentation that achieves better performance than some state-of-the-art methods without using any post-processing, pretrained model, nor temporal information, whilst requiring less parameters. Expand
Stacked Deconvolutional Network for Semantic Segmentation
TLDR
This work proposes a Stacked Deconvolutional Network (SDN) for semantic segmentation and achieves the new state-ofthe- art results on four datasets, including PASCAL VOC 2012, CamVid, GATECH, COCO Stuff. Expand
Attention to Scale: Scale-Aware Semantic Image Segmentation
TLDR
An attention mechanism that learns to softly weight the multi-scale features at each pixel location is proposed, which not only outperforms averageand max-pooling, but allows us to diagnostically visualize the importance of features at different positions and scales. Expand
RefineNet: Multi-path Refinement Networks for High-Resolution Semantic Segmentation
TLDR
RefineNet is presented, a generic multi-path refinement network that explicitly exploits all the information available along the down-sampling process to enable high-resolution prediction using long-range residual connections and introduces chained residual pooling, which captures rich background context in an efficient manner. Expand
Fully Convolutional Networks for Semantic Segmentation
TLDR
It is shown that convolutional networks by themselves, trained end- to-end, pixels-to-pixels, improve on the previous best result in semantic segmentation. Expand
Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation
TLDR
This work shows how to improve semantic segmentation through the use of contextual information, specifically, ' patch-patch' context between image regions, and 'patch-background' context, and formulate Conditional Random Fields with CNN-based pairwise potential functions to capture semantic correlations between neighboring patches. Expand
...
1
2
3
4
5
...