• Corpus ID: 229153995

Meticulous Object Segmentation

  title={Meticulous Object Segmentation},
  author={Chenglin Yang and Yilin Wang and Jianming Zhang and He Zhang and Zhe L. Lin and Alan Loddon Yuille},
Compared with common image segmentation tasks targeted at low-resolution images, higher resolution detailed image segmentation receives much less attention. In this paper, we propose and study a task named Meticulous Object Segmentation (MOS), which is focused on segmenting well-defined foreground objects with elaborate shapes in high resolution images (e.g. 2k - 4k). To this end, we propose the MeticulousNet which leverages a dedicated decoder to capture the object boundary details… 
High Quality Segmentation for Ultra High-resolution Images
The Continuous Refinement Model (CRM) continuously aligns the feature map with the refinement target and aggregates features to reconstruct these images’ details and shows its significant generalization ability to fill the resolution gap between low-resolution training images and ultra high-resolution testing ones.
Highly Accurate Dichotomous Image Segmentation
A simple intermediate supervision baseline (ISNet) is introduced using both feature-level and mask-level guidance for DIS model training, and a new metric called human correction efforts (HCE) which approximates the number of mouse clicking operations required to correct the false positives and false negatives is designed.


CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement
This paper presents quantitative and qualitative studies on different datasets to show that CascadePSP can reveal pixel-accurate segmentation boundaries using the novel refinement module without any finetuning.
CANet: Class-Agnostic Segmentation Networks With Iterative Refinement and Attentive Few-Shot Learning
Canet is presented, a class-agnostic segmentation network that performs few-shot segmentation on new classes with only a few annotated images available, and introduces an attention mechanism to effectively fuse information from multiple support examples under the setting of k-shot learning.
Towards High-Resolution Salient Object Detection
Experiments illustrate that the proposed novel approach outperforms existing state-of-the-art methods on high-resolution saliency datasets by a large margin, and achieves comparable or even better performance than them on some widely used saliency benchmarks.
PointRend: Image Segmentation As Rendering
The PointRend (Point-based Rendering) neural network module is presented: a module that performs point-based segmentation predictions at adaptively selected locations based on an iterative subdivision algorithm that enables output resolutions that are otherwise impractical in terms of memory or computation compared to existing approaches.
Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs
This work brings together methods from DCNNs and probabilistic graphical models for addressing the task of pixel-level classification by combining the responses at the final DCNN layer with a fully connected Conditional Random Field (CRF).
Saliency Optimization from Robust Background Detection
This work proposes a robust background measure, called boundary connectivity, which characterizes the spatial layout of image regions with respect to image boundaries and is much more robust and presents unique benefits that are absent in previous saliency measures.
RefineNet: Multi-path Refinement Networks for High-Resolution Semantic Segmentation
RefineNet is presented, a generic multi-path refinement network that explicitly exploits all the information available along the down-sampling process to enable high-resolution prediction using long-range residual connections and introduces chained residual pooling, which captures rich background context in an efficient manner.
Deep High-Resolution Representation Learning for Visual Recognition
The superiority of the proposed HRNet in a wide range of applications, including human pose estimation, semantic segmentation, and object detection, is shown, suggesting that the HRNet is a stronger backbone for computer vision problems.
Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
This work extends DeepLabv3 by adding a simple yet effective decoder module to refine the segmentation results especially along object boundaries and applies the depthwise separable convolution to both Atrous Spatial Pyramid Pooling and decoder modules, resulting in a faster and stronger encoder-decoder network.
Learning to Detect Salient Objects with Image-Level Supervision
This paper develops a weakly supervised learning method for saliency detection using image-level tags only, which outperforms unsupervised ones with a large margin, and achieves comparable or even superior performance than fully supervised counterparts.