Mask guided attention for fine-grained patchy image classification

  title={Mask guided attention for fine-grained patchy image classification},
  author={Jun Wang and Xiaohan Yu and Yongsheng Gao},
In this work, we present a novel mask guided attention (MGA) method for fine-grained patchy image classification. The key challenge of fine-grained patchy image classification lies in two folds, ultra-fine-grained inter-category variances among objects and very few data available for training. This motivates us to consider employing more useful supervision signal to train a discriminative model within limited training samples. Specifically, the proposed MGA integrates a pre-trained semantic… Expand

Figures and Tables from this paper

Feature Fusion Vision Transformer Fine-Grained Visual Categorization
This work designs a novel token selection module called mutual attention weight selection (MAWS) to guide the network effectively and efficiently towards selecting discriminative tokens without introducing extra parameters and proves the effectiveness of FFVT on three benchmarks where FFVT achieves the state-of-the-art performance. Expand
Mask-Guided Feature Extraction and Augmentation for Ultra-Fine-Grained Visual Categorization
  • Zicheng Pan, Xiaohan Yu, Miaohua Zhang, Yongsheng Gao
  • Computer Science
  • ArXiv
  • 2021
A mask-guided feature extraction and feature augmentation method to extract discriminative and informative regions of images which are then used to augment the original feature map and demonstrate the effectiveness of the proposed method both visually and quantitatively. Expand
EAR-NET: Error Attention Refining Network For Retinal Vessel Segmentation
A novel error attention refining network (ERANet) that is capable of learning and predicting the potential false predictions in a two-stage manner for effective retinal vessel segmentation and achieves state-of-the-art performance on two common retinal blood vessel datasets. Expand


The application of two-level attention models in deep convolutional neural network for fine-grained image classification
This paper proposes to apply visual attention to fine-grained classification task using deep neural network and achieves the best accuracy under the weakest supervision condition, and is competitive against other methods that rely on additional annotations. Expand
Diversified Visual Attention Networks for Fine-Grained Object Classification
A diversified visual attention network (DVAN) is proposed to address the problem of fine-grained object classification, which substantially relieves the dependency on strongly supervised information for learning to localize discriminative regions com-pared with attention-less models. Expand
Multi-Attention Multi-Class Constraint for Fine-grained Image Recognition
A novel attention-based convolutional neural network (CNN) which regulates multiple object parts among different input images, which can be easily trained end-to-end, and is highly efficient which requires only one training stage. Expand
MaskCOV: A random mask covariance network for ultra-fine-grained visual categorization
Experimental results of the proposed novel random mask covariance network (MaskCOV), which integrates an auxiliary self-supervised learning module with a powerful in-image data augmentation scheme for the ultra-FGVC, demonstrate its superiority and potential of MaskCOV concept, which pushes research boundary forward from the fine-grained to theUltra-fine-Grained visual categorization. Expand
Part-Stacked CNN for Fine-Grained Visual Categorization
A novel Part-Stacked CNN architecture that explicitly explains the finegrained recognition process by modeling subtle differences from object parts is proposed, from multiple perspectives of classification accuracy, model interpretability, and efficiency. Expand
Learning Multi-attention Convolutional Neural Network for Fine-Grained Image Recognition
This paper proposes a novel part learning approach by a multi-attention convolutional neural network (MA-CNN), where part generation and feature learning can reinforce each other, and shows the best performances on three challenging published fine-grained datasets. Expand
Object-Part Attention Model for Fine-Grained Image Classification
The object-part attention model (OPAM) for weakly supervised fine-grained image classification is proposed, which avoids the heavy labor consumption of labeling and compares with more than ten state-of-the-art methods on four widely-used datasets. Expand
Mask-Guided Contrastive Attention Model for Person Re-identification
  • Chunfeng Song, Yan Huang, Wanli Ouyang, Liang Wang
  • Computer Science
  • 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
  • 2018
This paper introduces the binary segmentation masks to construct synthetic RGB-Mask pairs as inputs, then designs a mask-guided contrastive attention model (MGCAM) to learn features separately from the body and background regions, and proposes a novel region-level triplet loss to restrain the features learnt from different regions. Expand
Patchy Image Structure Classification Using Multi-Orientation Region Transform
Experimental results demonstrate the effectiveness and superiority of the proposed MORT over the state-of-the-art methods in classifying patchy image structures and can be extended to combine with the deep convolutional neural network techniques, for further enhancement of classification accuracy. Expand
Learning deep part-aware embedding for person retrieval
An improved triplet loss is introduced such that the global feature representations from the same identity are closely clustered and a localization branch is proposed to automatically localize those discriminative person-wise parts or regions, only using identity labels in a weakly supervised manner. Expand