Using Contrastive Learning and Pseudolabels to learn representations for Retail Product Image Classification

@inproceedings{Srivastava2021UsingCL,
  title={Using Contrastive Learning and Pseudolabels to learn representations for Retail Product Image Classification},
  author={Muktabh Mayank Srivastava},
  booktitle={VISIGRAPP},
  year={2021}
}
Retail product image classification problems are often few-shot classification problems, since retail product classes cannot exhibit the kind of variation across images that a cat, a dog, or a tree could have. Previous works have shown different methods to finetune Convolutional Neural Networks to achieve better classification accuracy on such datasets. In this work, we try to address the problem statement: Can we pretrain a Convolutional Neural Network backbone which yields good enough…
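A minimal PyTorch sketch of the general recipe the title points to: pretrain a CNN backbone with a contrastive objective, then reuse its frozen features for few-shot retail product classification. This is an illustration under assumptions, not the paper's actual code; the backbone choice (ResNeXt-50) and the ProjectionHead module are placeholders.

# Sketch: CNN backbone + projection head for contrastive pretraining.
# All names here are illustrative, not the paper's implementation.
import torch.nn as nn
import torchvision.models as models

class ProjectionHead(nn.Module):
    """Small MLP mapping backbone features to the contrastive embedding space."""
    def __init__(self, in_dim=2048, hidden_dim=512, out_dim=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, hidden_dim),
            nn.ReLU(inplace=True),
            nn.Linear(hidden_dim, out_dim),
        )

    def forward(self, x):
        return nn.functional.normalize(self.net(x), dim=1)

# Backbone: any ImageNet CNN works; ResNeXt-50 is used here only as an example.
backbone = models.resnext50_32x4d(weights="IMAGENET1K_V1")
feat_dim = backbone.fc.in_features
backbone.fc = nn.Identity()          # keep only the pooled feature vector
head = ProjectionHead(in_dim=feat_dim)

# After contrastive pretraining, the head is discarded and a simple classifier
# (e.g. nearest-class-mean or logistic regression) is trained on frozen backbone
# features for the few-shot retail product classes.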

References

Bag of Tricks for Retail Product Image Classification

A new neural network layer called the Local-Concepts-Accumulation (LCA) layer is introduced; it gives consistent gains across multiple datasets and increases the accuracy of fine-tuned convnets for retail product image classification by a large margin.

Fine-Grained Grocery Product Recognition by One-Shot Learning

A novel hybrid classification approach is presented that combines feature-based matching and one-shot deep learning with a coarse-to-fine strategy to effectively improve the accuracy of fine-grained grocery product recognition.

Domain invariant hierarchical embedding for grocery products recognition

Supervised Contrastive Learning

A novel training methodology is proposed that consistently outperforms cross-entropy on supervised learning tasks across different architectures and data augmentations; it modifies the batch contrastive loss, which has recently been shown to be very effective at learning powerful representations in the self-supervised setting.
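The supervised contrastive (SupCon) loss summarized above can be written compactly: samples sharing a label are treated as positives for each other, everything else as negatives. The function below is a simplified re-implementation of the published formulation, not the authors' reference code; the temperature value is illustrative.

# Simplified SupCon loss sketch (PyTorch).
import torch
import torch.nn.functional as F

def supcon_loss(embeddings, labels, temperature=0.07):
    """embeddings: (N, D) feature vectors; labels: (N,) integer class ids."""
    z = F.normalize(embeddings, dim=1)
    sim = z @ z.t() / temperature                      # pairwise similarities
    # exclude self-similarity from the denominator
    self_mask = torch.eye(len(z), dtype=torch.bool, device=z.device)
    sim = sim.masked_fill(self_mask, float("-inf"))
    # positives: same label, excluding the anchor itself
    pos_mask = (labels.unsqueeze(0) == labels.unsqueeze(1)) & ~self_mask
    log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)
    # average log-likelihood over each anchor's positives
    pos_count = pos_mask.sum(1).clamp(min=1)
    loss = -(log_prob.masked_fill(~pos_mask, 0.0)).sum(1) / pos_count
    return loss.mean()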

A Simple Framework for Contrastive Learning of Visual Representations

It is shown that the composition of data augmentations plays a critical role in defining effective predictive tasks, that introducing a learnable nonlinear transformation between the representation and the contrastive loss substantially improves the quality of the learned representations, and that contrastive learning benefits from larger batch sizes and more training steps compared to supervised learning.
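Two of the points this summary highlights, the composed augmentation pipeline that produces two views of each image and the contrastive (NT-Xent) loss applied to projected embeddings, can be sketched as follows. The augmentation strengths and temperature are illustrative values, not necessarily those used in the paper.

# SimCLR-style sketch: composed augmentations + NT-Xent loss (PyTorch).
import torch
import torch.nn.functional as F
from torchvision import transforms

# Composition of augmentations (crop + color distortion + blur); two
# independent draws of this pipeline give the two "views" of an image.
simclr_augment = transforms.Compose([
    transforms.RandomResizedCrop(224),
    transforms.RandomHorizontalFlip(),
    transforms.ColorJitter(0.8, 0.8, 0.8, 0.2),
    transforms.RandomGrayscale(p=0.2),
    transforms.GaussianBlur(kernel_size=23),
    transforms.ToTensor(),
])

def nt_xent_loss(z1, z2, temperature=0.5):
    """z1, z2: (N, D) projections of the two views of the same N images."""
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)        # (2N, D)
    sim = z @ z.t() / temperature
    sim.fill_diagonal_(float("-inf"))                          # drop self-pairs
    n = z1.size(0)
    # the positive for sample i is its other view: i <-> i + n
    targets = torch.cat([torch.arange(n, 2 * n), torch.arange(0, n)]).to(z.device)
    return F.cross_entropy(sim, targets)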

Self-Training With Noisy Student Improves ImageNet Classification

We present a simple self-training method that achieves 88.4% top-1 accuracy on ImageNet, which is 2.0% better than the state-of-the-art model that requires 3.5B weakly labeled Instagram images.
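A hedged sketch of the self-training loop described here: a teacher model assigns pseudolabels to unlabeled images, and a noised student is trained on labeled plus pseudolabeled data, then becomes the next teacher. `train_model`, `labeled_ds`, and `unlabeled_loader` are hypothetical stand-ins, and the confidence threshold is an illustrative value.

# Self-training / pseudolabeling loop sketch (PyTorch).
import torch

@torch.no_grad()
def make_pseudolabels(teacher, unlabeled_loader, threshold=0.8):
    """Keep only confident teacher predictions as pseudolabels."""
    teacher.eval()
    pseudo = []
    for images in unlabeled_loader:
        probs = teacher(images).softmax(dim=1)
        conf, labels = probs.max(dim=1)
        keep = conf >= threshold
        pseudo.extend(zip(images[keep], labels[keep]))
    return pseudo

# Iterated self-training: each student (trained with augmentation/dropout noise)
# becomes the next round's teacher. `train_model` is hypothetical.
# teacher = train_model(labeled_ds)
# for _ in range(3):
#     pseudo_ds = make_pseudolabels(teacher, unlabeled_loader)
#     student = train_model(labeled_ds, extra_data=pseudo_ds, noisy=True)
#     teacher = student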

Exploring the Limits of Weakly Supervised Pretraining

This paper presents a unique study of transfer learning with large convolutional networks trained to predict hashtags on billions of social media images, shows improvements on several image classification and object detection tasks, and reports the highest ImageNet-1k single-crop, top-1 accuracy to date.

Aggregated Residual Transformations for Deep Neural Networks

On the ImageNet-1K dataset, it is empirically shown that, even under the restricted condition of maintaining complexity, increasing cardinality improves classification accuracy and is more effective than going deeper or wider when capacity is increased.
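The notion of cardinality is commonly realized as grouped convolutions; the block below is a simplified ResNeXt-style bottleneck illustrating the idea, with channel widths and cardinality chosen as typical example values rather than taken from the paper's exact configuration.

# Aggregated residual transformation as a grouped-conv bottleneck (PyTorch sketch).
import torch
import torch.nn as nn

class ResNeXtBlock(nn.Module):
    def __init__(self, channels=256, bottleneck=128, cardinality=32):
        super().__init__()
        self.block = nn.Sequential(
            nn.Conv2d(channels, bottleneck, kernel_size=1, bias=False),
            nn.BatchNorm2d(bottleneck),
            nn.ReLU(inplace=True),
            # groups=cardinality splits the transform into 32 parallel paths
            nn.Conv2d(bottleneck, bottleneck, kernel_size=3, padding=1,
                      groups=cardinality, bias=False),
            nn.BatchNorm2d(bottleneck),
            nn.ReLU(inplace=True),
            nn.Conv2d(bottleneck, channels, kernel_size=1, bias=False),
            nn.BatchNorm2d(channels),
        )
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        return self.relu(x + self.block(x))    # residual connection

# x = torch.randn(2, 256, 56, 56); ResNeXtBlock()(x).shape -> (2, 256, 56, 56)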

Rethinking Pre-training and Self-training

Self-training works well on exactly the setup where pre-training does not (using ImageNet to help COCO), and on the PASCAL segmentation dataset, although pre-training does help significantly, self-training improves upon the pre-trained model.

Barlow Twins: Self-Supervised Learning via Redundancy Reduction

This work proposes an objective function that naturally avoids collapse by measuring the cross-correlation matrix between the outputs of two identical networks fed with distorted versions of a sample, and making it as close to the identity matrix as possible.
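The objective described here can be sketched directly: standardize the two embeddings across the batch, form their cross-correlation matrix, and penalize its distance from the identity. This is a simplified re-implementation; the off-diagonal weight is one commonly used value, not necessarily the paper's.

# Barlow Twins objective sketch (PyTorch).
import torch

def barlow_twins_loss(z1, z2, lambda_offdiag=5e-3):
    """z1, z2: (N, D) embeddings of two distorted views of the same N images."""
    n, d = z1.shape
    # standardize each feature dimension across the batch
    z1 = (z1 - z1.mean(0)) / (z1.std(0) + 1e-6)
    z2 = (z2 - z2.mean(0)) / (z2.std(0) + 1e-6)
    c = (z1.t() @ z2) / n                       # (D, D) cross-correlation matrix
    on_diag = (torch.diagonal(c) - 1).pow(2).sum()
    off_diag = (c - torch.diag_embed(torch.diagonal(c))).pow(2).sum()
    return on_diag + lambda_offdiag * off_diag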