Corpus ID: 226278391

Self Supervised Learning for Object Localisation in 3D Tomographic Images

  title={Self Supervised Learning for Object Localisation in 3D Tomographic Images},
  author={Yaroslav Zharov and Alexey Ershov and Tilo Baumbach},
While a lot of work is dedicated to self-supervised learning, most of it is dealing with 2D images of natural scenes and objects. In this paper, we focus on \textit{volumetric} images obtained by means of the X-Ray Computed Tomography (CT). We describe two pretext training tasks which are designed taking into account the specific properties of volumetric data. We propose two ways to transfer a trained network to the downstream task of object localization with a zero amount of manual markup… Expand

Figures from this paper


Multi-Task Self-Supervised Object Detection via Recycling of Bounding Box Annotations
A novel object detection approach that takes advantage of both multi-task learning (MTL) and self-supervised learning (SSL) to improve the accuracy of object detection and empirically validate that this approach effectively improves detection performance on various architectures and datasets. Expand
Unsupervised Representation Learning by Predicting Image Rotations
This work proposes to learn image features by training ConvNets to recognize the 2d rotation that is applied to the image that it gets as input, and demonstrates both qualitatively and quantitatively that this apparently simple task actually provides a very powerful supervisory signal for semantic feature learning. Expand
Albumentations: fast and flexible image augmentations
Albumentations is presented, a fast and flexible open source library for image augmentation with many various image transform operations available that is also an easy-to-use wrapper around other augmentation libraries. Expand
Weakly Supervised Instance Segmentation using the Bounding Box Tightness Prior
The proposed deep model integrates MIL into a fully supervised instance segmentation network, and can be derived by the objective consisting of two terms, i.e., the unary term and the pairwise term. Expand
Unsupervised Learning of Visual Representations by Solving Jigsaw Puzzles
A novel unsupervised learning approach to build features suitable for object detection and classification and to facilitate the transfer of features to other tasks, the context-free network (CFN), a siamese-ennead convolutional neural network is introduced. Expand
Learning Features by Watching Objects Move
Inspired by the human visual system, low-level motion-based grouping cues can be used to learn an effective visual representation that significantly outperforms previous unsupervised approaches across multiple settings, especially when training data for the target task is scarce. Expand
Self-Supervised Feature Learning by Learning to Spot Artifacts
  • S. Jenni, P. Favaro
  • Computer Science
  • 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
  • 2018
A novel self-supervised learning method based on adversarial training to train a discriminator network to distinguish real images from images with synthetic artifacts, and then to extract features from its intermediate layers that can be transferred to other data domains and tasks. Expand
Self-supervised Visual Feature Learning with Deep Neural Networks: A Survey
An extensive review of deep learning-based self-supervised general visual feature learning methods from images or videos as a subset of unsupervised learning methods to learn general image and video features from large-scale unlabeled data without using any human-annotated labels is provided. Expand
Deep Residual Learning for Image Recognition
This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth. Expand
Quantitative morphometric analysis of adult teleost fish by X-ray computed tomography
This work reports a complete pipeline of high-throughput 3D data acquisition and image analysis, including tissue preparation and contrast enhancement for micro-CT imaging down to cellular resolution, automated data processing and organ or tissue segmentation that is applicable to comparative 3D morphometrics of small vertebrates. Expand