Going deeper with convolutions
- Christian Szegedy, Wei Liu, Andrew Rabinovich
- Computer ScienceComputer Vision and Pattern Recognition
- 16 September 2014
We propose a deep convolutional neural network architecture codenamed Inception that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition…
Caffe: Convolutional Architecture for Fast Feature Embedding
- Yangqing Jia, Evan Shelhamer, Trevor Darrell
- Computer ScienceACM Multimedia
- 20 June 2014
Caffe provides multimedia scientists and practitioners with a clean and modifiable framework for state-of-the-art deep learning algorithms and a collection of reference models for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
- Martín Abadi, Ashish Agarwal, Xiaoqiang Zheng
- Computer ScienceArXiv
- 14 March 2016
The TensorFlow interface and an implementation of that interface that is built at Google are described, which has been used for conducting research and for deploying machine learning systems into production across more than a dozen areas of computer science and other fields.
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
- Priya Goyal, Piotr Dollár, Kaiming He
- Computer ScienceArXiv
- 8 June 2017
This paper empirically show that on the ImageNet dataset large minibatches cause optimization difficulties, but when these are addressed the trained networks exhibit good generalization and enable training visual recognition models on internet-scale data with high efficiency.
DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition
- Jeff Donahue, Yangqing Jia, Trevor Darrell
- Computer ScienceInternational Conference on Machine Learning
- 5 October 2013
DeCAF, an open-source implementation of deep convolutional activation features, along with all associated network parameters, are released to enable vision researchers to be able to conduct experimentation with deep representations across a range of visual concept learning paradigms.
FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search
- Bichen Wu, Xiaoliang Dai, K. Keutzer
- Computer ScienceComputer Vision and Pattern Recognition
- 9 December 2018
This work proposes a differentiable neural architecture search (DNAS) framework that uses gradient-based methods to optimize ConvNet architectures, avoiding enumerating and training individual architectures separately as in previous methods.
MatchNet: Unifying feature and metric learning for patch-based matching
- Xufeng Han, Thomas Leung, Yangqing Jia, R. Sukthankar, A. Berg
- Computer ScienceComputer Vision and Pattern Recognition
- 7 June 2015
A unified approach to combining feature computation and similarity networks for training a patch matching system that improves accuracy over previous state-of-the-art results on patch matching datasets, while reducing the storage requirement for descriptors is confirmed.
Deep Convolutional Ranking for Multilabel Image Annotation
- Yunchao Gong, Yangqing Jia, Thomas Leung, Alexander Toshev, Sergey Ioffe
- Computer ScienceInternational Conference on Learning…
- 17 December 2013
It is shown that a significant performance gain could be obtained by combining convolutional architectures with approximate top-$k$ ranking objectives, as thye naturally fit the multilabel tagging problem.
A category-level 3-D object dataset: Putting the Kinect to work
- Allison Janoch, Sergey Karayev, Trevor Darrell
- Computer ScienceIEEE International Conference on Computer Vision
- 1 November 2011
A dataset of color and depth image pairs, gathered in real domestic and office environments, establishes baseline performance in a PASCAL VOC-style detection task, and suggests two ways that inferred world size of the object may be used to improve detection.
Applied Machine Learning at Facebook: A Datacenter Infrastructure Perspective
- Kim M. Hazelwood, Sarah Bird, Xiaodong Wang
- Computer ScienceInternational Symposium on High-Performance…
- 1 February 2018
The hardware and software infrastructure that supports machine learning at global scale is described, leveraging both GPU and CPU platforms for training and abundant CPU capacity for real-time inference.
...
...