XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks
- Mohammad Rastegari, Vicente Ordonez, Joseph Redmon, Ali Farhadi
- Computer ScienceEuropean Conference on Computer Vision
- 16 March 2016
The Binary-Weight-Network version of AlexNet is compared with recent network binarization methods, BinaryConnect and BinaryNets, and outperform these methods by large margins on ImageNet, more than \(16\,\%\) in top-1 accuracy.
ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation
- Sachin Mehta, Mohammad Rastegari, A. Caspi, L. Shapiro, Hannaneh Hajishirzi
- Computer ScienceEuropean Conference on Computer Vision
- 19 March 2018
A fast and efficient convolutional neural network, ESPNet, for semantic segmentation of high resolution images under resource constraints, which outperforms all the current efficient CNN networks such as MobileNet, ShuffleNet, and ENet on both standard metrics and the newly introduced performance metrics that measure efficiency on edge devices.
OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge
- Kenneth Marino, Mohammad Rastegari, Ali Farhadi, Roozbeh Mottaghi
- Computer ScienceComputer Vision and Pattern Recognition
- 31 May 2019
This paper addresses the task of knowledge-based visual question answering and provides a benchmark, called OK-VQA, where the image content is not sufficient to answer the questions, encouraging methods that rely on external knowledge resources.
What’s Hidden in a Randomly Weighted Neural Network?
- Vivek Ramanujan, Mitchell Wortsman, Aniruddha Kembhavi, Ali Farhadi, Mohammad Rastegari
- Computer ScienceComputer Vision and Pattern Recognition
- 29 November 2019
It is empirically show that as randomly weighted neural networks with fixed weights grow wider and deeper, an ``untrained subnetwork" approaches a network with learned weights in accuracy.
ESPNetv2: A Light-Weight, Power Efficient, and General Purpose Convolutional Neural Network
- Sachin Mehta, Mohammad Rastegari, L. Shapiro, Hannaneh Hajishirzi
- Computer ScienceComputer Vision and Pattern Recognition
- 28 November 2018
We introduce a light-weight, power efficient, and general purpose convolutional neural network, ESPNetv2, for modeling visual and sequential data. Our network uses group point-wise and depth-wise…
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer
- Sachin Mehta, Mohammad Rastegari
- Computer ScienceInternational Conference on Learning…
- 5 October 2021
This paper introduces MobileViT, a light-weight and general-purpose vision transformer for mobile devices that significantly outperforms CNNand ViT-based networks across different tasks and datasets.
Attribute Discovery via Predictable Discriminative Binary Codes
- Mohammad Rastegari, Ali Farhadi, D. Forsyth
- Computer ScienceEuropean Conference on Computer Vision
- 7 October 2012
In this work, each image claims its own code in a way that maintains discrimination while being predictable from visual data, and this method outperforms state-of-the-art binary code methods on this large scale dataset.
IQA: Visual Question Answering in Interactive Environments
- Daniel Gordon, Aniruddha Kembhavi, Mohammad Rastegari, Joseph Redmon, D. Fox, Ali Farhadi
- Computer ScienceIEEE/CVF Conference on Computer Vision and…
- 9 December 2017
The Hierarchical Interactive Memory Network (HIMN), consisting of a factorized set of controllers, allowing the system to operate at multiple levels of temporal abstraction, is proposed, and outperforms popular single controller based methods on IQUAD V1.
Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning
- Mitchell Wortsman, Kiana Ehsani, Mohammad Rastegari, Ali Farhadi, Roozbeh Mottaghi
- Computer ScienceComputer Vision and Pattern Recognition
- 3 December 2018
A self-adaptive visual navigation method (SAVN) which learns to adapt to new environments without any explicit supervision which shows major improvements in both success rate and SPL for visual navigation in novel scenes.
Predictable Dual-View Hashing
- Mohammad Rastegari, Jonghyun Choi, Shobeir Fakhraei, Hal Daumé, L. Davis
- Computer ScienceInternational Conference on Machine Learning
- 16 June 2013
We propose a Predictable Dual-View Hashing (PDH) algorithm which embeds proximity of data samples in the original spaces. We create a cross-view hamming space with the ability to compare information…
...
...