• Publications
  • Influence
Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments
Most face databases have been created under controlled conditions to facilitate the study of specific parameters on the face recognition problem. These parameters include such variables as position, ...
  • 3,910
  • 1,015
ReferItGame: Referring to Objects in Photographs of Natural Scenes
In this paper we introduce a new game to crowd-source natural language referring expressions. By designing a two player game, we can both collect and verify referring expressions directly within the ...
  • 345
  • 92
Two-person interaction detection using body-pose features and multiple instance learning
Human activity recognition has potential to impact a wide range of applications from surveillance to human computer interfaces to content based video retrieval. Recently, the rapid development of ...
  • 289
  • 68
Modeling Context in Referring Expressions
Humans refer to objects in their environments all the time, especially in dialogue with other people. We explore generating and comprehending natural language referring expressions for objects in ...
  • 194
  • 63
Im2Text: Describing Images Using 1 Million Captioned Photographs
We develop and demonstrate automatic image description methods using a large captioned photo collection. One contribution is our technique for the automatic collection of this new dataset – ...
  • 495
  • 53
Shape matching and object recognition using low distortion correspondences
We approach recognition in the framework of deformable shape matching, relying on a new algorithm for finding correspondences between feature points. This algorithm sets up correspondence as an ...
  • 923
  • 49
Parsing clothing in fashion photographs
In this paper we demonstrate an effective method for parsing clothing in fashion photographs, an extremely challenging problem due to the large number of possible garment items, variations in ...
  • 358
  • 49
MAttNet: Modular Attention Network for Referring Expression Comprehension
In this paper, we address referring expression comprehension: localizing an image region described by a natural language expression. While most recent work treats expressions as a single unit, we ...
  • 156
  • 48
Where to Buy It: Matching Street Clothing Photos in Online Shops
In this paper, we define a new task, Exact Street to Shop, where our goal is to match a real-world example of a garment item to the same item in an online shop. This is an extremely challenging task ...
  • 291
  • 37
High level describable attributes for predicting aesthetics and interestingness
With the rise in popularity of digital cameras, the amount of visual data available on the web is growing exponentially. Some of these pictures are extremely beautiful and aesthetically pleasing, but ...
  • 392
  • 32