• Publications
  • Influence
You Only Look Once: Unified, Real-Time Object Detection
We present YOLO, a new approach to object detection. Prior work on object detection repurposes classifiers to perform detection. Instead, we frame object detection as a regression problem to …
  • 8,144
  • 1,298
Learning Everything about Anything: Webly-Supervised Visual Concept Learning
Recognition is graduating from labs to real-world applications. While it is encouraging to see its potential being tapped, it brings forth a fundamental challenge to the vision researcher: …
  • 244
  • 23
How Important Are "Deformable Parts" in the Deformable Parts Model?
The Deformable Parts Model (DPM) has recently emerged as a very useful and popular tool for tackling the intra-category diversity problem in object detection. In this paper, we summarize the key …
  • 121
  • 19
An empirical study of context in object detection
This paper presents an empirical evaluation of the role of context in a contemporary, challenging object detection task: the PASCAL VOC 2008. Previous experiments with context have mostly been done …
  • 415
  • 18
Asynchronous Temporal Fields for Action Recognition
Actions are more than just movements and trajectories: we cook to eat and we hold a cup to drink from it. A thorough understanding of videos requires going beyond appearance modeling and necessitates …
  • 97
  • 14
FigureSeer: Parsing Result-Figures in Research Papers
‘Which are the pedestrian detectors that yield a precision above 95% at 25% recall?’ Answering such a complex query involves identifying and analyzing the results reported in figures within several …
  • 48
  • 10
PDFFigures 2.0: Mining figures from research papers
Figures and tables are key sources of information in many scholarly documents. However, current academic search engines do not make use of figures and tables when semantically parsing documents or …
  • 41
  • 8
Exemplar Driven Character Recognition in the Wild
Character recognition in natural scenes continues to represent a formidable challenge in computer vision. Traditional optical character recognition (OCR) methods fail to perform well on characters …
  • 29
  • 8
VisKE: Visual knowledge extraction and question answering by visual verification of relation phrases
How can we know whether a statement about our world is valid? For example, given a relationship between a pair of entities, e.g., 'eat(horse, hay)', how can we know whether this relationship is true …
  • 93
  • 4
Looking Beyond Text: Extracting Figures, Tables and Captions from Computer Science Papers
Identifying and extracting figures and tables along with their captions from scholarly articles is important both as a way of providing tools for article summarization, and as part of larger systems …
  • 49
  • 4