Mohsen Hejrati

Humans can prepare concise descriptions of pictures, focusing on what they find important. We demonstrate that automatic methods can do so too. We describe a system that can compute a score linking an image to a sentence. This score can be used to attach a descriptive sentence to a given image, or to obtain images that illustrate a given sentence. The score(More)
We present an approach to detecting and analyzing the 3D configuration of objects in real-world images with heavy occlusion and clutter. We focus on the application of finding and analyzing cars. We do so with a two-stage model; the first stage reasons about 2D shape and appearance variation due to within-class variation (station wagons look different than(More)
We introduce a new approach for recognizing and reconstructing 3D objects in images. Our approach is based on an analysis by synthesis strategy. A forward synthesis model constructs possible geometric interpretations of the world, and then selects the interpretation that best agrees with the measured visual evidence. The forward model synthesizes visual(More)
A new method for seamless image stitching is presented. The proposed algorithm is a hybrid method which uses optimal seam methods and smoothes the intensity transition between two images by color correction. A dynamic programming algorithm that finds an optimal seam along which gradient disparities are minimized is used. A modification of Poisson image(More)
This paper introduces and analyzes the novel task of categorical classification of cuboidal objects - e.g., distinguishing washing machines versus filing cabinets. To do so, it makes use of recent methods for automatic alignment of cuboidal objects in images. Given such geometric alignments, the natural approach for recognition might extract pose-normalized(More)
In this article we introduce the cylindrical construction, as an edge-replacement procedure admitting twists on both ends of the hyperedges, generalizing the concepts of lifts and Pultr templates at the same time. We prove a tensor-hom duality for this construction and we show that not only a large number of well-known graph constructions are cylindrical(More)
Actions exhibit complex temporal structure, such as sub-actions of variable durations and compositional orderings. Much research on action recognition ignores such structure and instead focuses on K-way classification of temporally pre-segmented video clips. We describe lightweight and efficient grammars that segment a continuous video(More)