We consider the problem of metrically reconstructing a scene viewed by a moving stereo head. The head comprises two cameras with coplanar optical axes arranged on a lateral rig, each camera being free to vary its angle of vergence. Under various constraints, we derive novel explicit forms for the epipolar equation, and show that a static stereo head ...
If sufficiently many pairs of corresponding points in a stereo image pair are available to construct the associated fundamental matrix, then it has been shown that 5 relative orientation parameters and 2 focal lengths can be recovered from this fundamental matrix. This paper presents a new and essentially linear algorithm for recovering focal lengths. ...
Existing techniques for 3D action recognition are sensitive to viewpoint variations because they extract features from depth images which change significantly with viewpoint. In contrast, we directly process the pointclouds and propose a new technique for action recognition which is more robust to noise, action speed and viewpoint variations. Our technique ...
Existing techniques for 3D action recognition are sensitive to viewpoint variations because they extract features from depth images which are viewpoint dependent. In contrast, we directly process point clouds for cross-view action recognition from unknown and unseen views. We propose the histogram of oriented principal components (HOPC) descriptor that is ...
It is widely known that, for the affine camera model, both shape and motion can be factorized directly from the so-called image measurement matrix constructed from image point coordinates. The ability to extract both shape and motion from this matrix by a single SVD operation makes this shape-from-motion approach attractive; however, it cannot deal with ...
We consider the self-calibration problem in the special context of a stereo head, where the two cameras are arranged on a lateral rig with coplanar optical axes, each camera being free to vary its angle of vergence. Under various constraints, we derive explicit forms for the epipolar equation, and show that a static stereo head constitutes a degenerate ...
In this paper we propose a novel appearance descriptor for 3D human pose estimation from monocular images using a learning-based technique. Our image descriptor is based on the intermediate local appearance descriptors that we design to encapsulate local appearance context and to be resilient to noise. We encode the image by the histogram of such local ...
In this paper, we propose a hybrid method that combines Gaussian process learning, a particle filter, and annealing to track the 3D pose of a human subject in video sequences. Our approach, which we refer to as annealed Gaussian process guided particle filter, comprises two steps. In the training step, we use a supervised learning method to train a Gaussian ...