Model-based object pose in 25 lines of code


In this paper, we describe a method for finding the pose of an object from a single image. We assume that we can detect and match in the image four or more noncoplanar feature points of the object, and that we know their relative geometry on the object. The method combines two algorithms; the first algorithm,POS (Pose from Orthography and Scaling) approximates the perspective projection with a scaled orthographic projection and finds the rotation matrix and the translation vector of the object by solving a linear system; the second algorithm,POSIT (POS with ITerations), uses in its iteration loop the approximate pose found by POS in order to compute better scaled orthographic projections of the feature points, then applies POS to these projections instead of the original image projections. POSIT converges to accurate pose measurements in a few iterations. POSIT can be used with many feature points at once for added insensitivity to measurement errors and image noise. Compared to classic approaches making use of Newton's method, POSIT does not require starting from an initial guess, and computes the pose using an order of magnitude fewer floating point operations; it may therefore be a useful alternative for real-time operation. When speed is not an issue, POSIT can be written in 25 lines or less in Mathematica; the code is provided in an Appendix.

DOI: 10.1007/BF01450852

Extracted Key Phrases

2 Figures and Tables

Citations per Year

1,069 Citations

Semantic Scholar estimates that this publication has 1,069 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@article{DeMenthon1992ModelbasedOP, title={Model-based object pose in 25 lines of code}, author={Daniel DeMenthon and Larry S. Davis}, journal={International Journal of Computer Vision}, year={1992}, volume={15}, pages={123-141} }