Shape Quantization and Recognition with Randomized Trees
@article{Amit1997ShapeQA, title={Shape Quantization and Recognition with Randomized Trees}, author={Yali Amit and Donald Geman}, journal={Neural Computation}, year={1997}, volume={9}, pages={1545-1588} }
We explore a new approach to shape recognition based on a virtually infinite family of binary features (queries) of the image data, designed to accommodate prior information about shape invariance and regularity. Each query corresponds to a spatial arrangement of several local topographic codes (or tags), which are in themselves too primitive and common to be informative about shape. All the discriminating power derives from relative angles and distances among the tags. The important attributes…
1,148 Citations
Joint Induction of Shape Features and Tree Classifiers
- Computer ScienceIEEE Trans. Pattern Anal. Mach. Intell.
- 1997
A very large family of binary features for two-dimensional shapes determined by inductive learning during the construction of classification trees is introduced, which makes it possible to narrow the search for informative ones at each node of the tree.
Image Classification using Random Forests and Ferns
- Computer Science2007 IEEE 11th International Conference on Computer Vision
- 2007
It is shown that selecting the ROI adds about 5% to the performance and, together with the other improvements, the result is about a 10% improvement over the state of the art for Caltech-256.
Coarse-to-Fine Visual Selection
- Computer Science
- 1999
The approach is sequential testing which is coarse-tone in both in the exploration of poses and the representation of objects, and the spatial distribution of processing is highly skewed and detection is rapid, but at the expense of false alarms which, presumably, could be eliminated with localized, more intensive, processing.
Coarse-to-Fine Face Detection
- Computer ScienceInternational Journal of Computer Vision
- 2004
The approach is sequential testing which is coarse-to-fine in both in the exploration of poses and the representation of objects, and the spatial distribution of processing is highly skewed and detection is rapid, but at the expense of (isolated) false alarms which could be eliminated with localized, more intensive, processing.
A Framework for Learning Visual Discrimination
- Computer ScienceFLAIRS Conference
- 1999
A method that allows incremental learning of discriminative features in a feature space that includes juxtapositions of oriented local pieces of edge and is parameterized by and the relative angles and distances between the edgels is presented.
A Graph Lattice Approach to Maintaining and Learning Dense Collections of Subgraphs as Image Features
- Computer ScienceIEEE Transactions on Pattern Analysis and Machine Intelligence
- 2013
This paper shows how large families of complex image features in the form of subgraphs can be built out of simpler ones through construction of a graph lattice - a hierarchy of related sub graphs linked in a lattice.
Distance sets for shape filters and shape recognition
- Computer ScienceIEEE Trans. Image Process.
- 2003
This work addresses two problems that are often encountered in object recognition: object segmentation, for which a distance sets shape filter is formulated, and shape matching, which is illustrated on printed and handwritten character recognition and detection of traffic signs in complex scenes.
A Computational Model for Visual Selection
- Computer ScienceNeural Computation
- 1999
The model was not conceived to explain brain functions, but it does cohere with evidence about the functions of neurons in V1 and V2, such as responses to coarse or incomplete patterns and to scale and translation invariance in IT.
Graded Learning for Object Detection
- Computer Science
- 1999
The goal is to detect all instances of a generic object class, such as a face, in greyscale scenes by learning a hierarchy of spatial arrangements of edge fragments, graded by their size (sparsity).
A Memory Efficient Discriminative Approach for Location Aided Recognition
- Computer ScienceECCV Workshops
- 2012
This paper investigates a new approach to mobile visual recognition that would involve uploading only GPS coordinates to a server, following which a compact location specific classifier would be downloaded to the client and recognition would be computed completely on the client.
References
SHOWING 1-10 OF 79 REFERENCES
Joint Induction of Shape Features and Tree Classifiers
- Computer ScienceIEEE Trans. Pattern Anal. Mach. Intell.
- 1997
A very large family of binary features for two-dimensional shapes determined by inductive learning during the construction of classification trees is introduced, which makes it possible to narrow the search for informative ones at each node of the tree.
A Computational Model for Visual Selection
- Computer ScienceNeural Computation
- 1999
The model was not conceived to explain brain functions, but it does cohere with evidence about the functions of neurons in V1 and V2, such as responses to coarse or incomplete patterns and to scale and translation invariance in IT.
Memory-based character recognition using a transformation invariant metric
- Computer ScienceProceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5)
- 1994
A new distance measure which can be made locally invariant to any set of transformations of the input; and can be computed efficiently is proposed.
Solving Multiclass Learning Problems via Error-Correcting Output Codes
- Computer ScienceJ. Artif. Intell. Res.
- 1995
It is demonstrated that error-correcting output codes provide a general-purpose method for improving the performance of inductive learning programs on multiclass problems.
Invariant Descriptors for 3D Object Recognition and Pose
- Computer ScienceIEEE Trans. Pattern Anal. Mach. Intell.
- 1991
A model-based vision system that recognizes curved plane objects irrespective of their pose is demonstrated and the stability of a range of invariant descriptors to measurement error is treated in detail.
What Size Net Gives Valid Generalization?
- Mathematics, Computer ScienceNeural Computation
- 1989
It is shown that if m O(W/ ∊ log N/∊) random examples can be loaded on a feedforward network of linear threshold functions with N nodes and W weights, so that at least a fraction 1 ∊/2 of the examples are correctly classified, then one has confidence approaching certainty that the network will correctly classify a fraction 2 ∊ of future test examples drawn from the same distribution.
A comparison of decision tree classifiers with backpropagation neural networks for multimodal classification problems
- Computer SciencePattern Recognit.
- 1993