A kernel-based approach to document retrieval

@inproceedings{Gordo2010AKA,
  title={A kernel-based approach to document retrieval},
  author={Albert Gordo and Jaume Gibert and Ernest Valveny and Marçal Rusi{\~n}ol},
  booktitle={Document Analysis Systems},
  year={2010}
}
In this paper we tackle the problem of document image retrieval by combining a similarity measure between documents and the probability that a given document belongs to a certain class. The membership probability to a specific class is computed using Support Vector Machines in conjunction with similarity measure based kernel applied to structural document representations. In the presented experiments, we use different document representations, both visual and structural, and we apply them to a… CONTINUE READING