Linguistically Motivated Features for Enhanced Back-of-the-Book Indexing

Andras Csomai and Rada Mihalcea
In this paper, we present a supervised method for back-of-the-book index construction. We introduce a novel set of features that goes beyond the typical frequency-based analysis, including features based on discourse comprehension, syntactic patterns, and information drawn from an online encyclopedia. In experiments carried out on a book collection, the method improved on an existing state-of-the-art supervised method by roughly 140%.
This paper has 24 citations.

