Learn More
A new method for automatic indexing and retrieval is described. The approach is to take advantage of implicit higher-order structure in the association of terms with documents (" semantic structure ") in order to improve the detection of relevant documents on the basis of terms found in queries. The particular technique used is singular-value decomposition,(More)
In a new method for automatic indexing and retrieval, implicit higher-order structure in the association of terms with documents is modeled to improve estimates of term-document association, and therefore the detection of relevant documents on the basis of terms found in queries. Singular-value decomposition is used to decompose a large term by document(More)
This paper describes a new approach for dealing with the vocabulary problem in human-computer interaction. Most approaches to retrieving textual materials depend on a lexical match between words in users' requests and those in or assigned to database objects. Because of the tremendous diversity in the words people use to describe the same object, lexical(More)
The emergence of the CD-ROM as a storage medium for full-text databases raises the question of the maximum size database that can be contained by this medium. As an example, the problem of storing the Trésor de la Langue Fran&ccidel;aise on a CD-ROM is examined in this paper. The text alone of this database is 700 megabytes long, more than a CD-ROM can(More)
The web consists of huge amounts of data available in a variety of digital forms stored in thousands of repositories. Approaches that use the semantics of information captured in the metadata extracted from the data are being viewed as an appealing approach, especially in the context of the Semantic Web effort. We present in this chapter a discussion on(More)
The existence of machine readable text makes possible the development of new techniques that assist the literary scholar in locating interesting passages of text. In this paper we explore in a preliminary manner the possibility of adapting techniques developed in the field of document retrieval to the full text context. As an alternative to the conventional(More)
  • 1