Using Long Runs as Predictors of Semantic Coherence in a Partial Document Retrieval System

@inproceedings{Shin2000UsingLR,
  title={Using Long Runs as Predictors of Semantic Coherence in a Partial Document Retrieval System},
  author={Hyopil Shin and J. Stach},
  year={2000}
}
We propose a method for dealing with semantic complexities occurring in information retrieval systems on the basis of linguistic observations. Our method follows from an analysis indicating that long runs of content words appear in a stopped document cluster, and from our observation that these long runs predominantly originate from the prepositional phrase and subject complement positions and, as such, may be useful predictors of semantic coherence. From this linguistic basis, we test three…
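To make the notion of a "long run" concrete, the following is a minimal sketch, not the authors' implementation. It assumes that a run is a maximal sequence of consecutive content words left after stop words are removed from a sentence, and that a run counts as "long" when it reaches a chosen length. The stop list, the tokenizer, the `min_length` threshold, and the function names `content_word_runs` and `long_runs` are all hypothetical and chosen only for illustration.

```python
import re

# Hypothetical, deliberately tiny stop list; the list used in the paper is not given here.
STOP_WORDS = {
    "a", "an", "the", "of", "in", "on", "and", "is", "are",
    "to", "from", "that", "for", "with", "by", "as",
}


def content_word_runs(text, stop_words=STOP_WORDS):
    """Return maximal runs of consecutive content words, sentence by sentence.

    Assumption: a stop word or a sentence boundary ends the current run.
    """
    runs = []
    # Split on sentence-like punctuation so that runs never cross a boundary.
    for sentence in re.split(r"[.!?;:]+", text):
        current = []
        # Lowercased alphabetic tokens, keeping internal hyphens together.
        for token in re.findall(r"[A-Za-z]+(?:-[A-Za-z]+)*", sentence.lower()):
            if token in stop_words:
                if current:
                    runs.append(current)
                current = []
            else:
                current.append(token)
        if current:
            runs.append(current)
    return runs


def long_runs(text, min_length=3, stop_words=STOP_WORDS):
    """Keep only runs with at least min_length content words (assumed threshold)."""
    return [run for run in content_word_runs(text, stop_words) if len(run) >= min_length]
```

With this stop list, the sentence "The retrieval of partial documents is a task for information retrieval systems." yields four runs, of which only `information retrieval systems` reaches the assumed threshold of three words. It follows the preposition "for", which is consistent with the abstract's observation about prepositional phrase positions.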