# Latent Semantic Indexing : An Overview 1 Latent Semantic Indexing : An overview INFOSYS 240 Spring 2000 Final Paper

@inproceedings{Rosario2001LatentSI, title={Latent Semantic Indexing : An Overview 1 Latent Semantic Indexing : An overview INFOSYS 240 Spring 2000 Final Paper}, author={Barbara Rosario}, year={2001} }

Typically, information is retrieved by literally matching terms in documents with those of a query. However, lexical matching methods can be inaccurate when they are used to match a user's query. Since there are usually many ways to express a given concept (synonymy), the literal terms in a user's query may not match those of a relevant document. In addition, most words have multiple meanings (polysemy), so terms in a user's query will literally match terms in irrelevant documents. A better…

## 33 Citations

Context driven approach for extracting relevant documents from WWW

- Computer ScienceInternational Conference on Computing, Communication & Automation
- 2015

A promising approach to overcoming problems is Latent Semantic Indexing, which uses Singular Value Decomposition (SVD) to find the underlying latent semantic structure and relevant pages as a result.

LDA-Based Retrieval Framework for Semantic News Video Retrieval

- Computer ScienceInternational Conference on Semantic Computing (ICSC 2007)
- 2007

A lexicon-guided two-level LDA retrieval framework that uses the HowNet to guide the first- level LDA model's parameter estimation, and further construct the second-level HNet-guided LDA models based on theFirst-level's inference results.

A new approach to build a geographical taxonomy of adjacency automatically using the latent semantic indexing method

- Computer Science2015 Intelligent Systems and Computer Vision (ISCV)
- 2015

This paper introduces an approach for constructing a geographical taxonomy of adjacency for a country, to be used in reformulating spatial queries, and notes that reformulating geographical queries based on the built taxonomy improves widely the precision of the queries.

Linear transformations of semantic spaces for word-sense discrimination and collocation compositionality grading

- Computer Science
- 2015

It is hypothesised that Word Space and LSA should perform similarly in actual word-sense disambiguation and discrimination experiments, given the analysis of the SVD spaces produced by both models.

Meta-Search Engine based on Query-Expansion Using Latent Semantic Analysis and Probabilistic Latent Semantic Analysis

- 2007

As the result of the rapid advancements in Information Technology, Information Retrieval on Internet (Internet -Searching) is gaining importance, day by day. SearchEngines are admittedly essential…

An overview of topic modeling methods and tools

- Computer Science2017 International Conference on Intelligent Computing and Control Systems (ICICCS)
- 2017

Methods of Topic Modeling which includes Vector Space Model (VSM), Latent Semantic Indexing (LSI), Probabilistic LatentSemantic Analysis (PLSA),Latent Dirichlet Allocation (LDA) with their features and limitations are discussed.

Semantic routed network for distributed search engines

- Computer Science
- 2010

Using all these techniques, it is possible to design a Semantic Routed Network which has a potential to improve search results and response time, while saving resources.

Latent Ambiguity in Latent Semantic Analysis?

- Computer ScienceICPRAM
- 2013

It is shown that on b oth a tiny contrived data- set and also on a more substantial word-sense discovery data-set that the empiri cal outcomes achieved with LSA vary according to which formulation is chosen.

Semi-automatic terminology ontology learning based on topic modeling

- Computer ScienceEng. Appl. Artif. Intell.
- 2017

Two topic modeling algorithms are explored, namely LSI & SVD and Mr.LDA for learning topic ontology, to determine the statistical relationship between document and terms to build a topicontology and ontology graph with minimum human intervention.

A Heterogeneous System Based on Latent Semantic Analysis Using GPU and Multi-CPU

- Computer ScienceSci. Program.
- 2017

A heterogeneous Latent Semantic Analysis (hLSA) system, which has been developed using General-Purpose computing on Graphics Processing Units (GPGPUs) architecture, which can solve large numeric problems faster through the thousands of concurrent threads on multiple CUDA cores of GPUs and multi-CPU architecture.

## References

SHOWING 1-10 OF 18 REFERENCES

Information Management Tools for Updating an SVD-Encoded Indexing Scheme

- Computer Science
- 1994

Latent Semantic Indexing (LSI) is a conceptual indexing technique which uses the SVD to estimate the underlying latent semantic structure of the word to document association, which dampens the effect of word choice variability by representing terms and documents using the (orthogonal) left and right singular vectors.

Cross-Language Information Retrieval Using Latent Semantic Indexing

- Computer Science
- 1994

Using the proposed merge strategies, LSI is shown to be able to retrieve relevant documents from either language (Greek or English) without requiring any translation of a user's query.

Latent Semantic Indexing (LSI) and TREC-2

- Computer ScienceTREC
- 1993

LSI is an extension of the vector retrieval method in which the dependencies between terms are explicitly taken into account in the representation and exploited in retrieval by simultaneously modeling all the interrelationships among terms and documents.

Indexing by Latent Semantic Analysis

- Mathematics
- 1990

A new method for automatic indexing and retrieval is described. The approach is to take advantage of implicit higher-order structure in the association of terms with documents (“semantic structure”)…

An application of least squares fit mapping to text information retrieval

- Computer ScienceSIGIR
- 1993

It is discovered that the knowledge about relevance among queries and documents can be used to obtain empirical connections between query terms and the canonical concepts which are used for indexing the content of documents.

Improving text retrieval for the routing problem using latent semantic indexing

- Computer ScienceSIGIR '94
- 1994

This paper applies LSI to the routing task, which operates under the assumption that a sample of relevant and non-relevant documents is available to use in constructing the query, and finds that when LSI is used is conjuction with statistical classification, there is a dramatic improvement in performance.

Latent semantic indexing: a probabilistic analysis

- Computer SciencePODS '98
- 1998

It is proved that under certain conditions LSI does succeed in capturing the underlying semantics of the corpus and achieves improved retrieval performance.

Information retrieval using a singular value decomposition model of latent semantic structure

- Computer ScienceSIGIR '88
- 1988

In a new method for automatic indexing and retrieval, implicit higher-order structure in the association of terms with documents is modeled to improve estimates of term-document association, and…

A similarity-based probability model for latent semantic indexing

- Computer ScienceSIGIR '99
- 1999

A dual probability model is constructed for the Latent Semantic Indexing using the cosine similarity measure, establishing a statistical framework for LSI and leading to a statistical criterion for the optimal semantic dimensions.

Dimensions of meaning

- Computer ScienceSupercomputing '92
- 1992

The author analyzes the structure of the vector representations and applies them to word sense disambiguation and thesaurus induction and finds that dimensionality reduction by means of a singular value decomposition is employed.