• Corpus ID: 246015474

Proactive Query Expansion for Streaming Data Using External Source

@article{Alshanik2022ProactiveQE,
  title={Proactive Query Expansion for Streaming Data Using External Source},
  author={Farah Alshanik and Amy W. Apon and Yuheng Du and Alexander Herzog and Ilya Safro},
  journal={ArXiv},
  year={2022},
  volume={abs/2201.06592}
}
Query expansion is the process of reformulating the original query by adding relevant words. Choosing which terms to add in order to improve the performance of the query expansion methods or to enhance the quality of the retrieved results is an important aspect of any information retrieval system. Adding words that can positively impact the quality of the search query or are informative enough play an important role in returning or gathering relevant documents that cover a certain topic can… 

References

SHOWING 1-10 OF 45 REFERENCES
Finding a good query-related topic for boosting pseudo-relevance feedback
TLDR
A topic-based feedback model with three different strategies for finding a good query-related topic based on the Latent Dirichlet Allocation model is proposed that achieves statistically significant improvements over a strong feedback model in the language modeling framework.
A Study on Query Expansion Based on Topic Distributions of Retrieved Documents
TLDR
This paper describes a new relevance feedback method that uses latent topic information extracted from target documents by means of latent Dirichlet allocation (LDA) and expands the initial query by providing the topic distributions of the documents retrieved at the first search.
Incorporating Query Expansion and Quality Indicators in Searching Microblog Posts
TLDR
A language modeling approach tailored to microblogging characteristics, where redundancy-based IR methods cannot be used in a straightforward manner, is developed and a dynamic query expansion model for microblog post retrieval is proposed.
Latent Document Re-Ranking
TLDR
A novel document re-ranking method based on Latent Dirichlet Allocation (LDA) which exploits the implicit structure of the documents with respect to original queries and calculates the distance between queries and initial retrieval results based on latent semantic information deduced.
Evaluating topic models for information retrieval
TLDR
Experiments to measure topic models' ability to predict held-out likelihood confirm past results on small corpora, but suggest that simple approaches to topic model are better for large corpora.
Using latent topics to enhance search and recommendation in Enterprise Social Software
Concept Based Query Expansion Using WordNet
TLDR
This paper proposed a new query expansion technique using the comprehensive thesaurus WordNet and its semantic relatedness measure modules, demonstrating a 7% precision improvement over retrieval methods not employing query expansion techniques.
LDA-based document models for ad-hoc retrieval
TLDR
This paper proposes an LDA-based document model within the language modeling framework, and evaluates it on several TREC collections, and shows that improvements over retrieval using cluster-based models can be obtained with reasonable efficiency.
Cluster-based language models for distributed retrieval
TLDR
A new approach to distributed retrieval based on document clustering and language modeling is proposed and it is shown that all three methods improve the effectiveness of distributed retrieval.
Quary Expansion Using Local and Global Document Analysis
TLDR
It is shown that, although global analysis haa some advantages, local analysia is generally more effective than global techniques and using global analysis techniques.
...
...