Corpus ID: 16057891

A Comparison of Open Source Search Engines

@inproceedings{Middleton2007ACO,
  title={A Comparison of Open Source Search Engines},
  author={Christian Middleton and Ricardo Baeza-Yates},
  year={2007}
}
Yet Another Comparison of Lucene and Indri Performance
TLDR
This work presents results that compare the performance of Lucene and Indri at two points in time using data from TREC 6 through 8, and examines the degree to which the results produced by the two systems overlap. Expand
Abordagens para a pesquisa por palavra-chave em bases de dados estruturadas
Os sistemas de pesquisa na web tornaram popular a pesquisapor palavras-chave. Este metodo de pesquisa melhorou claramente a usabilidadedas pesquisas na web. Por outro lado, as tradicionaisExpand
Report on the First HIPstIR Workshop on the Future of Information Retrieval
The vision of HIPstIR is that early stage information retrieval (IR) researchers get together to develop a future for non-mainstream ideas and research agendas in IR. The first iteration of thisExpand
Mining 100 million notes to find homelessness and adverse childhood experiences: 2 case studies of rare and severe social determinants of health in electronic health records
TLDR
A methodology to capture 2 rare and severe social determinants of health, homelessness and adverse childhood experiences (ACEs), from a large EHR repository, provides an efficient solution for mining homelessness and ACE information from EHRs, which can facilitate large clinical and genetic studies of these social determinant of health. Expand
Performance Evaluation of Distributed Indexing Using Solr and Terrier Information Retrievals
TLDR
Comparing the distributed indexing performance over MapReduce for the indexing strategies of Solr and Terrier using 1GB, 3GB, 6GB, and 9GB datasets shows that Terrier is more efficient with large datasets in the presence of processing resource scalability. Expand
An Extensible Search Engine for E icient Caching and Fast Lists Intersection
  • 2017
In modern search engines, caching has been widely used to reduce query latency. Most frequently requested documents and text fragments are cached to improve query throughput. Besides, there areExpand
NBLucene: Flexible and Efficient Open Source Search Engine
TLDR
An open source text searching project written in C++ for research purpose is expanded and it is shown how to use parallel mechanisms in the modern computer system like SIMD and GPUs. Expand
Query expansion strategies for laypeople-centred health information retrieval
TLDR
This thesis proposes several query expansion methods using different sources and methodologies to identify which terms will be added to the original query, and proposes the MTI approach, proving that medical concepts related to the query are good terms for the query expansion. Expand
Segment-Based Temporal Information Retrieval
Tese de doutoramento do Programa de Doutoramento em Ciencias e Tecnologias da Informacao, apresentada ao Departamento de Engenharia Informatica da Faculdade de Ciencias e Tecnologia da UniversidadeExpand
Term frequency with average term occurrences for textual information retrieval
TLDR
A new TWS is proposed that is based on computing the average term occurrences of terms in documents and it also uses a discriminative approach based on the document centroid vector to remove less significant weights from the documents. Expand
...
1
2
3
4
5
...

References

SHOWING 1-8 OF 8 REFERENCES
Introduction to information retrieval
TLDR
This groundbreaking new textbook teaches web-era information retrieval, including web search and the related areas of text classification and text clustering from basic concepts from a computer science perspective by three leading experts in the field. Expand
Modern Information Retrieval
  • Modern Information Retrieval
  • 1999
Managing Gigabytes: Compressing and Indexing Documents and Images
TLDR
A guide to the MG system and its applications, as well as a comparison to the NZDL reference index, are provided. Expand
Conclusions Bibliography
    Load Monitor Project Homepage. http://sourceforge.net/projects/monitor
    • Load Monitor Project Homepage. http://sourceforge.net/projects/monitor
    Text REtrieval Conference (TREC) Homepage
    • Text REtrieval Conference (TREC) Homepage
    Xapian Code Library Homepage
    • Xapian Code Library Homepage
    http://homepage.mac.com/pauljlucas/software/swish
    • http://homepage.mac.com/pauljlucas/software/swish