The Information Retrieval Anthology

Authors: Martin Potthast, S. Günther, Janek Bevendorff, J. P. Bittner, Alexander Bondarenko, Maik Fröbe, Christian Kahmann, Andreas Niekler, Michael Völske, Benno Stein, Matthias Hagen
Published in: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
We present the IR Anthology, a corpus of information retrieval publications accessible via a metadata browser and a full-text search engine. Following the example of the well-known ACL Anthology, the IR Anthology serves as a hub for researchers interested in information retrieval. Our search engine ChatNoir indexes the publications' full texts, enabling a focused search and linking users to the respective publisher's site for personal access. Listing more than 40,000 publications at the time of…
1 Citation

The information retrieval anthology 2021
The Information Retrieval Anthology is an endeavor to create a comprehensive collection of metadata and full texts of IR-related publications; the challenges that lie ahead in developing it into a resource that serves the IR community for years to come are discussed.


The ACL Anthology Searchbench
A novel application for structured search in scientific digital libraries that provides search in both its bibliographic metadata and semantically analyzed full textual content and serves as a showcase for the recent progress in natural language processing research and language technology.
The ACL Anthology: Current State and Future Directions
How the planned use of Docker images will improve the ACL Anthology’s long-term stability is discussed, and an open challenge of reviewer matching is issued which the ACL community can directly benefit from.
Transitioning the information retrieval literature to a fully open access model
It is proposed that the IR community start working on a road map for transitioning the IR literature to a fully open access ("diamond") model.
On Forgetting to Cite Older Papers: An Analysis of the ACL Anthology
A bibliographic analysis finds that there is indeed a tendency for recent papers to cite more recent work, but the rate at which papers older than 15 years are cited has remained relatively stable.
Elastic ChatNoir: Search Engine for the ClueWeb and the Common Crawl
Elastic ChatNoir’s main purpose is to serve as a baseline for reproducible IR experiments and user studies for the coming years, empowering research at a scale not attainable to many labs beforehand, and to provide a platform for experimenting with new approaches to web search.
The State of NLP Literature: A Diachronic Analysis of the ACL Anthology
It is found that only about 30% of first authors are female, and that this percentage has not improved since the year 2000; the authors hope that recording citation and participation gaps across demographic groups will encourage more inclusiveness and fairness in research.
Explorations in Bibliography: Zotero Goes Public
The publishing of scholarly bibliographies has diminished significantly over the past two decades for an obvious reason: access to bibliographic tools via the Internet. But this has not marked the…
CiteSeer: an automatic citation indexing system
CiteSeer has many advantages over traditional citation indexes, including the ability to create more up-to-date databases which are not limited to a preselected set of journals or restricted by journal publication delays, completely autonomous operation with a corresponding reduction in cost, and powerful interactive browsing of the literature using the context of citations.
NLP Scholar: An Interactive Visual Explorer for Natural Language Processing Literature
Several interconnected interactive visualizations (dashboards) presenting various aspects of the data are described; they allow users to search for papers in their area of interest, published within specific time periods, published by specified authors, etc.
NLP Scholar: A Dataset for Examining the State of NLP Research
The NLP Scholar Dataset is presented: a single unified source of information (from both AA and Google Scholar) for tens of thousands of NLP papers that can be used to identify broad trends in productivity, focus, and impact of NLP research.