CiteSeerX: AI in a Digital Library Search Engine
@inproceedings{Wu2014CiteSeerXAI, title={CiteSeerX: AI in a Digital Library Search Engine}, author={Jian Wu and K. Williams and Hung-Hsuan Chen and Madian Khabsa and Cornelia Caragea and Suppawong Tuarob and Alexander G. Ororbia and Douglas Jordan and Prasenjit Mitra and C. Lee Giles}, booktitle={AI Mag.}, year={2014} }
CiteSeerX is a digital library search engine providing access to more than five million scholarly documents with nearly a million users and millions of hits per day. We present key AI technologies used in the following components: document classification and de-duplication, document and citation clustering, automatic metadata extraction and indexing, and author disambiguation. These AI technologies have been developed by CiteSeerX group members over the past 5–6 years. We show the usage status… CONTINUE READING
Figures, Tables, and Topics from this paper
70 Citations
Utility-Based Control Feedback in a Digital Library Search Engine: Cases in CiteSeerX
- Computer Science
- Feedback Computing
- 2014
- 3
- PDF
A Supervised Learning Approach To Entity Matching Between Scholarly Big Datasets
- Computer Science
- K-CAP
- 2017
- 4
- PDF
ParsRec: A Novel Meta-Learning Approach to Recommending Bibliographic Reference Parsers
- Computer Science
- AICS
- 2018
- 7
- PDF
The References of References: Enriching Library Catalogs via Domain-Specific Reference Mining
- Computer Science
- BIR@ECIR
- 2016
- 2
- PDF
References
SHOWING 1-10 OF 39 REFERENCES
The evolution of a crawling strategy for an academic document search engine: whitelists and blacklists
- Computer Science
- WebSci '12
- 2012
- 28
- PDF
Scholarly big data information extraction and integration in the CiteSeerχ digital library
- Computer Science
- 2014 IEEE 30th International Conference on Data Engineering Workshops
- 2014
- 45
- PDF
AckSeer: a repository and search engine for automatically extracted acknowledgments from digital libraries
- Computer Science
- JCDL '12
- 2012
- 40
- PDF
SeerSuite: Developing a Scalable and Reliable Application Framework for Building Digital Libraries by Crawling the Web
- Computer Science
- WebApps
- 2010
- 27
- PDF
TableSeer: automatic table metadata extraction and searching in digital libraries
- Computer Science
- JCDL '07
- 2007
- 150
- Highly Influential
- PDF
A Web Service for Scholarly Big Data Information Extraction
- Computer Science
- 2014 IEEE International Conference on Web Services
- 2014
- 19
- PDF
A figure search engine architecture for a chemistry digital library
- Computer Science
- JCDL '13
- 2013
- 24
- PDF
Information extraction from research papers using conditional random fields
- Computer Science
- Inf. Process. Manag.
- 2006
- 224