Corpus ID: 60831687

Maintaining an Online Bibliographical Database: The Problem of Data Quality

@inproceedings{Ley2006MaintainingAO,
  title={Maintaining an Online Bibliographical Database: The Problem of Data Quality},
  author={M. Ley and P. Reuther},
  booktitle={EGC},
  year={2006}
}
CiteSeer and Google-Scholar are huge digital libraries which provide access to (computer-)science publications. Both collections are operated like specialized search engines, they crawl the web with little human intervention and analyse the documents to classifiy them and to extract some metadata from the full texts. On the other hand there are traditional bibliographic data bases like INSPEC for engineering and PubMed for medicine. For the field of computer science the DBLP service evolved… Expand
71 Citations

Topics from this paper

Methods for Extracting Meta-Information from bibliographic databases
  • PDF
Integration and Warehousing of Social Metadata for Search and Assessment of Scientific Knowledge
  • PDF
Disambiguating publication venue titles using association rules
  • 13
Discovering and Analyzing Scientific Communities using Conference Network
  • PDF
Towards structured representation of academic search results
  • 1
  • PDF
Sieving publishing communities in DBLP
  • Christoph Schommer
  • Computer Science
  • 2008 Third International Conference on Digital Information Management
  • 2008
  • 1
Your Personal, Virtual Librarian
  • 3
Automating Document Annotation Using Open Source Knowledge
  • 7
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 15 REFERENCES
The DBLP Computer Science Bibliography: Evolution, Research Issues, Perspectives
  • M. Ley
  • Computer Science
  • SPIRE
  • 2002
  • 378
Browsing and visualizing digital bibliographic data
  • 26
Cleaning the spurious links in data
  • 41
Comparative study of name disambiguation problem using a scalable blocking-based framework
  • 118
  • PDF
A hierarchical naive Bayes mixture model for name disambiguation in author citations
  • 97
  • PDF
Co-authorship networks in the digital library research community
  • 744
  • PDF
On six degrees of separation in DBLP-DB and more
  • 152
  • PDF
Social Networks Applied
  • 332
  • PDF
Adaptive Name Matching in Information Integration
  • 541
  • PDF
Data quality for the information age
  • 750
...
1
2
...