Data Cleaning and XML: The DBLP Experience

  title={Data Cleaning and XML: The DBLP Experience},
  author={Wai Lup Low and Wee Hyong Tok and Mong-Li Lee and Tok Wang Ling},
CiteSeer and Google-Scholar are huge digital libraries which provide access to (computer-)science publications. Both collections are operated like specialized search engines, they crawl the web with little human intervention and analyse the documents to classifiy them and to extract some metadata from the full texts. On the other hand there are traditional bibliographic data bases like INSPEC for engineering and PubMed for medicine. For the field of computer science the DBLP service evolved… CONTINUE READING
Highly Cited
This paper has 22 citations. REVIEW CITATIONS

From This Paper

Topics from this paper.