Data sets for author name disambiguation: an empirical analysis and a new resource

@article{Mller2017DataSF,
  title={Data sets for author name disambiguation: an empirical analysis and a new resource},
  author={Christoph M{\"u}ller and Florian Reitz and Nicolas Roy},
  journal={Scientometrics},
  year={2017}
}
Data sets of publication metadata with manually disambiguated author names play an important role in current author name disambiguation (AND) research. We review the most important data sets used so far and compare their respective advantages and shortcomings. From the results of this review, we derive a set of general requirements for future AND data sets. These include both trivial requirements, like absence of errors and preservation of author order, and more substantial ones, like full…

