Scalable data citation in dynamic, large databases: Model and reference implementation

Stefan Pröll and Andreas Rauber
2013 IEEE International Conference on Big Data
Uniquely and precisely identifying and citing arbitrary subsets of data is essential in many settings, e.g. to facilitate experiment validation and data re-use in meta-studies. Current approaches relying on pointers to entire data collections or on explicit copies of data do not scale. We propose a novel approach relying on persistent, timestamped, adapted queries to versioned and timestamped data sources. Result set hashes are used to validate correctness on later re-execution. The proposed…
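The idea in the abstract can be illustrated with a minimal sketch. This is not the authors' reference implementation. It assumes a hypothetical `measurements` table versioned with `valid_from`/`valid_to` columns, integer logical timestamps in place of real clock time, and SHA-256 over the ordered rows as the result set hash. A citation is stored as a timestamp plus a hash, and re-executing the query "as of" that timestamp should reproduce the same hash even after the data has changed.

```python
import hashlib
import sqlite3


def result_hash(rows):
    """SHA-256 over the rows in the order returned (hash choice is an assumption)."""
    h = hashlib.sha256()
    for row in rows:
        h.update(repr(row).encode("utf-8"))
    return h.hexdigest()


def run_as_of(conn, ts):
    """Return the rows that were valid at logical time ts, in a fixed order."""
    return conn.execute(
        "SELECT id, value FROM measurements "
        "WHERE valid_from <= ? AND (valid_to IS NULL OR valid_to > ?) "
        "ORDER BY id",
        (ts, ts),
    ).fetchall()


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE measurements ("
    "id INTEGER, value REAL, valid_from INTEGER, valid_to INTEGER)"
)

# Time 1: two records are inserted; valid_to is NULL while a version is current.
conn.executemany(
    "INSERT INTO measurements VALUES (?, ?, ?, NULL)",
    [(1, 10.0, 1), (2, 20.0, 1)],
)

# Time 2: the subset is cited. Only the timestamp and the hash are stored,
# not a copy of the data.
citation = {"timestamp": 2, "hash": result_hash(run_as_of(conn, 2))}

# Time 3: record 1 is updated by closing the old version and adding a new one.
conn.execute(
    "UPDATE measurements SET valid_to = 3 WHERE id = 1 AND valid_to IS NULL"
)
conn.execute("INSERT INTO measurements VALUES (1, 11.5, 3, NULL)")

# Later: re-execute the cited query as of its timestamp and check the hash.
cited_rows = run_as_of(conn, citation["timestamp"])
current_rows = run_as_of(conn, 4)
verified = result_hash(cited_rows) == citation["hash"]
```

The fixed `ORDER BY` matters in this sketch: hashing rows in an unspecified order could produce a different hash for the same result set.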
This paper has 43 citations.


