Crossref: The sustainable source of community-owned scholarly metadata

@article{Hendricks2020CrossrefTS,
  title={Crossref: The sustainable source of community-owned scholarly metadata},
  author={Ginny Hendricks and Dominika Tkaczyk and Jennifer Lin and Patricia Feeney},
  journal={Quantitative Science Studies},
  year={2020},
  volume={1},
  pages={414--427}
}
This paper describes the scholarly metadata collected and made available by Crossref, as well as its importance in the scholarly research ecosystem. Containing over 106 million records and expanding at an average rate of 11% a year, Crossref’s metadata has become one of the major sources of scholarly data for publishers, authors, librarians, funders, and researchers. The metadata set consists of 13 content types, including not only traditional types, such as journals and conference papers, but… 
A systematic metadata harvesting workflow for analysing scientific networks
TLDR
This work presents replicable Python scripts to perform network analysis and hypothesises that this workflow will provide an avenue for understanding scientific scholarship in multiple dimensions.
Using Conventional Bibliographic Databases for Social Science Research: Web of Science and Scopus are not the Only Options
Although large citation databases such as Web of Science and Scopus are widely used in bibliometric research, they have several disadvantages, including limited availability, poor coverage of books
How to structure citations data and bibliographic metadata in the OpenCitations accepted format
TLDR
This paper illustrates how the citation data and bibliographic metadata should be structured to comply with the OpenCitations accepted format.
Transparency to hybrid open access through publisher-provided metadata: An article-level study of Elsevier
TLDR
This study addresses the lack of transparency by leveraging Elsevier article metadata, provides the first publisher-level study of hybrid OA uptake and invoicing, and demonstrates the value of publisher-provided metadata to improve transparency in scholarly publishing by linking invoice data to bibliometrics.
Linking Publications to Funding at Project Level: A curated dataset of publications reported by FP7 projects
TLDR
This dataset is, to the authors' knowledge, the first comprehensive and curated dataset of scholarly outputs of the Framework Programme and could only be created thanks to significant improvements and investments made in the reporting systems used by EU funded projects.
Toward transparency of hybrid open access through publisher‐provided metadata: An article‐level study of Elsevier
TLDR
This study addresses the lack of transparency by leveraging Elsevier article metadata and provides the first publisher‐level study of hybrid OA uptake and invoicing, and demonstrates the value of publisher‐provided metadata to improve the transparency in scholarly publishing.
Scholarly outputs of EU Research Funding Programs: Understanding differences between datasets of publications reported by grant holders and OpenAIRE Research Graph in H2020
TLDR
OpenAIRE Research Graph offers a more complete dataset of scholarly outputs from EU research funding programs; the study describes the dataflow leading to the datasets' creation and assesses the quality of the data by validating the link <project, publications>.
Identifying and correcting invalid citations due to DOI errors in Crossref data
TLDR
The data gathered in this study can enable investigating possible reasons for DOI mistakes from a qualitative point of view, helping publishers identify the problems underlying their production of invalid citation data, and could be integrated into the existing process to add citations by automatically correcting a wrong DOI.
A map of Digital Humanities research across bibliographic data sources
TLDR
Citations from and to DH publications showed strong connections between DH and research in Computer Science, Linguistics, Psychology, and Pedagogical & Educational Research, which suggests a reciprocal interest between the two disciplines.
Large-scale comparison of bibliographic data sources: Scopus, Web of Science, Dimensions, Crossref, and Microsoft Academic
TLDR
A large-scale comparison of five multidisciplinary bibliographic data sources: Scopus, Web of Science, Dimensions, Crossref, and Microsoft Academic highlights the importance of combining a comprehensive coverage of the scientific literature with a flexible set of filters for making selections of the literature.

References

Merits and Limits: Applying open data to monitor open access publications in bibliometric databases
TLDR
Preliminary results suggest that identifying the OA status of publications remains a difficult and currently unfulfilled task.
Integrating and Exploiting Public Metadata Sources in a Bibliographic Information System
TLDR
Some insights are given into how the dblp bibliography, as an example of such a system, is maintained and improved, how metadata can be automatically harvested from publisher websites, and how the harvesting process can be steered.
Comparing published scientific journal articles to their pre-print versions
TLDR
A comparative study of pre-print papers from two distinct science, technology, and medicine corpora and their final published counterparts revealed that the text contents of the scientific papers generally changed very little from their pre-print to final published versions.
Disciplinary differences of the impact of altmetric
  • J. Ortega
  • Computer Science
  • FEMS microbiology letters
  • 2018
TLDR
The results show that articles in the General category attract more attention from social media, Social Sciences articles have higher usage than Physical Sciences articles, and General articles are more cited and saved than Health Sciences and Social Sciences articles.
The practice of self-citations: a longitudinal study
TLDR
The analysis showed an overall increment in author self-citations in several of the 24 academic disciplines considered, but depicted a stronger causal relation between such increment and the rules introduced by the 2012 Italian Scientific Habilitation in 10 out of 24 disciplines analysed.
Network analysis to evaluate the impact of research funding on research community consolidation
TLDR
An analysis of collaboration networks for both groups of authors suggests that the Sloan Foundation’s program resulted in a more consolidated community of researchers, specifically in terms of number of components, diameter, density, and transitivity of the coauthor networks.
Research data management and the evolutions of scholarship
This case study critically examines ongoing developments in contemporary scholarship through the lens of research data management support at KU Leuven, and KU Leuven Libraries in particular. By
The Two-Way Street of Open Access Journal Publishing: Flip It and Reverse It
TLDR
It is argued that reverse flips present a unique perspective on OA, and that further research would greatly benefit from enhanced data and tools for identifying such cases.
Do Authors Deposit on Time? Tracking Open Access Policy Compliance
TLDR
After the introduction of the UK REF 2021 OA policy, this time lag has decreased significantly in the UK, and the policy introduction might have accelerated the UK's move towards immediate OA compared to other countries, supporting the argument for the inclusion of a time-limited deposit requirement in OA policies.