Software review: COCI, the OpenCitations Index of Crossref open DOI-to-DOI citations

@article{Heibi2019SoftwareRC,
  title={Software review: COCI, the OpenCitations Index of Crossref open DOI-to-DOI citations},
  author={Ivan Heibi and Silvio Peroni and David M. Shotton},
  journal={Scientometrics},
  year={2019},
  volume={121},
  pages={1213 - 1228}
}
In this paper, we present COCI, the OpenCitations Index of Crossref open DOI-to-DOI citations (http://opencitations.net/index/coci). COCI is the first open citation index created by OpenCitations, in which we have applied the concept of citations as first-class data entities, and it contains more than 445 million DOI-to-DOI citation links derived from the data available in Crossref. These citations are described using the resource description framework by means of the newly extended version of… 

OpenCitations, an infrastructure organization for open scholarship

OpenCitations is an infrastructure organization for open scholarship dedicated to the publication of open citation data as Linked Open Data using Semantic Web technologies, thereby providing a

OpenCitations, an open e-infrastructure to foster maximum reuse of citation data

TLDR
Since its data satisfies all the Reuse guidelines provided by FAIR in terms of richness, provenance, usage licenses and domain-relevant community standards, OpenCitations provides an example of a successful open e-infrastructure in which the reusability of data is integral to its mission.

Investigating missing citations in COCI and publishers involved

The primary purpose of this research is to find the publishers responsible for the missing citations in COCI by sending incorrect metadata to Crossref, the publishers to whom such invalid citations

OPTIMETA – Strengthening the Open Access publishing system through open citations and spatiotemporal metadata 

TLDR
This work will extend Open Journal Systems (OJS) to give it functionalities for collecting and distributing open data by developing two OJS plugins for capturing citation networks and articles' spatial and temporal properties as machine-readable and accessible metadata.

Google Scholar, Microsoft Academic, Scopus, Dimensions, Web of Science, and OpenCitations’ COCI: a multidisciplinary comparison of coverage via citations

TLDR
This paper investigates 3,073,351 citations found by these six data sources to 2,515 English-language highly-cited documents published in 2006 from 252 subject categories, expanding and updating the largest previous study.

Identifying and correcting invalid citations due to DOI errors in Crossref data

TLDR
The data gathered in this study can enable investigating possible reasons for DOI mistakes from a qualitative point of view, helping publishers identify the problems underlying their production of invalid citation data, and could be integrated into the existing process to add citations by automatically correcting a wrong DOI.

Open bibliographic data and the Italian NSQ

TLDR
Investigating the status of open bibliographic data of three of the most used open resources, namely Microsoft Academic Graph, Crossref and OpenAIRE, concludes that open datasets are ready to be used for some disciplines, among which mathematics, natural sciences, economics and statistics, even if there is still room for improvement.

Finding citations for PubMed: a large-scale comparison between five freely available bibliographic data sources

TLDR
Results indicate that Dimensions is the most comprehensive data source that provides references for 62.4% of PubMed documents, outperforming the official NIH-OCC dataset and achieving the best overall results.

La cobertura de los índices de citas abiertos se acerca a la de Web of Science y Scopus

The information sources that are often used to monitor and to obtain a better understanding of the system of scholarly communication (such as Web of Science, Scopus, and Google Scholar) have

Can we assess research using open scientific knowledge graphs? A case study within the Italian National Scientific Qualification

TLDR
The outcomes of the experiment show that the amount of open bibliographic metadata and open citation data currently available in the two scientific knowledge graphs adopted is not yet enough for obtaining results similar to those provided using commercial databases.

References

SHOWING 1-10 OF 51 REFERENCES

Open data to evaluate academic researchers: an experiment with the Italian Scientific Habilitation

TLDR
This paper investigates the growth of open citations with an experiment on the Italian Scientific Habilitation, the National process for University Professor qualification which instead uses data from commercial indexes, and simulates the procedure by only using open data.

One Year of the OpenCitations Corpus - Releasing RDF-Based Scholarly Citation Data into the Public Domain

TLDR
The OCC is introduced and its outcomes and uses after the first year of life are discussed, creating an RDF dataset of scholarly citation data that is open to all.

One Year of the OpenCitations Corpus - Releasing RDF-Based Scholarly Citation Data into the Public Domain.

TLDR
The OCC is introduced and its outcomes and uses after the first year of life are discussed, creating an RDF dataset of scholarly citation data that is open to all.

Enabling text search on SPARQL endpoints through OSCAR

TLDR
The latest version (Version 2.0) of OSCAR, the OpenCitations RDF Search Application, is introduced, which has several improved features and extends the query work compared with the previous version, presented at the workshop entitled Semantics, Analytics, Visualisation: Enhancing Scholarly Dissemination (SAVE-SD 2018).

Nine million book items and eleven million citations: a study of book-based scholarly communication using OpenCitations

TLDR
Books take longer than other bibliographic materials to reach peak citation levels, yet are cited for the same duration as journal articles, and most-cited books tend to cover general (yet essential) topics, theories, and technological concepts in mathematics and statistics.

The OpenCitations Data Model

TLDR
The OpenCitations Data Model (OCDM), a generic data model for describing bibliographic entities and citations, developed using Semantic Web technologies is presented and the effective reusability of OCDM is evaluated according to ontology evaluation practices.

OSCAR: A Customisable Tool for Free-Text Search over SPARQL Endpoints

TLDR
This paper introduces OSCAR, the OpenCitations RDF Search Application, which is a user-friendly search platform that can be used to search any RDF triplestore providing a SPARQL endpoint, while hiding the complexities of SParQL.

Introducing Wikidata to the Linked Data Web

TLDR
New RDF exports that connect Wikidata to the Linked Data Web are introduced and several partial exports are introduced that provide more selective or simplified views on the data.

OpenAIRE LOD Services: Scholarly Communication Data as Linked Data

TLDR
This work presents a scalable and maintainable architecture that converts the OpenAIRE data from its original HBase NoSQL source to RDF, and explores how this novel integration of data about research can facilitate scholarly communication.
...