Statistics

This page gives an overview of the information about works, people and organizations made available via DataCite Commons. Please contact DataCite Support with questions or comments.

Data Sources

DataCite Commons draws on the following main data sources, which currently total 44,484,079 records:

DataCite

22,858,234 Works
100% of identifiers and metadata.

Crossref

9,976,797 Works
8.04% of identifiers and metadata. Import is ongoing.

ORCID

11,548,581 People
100% of identifiers. Personal and employment metadata.

ROR

100,467 Organizations
100% of identifiers and metadata.

Additional information comes from these data sources:
  • Wikidata: inception year, geolocation and Twitter account for organizations
  • Unpaywall: download link for Open Access content via Crossref
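
The total quoted above can be reproduced by adding up the four per-source counts. The following is a minimal sketch of that arithmetic only; the dictionary keys and variable names are illustrative and not part of DataCite Commons.

```python
# Record counts as listed for each main data source on this page.
records = {
    "DataCite works": 22_858_234,
    "Crossref works": 9_976_797,
    "ORCID people": 11_548_581,
    "ROR organizations": 100_467,
}

# Sum across all four sources, and across the two sources that provide works.
total_records = sum(records.values())
total_works = records["DataCite works"] + records["Crossref works"]

print(f"{total_records:,} records, of which {total_works:,} are works")
```

The works subtotal is the sum of the DataCite and Crossref counts only, since ORCID and ROR contribute people and organizations rather than works.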

Works

DataCite Commons currently includes 32,835,031 works, with identifiers and metadata provided by DataCite and Crossref. For the three major work types (publication, dataset and software), the numbers by publication year are shown below.

16,466,479 Publications

9,040,091 Datasets

199,889 Software

6,410,371 out of all 32,838,830 (19.52%) works have been cited at least once, including 0.97% of works registered with DataCite, and 62.01% of works registered with Crossref.

6,162,486 (37.42%) Cited Publications

97,864 (1.08%) Cited Datasets

1,762 (0.88%) Cited Software
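
The percentages above appear to be each cited count divided by the total number of works of the same type. The sketch below reproduces them under that assumption; the helper name `share` and the data layout are illustrative choices, not something the page defines.

```python
def share(part, whole):
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100 * part / whole, 2)

# (cited works, all works) per category, using the counts listed on this page.
cited = {
    "All works": (6_410_371, 32_838_830),
    "Publications": (6_162_486, 16_466_479),
    "Datasets": (97_864, 9_040_091),
    "Software": (1_762, 199_889),
}

cited_shares = {name: share(part, whole) for name, (part, whole) in cited.items()}

for name, pct in cited_shares.items():
    print(f"{name}: {pct}%")
```

The same calculation applies to the claimed and connected figures in the sections that follow, with the same per-type totals as denominators.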

People

DataCite Commons includes all 11,548,581 ORCID identifiers, along with personal and employment metadata. This information is retrieved live from the ORCID REST API. The numbers by registration year are shown below.

11,548,581 People

4,708,139 out of all 32,838,830 (14.34%) works have been claimed by (connected to) at least one ORCID record, including 5.82% of works registered with DataCite, and 33.86% of works registered with Crossref.

3,679,641 (22.35%) Claimed Publications

662,860 (7.33%) Claimed Datasets

30,805 (15.41%) Claimed Software
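
The overall claimed share can be cross-checked against the two per-registry shares, because it should equal their average weighted by the number of works each registry contributes. The sketch below assumes the DataCite and Crossref work counts from the Data Sources section are the relevant weights; since the published percentages are rounded, the result is an approximation, and all variable names are illustrative.

```python
# Work counts per registry, taken from the Data Sources section (assumed weights).
datacite_works = 22_858_234
crossref_works = 9_976_797

# Claimed shares per registry, in percent, as stated on this page.
datacite_claimed_pct = 5.82
crossref_claimed_pct = 33.86

# Estimated number of claimed works implied by the rounded percentages.
estimated_claimed = (
    datacite_works * datacite_claimed_pct
    + crossref_works * crossref_claimed_pct
) / 100

# Works-weighted average of the two registry shares.
overall_claimed_pct = 100 * estimated_claimed / (datacite_works + crossref_works)

print(f"Estimated claimed works: {estimated_claimed:,.0f}")
print(f"Overall claimed share: {overall_claimed_pct:.2f}%")
```

Under these assumptions the estimate lands within a few hundred works of the reported 4,708,139 and rounds to the same 14.34%. The same check can be applied to the cited and connected figures.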

Organizations

DataCite Commons includes all 100,467 Research Organization Registry (ROR) identifiers and metadata. This information is retrieved live from the ROR REST API. The numbers by registration year are shown below.

100,467 Organizations

23,726,121 out of all 32,838,830 (72.25%) works are connected with at least one organization via ROR ID or Crossref Funder ID, including 63.49% of works registered with DataCite, and 92.33% of works registered with Crossref.

12,307,297 (74.74%) Connected Publications

6,018,355 (66.57%) Connected Datasets

195,428 (97.77%) Connected Software