Make Data Count: A Central Corpus for All Data Citations

Project Title

Strategic Initiative Make Data Count: A Central Corpus for All Data Citations


The Wellcome Trust (Grant ID: 226453/Z/22/Z)

Duration and Start Date

36 mo., February 1, 2023


The project is a broad collaboration across the scholarly communication ecosystem, including DataCite, CZI Science, EMBL-EBI, and others.


For data citation to become a priority across the research landscape, we need to facilitate ways to incorporate data metrics into institutional and funder processes related to the evaluation of research coverage and impact, as well as researcher assessment. To make this possible, the global corpus of data citations will deliver: 

  • A central aggregate of all references to research data across articles, preprints, government documents, and other outputs, released as a community resource.
  • A dashboard to allow community stakeholders to visualize citations to datasets according to different dimensions e.g. field of research, affiliation, and others.


Iratxe Puebla

Iratxe Puebla

Green icon of the Open Researcher and Contributor ID showing a white mail icon Twitter logo in dark blue. A dark blue Mastodon logo. github logo in dark blue LinkedIn logo in dark blue

Make Data Count Director

Iratxe joined DataCite in June 2023 as Director of Make Data Count, in this role she supports adoption of open data metrics and the development of the Open Global Data Citation Corpus. Prior to DataCite, Iratxe held editorial roles at the open-access publishers PLOS and BioMed Central and more recently, she worked at ASAPbio, focusing on community engagement with preprints and multi-stakeholder projects related to best practices and adoption of preprints. Iratxe is based in Cambridge (UK), she is passionate about open science and community building, and also has an interest in research reproducibility and publication ethics.

Follow us on Twitter, Mastodon, and LinkedIn for updates.