Cite data#

DataCite Metadata Schema#

The DataCite Metadata Working Group published the DataCite Metadata Schema Documentation for the publication and citation of research data in 2019: DataCite Metadata Schema 4.3 together with a XSD (XML Schema Definition): metadata.xsd.

A simple datacite example can look like this:

<?xml version="1.0" encoding="UTF-8"?>
<resource xmlns:xsi="" xmlns="" xsi:schemaLocation="">
  <identifier identifierType="DOI">10.5072/D3P26Q35R-Test</identifier>
      <creatorName nameType="Personal">Fosmire, Michael</creatorName>
      <creatorName nameType="Personal">Wertz, Ruth</creatorName>
      <creatorName nameType="Personal">Purzer, Senay</creatorName>
    <title xml:lang="en">Critical Engineering Literacy Test (CELT)</title>
  <publisher xml:lang="en">Purdue University Research Repository (PURR)</publisher>
    <subject xml:lang="en">Assessment</subject>
    <subject xml:lang="en">Information Literacy</subject>
    <subject xml:lang="en">Engineering</subject>
    <subject xml:lang="en">Undergraduate Students</subject>
    <subject xml:lang="en">CELT</subject>
    <subject xml:lang="en">Purdue University</subject>
  <resourceType resourceTypeGeneral="Dataset">Dataset</resourceType>
    <description xml:lang="en" descriptionType="Abstract">
      We developed an instrument, Critical Engineering Literacy Test (CELT), which is a multiple choice instrument designed to measure undergraduate students’ scientific and information literacy skills. It requires students to first read a technical memo
      and, based on the memo’s arguments, answer eight multiple choice and six open-ended response questions. We collected data from 143 first-year engineering students and conducted an item analysis. The KR-20 reliability of the instrument was .39. Item
      difficulties ranged between .17 to .83. The results indicate low reliability index but acceptable levels of item difficulties and item discrimination indices. Students were most challenged when answering items measuring scientific and mathematical
      literacy (i.e., identifying incorrect information).


The PROV document family of the W3C working group defines various aspects that are necessary to be able to exchange provenance information interoperably.

Python prov#

With prov, a Python3 library is available that supports the import and export of the PROV data model into the following serialisation formats:

In addition, PROV documents can be created with NetworkX MultiDiGraph and vice versa. Finally, PROV documents can also be generated as graphs in PDF, PNG and SVG formats.