Research Data Provenance


Group details

IG Established

Tracking provenance for research data is vital to science and scholarship, providing answers to common questions researchers pose when sharing and exchanging data: Where did it come from? Who modified it? Is this copy the same as the copy I deposited? In what way is it the same? How do I resolve discrepancies or anomalies?

This group focuses on the comparison and evaluation of models for data provenance. It is concerned with questions of data origins, maintenance of identity through the data lifecycle, and how we account for data modification. Objectives of this group include: recommending general and expressive frameworks for documenting research data transactions, proposing syntheses of complementary provenance views, and relating data provenance to problems of scientific equivalence and the assessment of data quality.

The Research Data Provenance group anticipates potential intersections with the Data Citation, Data Foundation and Terminology, and Metadata Standards working groups as well as the Data in Context interest group.


Recent Activity

27 Feb 2018

Meeting in 12 hours - update invitation

Hi all,
Please note that we have changed the invitation for the WG meeting at this time to use UTC, as requested at the last meeting. Here is the updated invitation (attached).
Also, I have updated the notes from the previous meeting and added the Agenda for this one there:

09 Feb 2018

Researching a blockchain based provenance system

Hello everyone,

I am a graduate student at the University of Amsterdam, currently finishing my Master’s degree in Cyber Security. The topic I am researching is data traceability of satellite data. I work with Airbus to research how viable it is to trace satellite data as it is processed from raw data (level 0) to usable data (level 3 or 4) that, for instance, weather websites use. At each step of the processing phase, I need to be able to verify that the output originated from the previous dataset.
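
The verification requirement described above (each processing level must be traceable to the dataset before it) can be illustrated with a simple hash chain. This is only a minimal sketch under stated assumptions, not the design being researched and not a full blockchain: the record layout and the names `ProvenanceRecord`, `append_step`, and `verify_chain` are hypothetical, the level labels are placeholders, and SHA-256 is an assumed choice of digest.

```python
import hashlib
import json
from dataclasses import dataclass


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of data as a hex string."""
    return hashlib.sha256(data).hexdigest()


@dataclass(frozen=True)
class ProvenanceRecord:
    # Hypothetical record layout, for illustration only.
    level: str        # processing level label, e.g. "L0" or "L1"
    data_hash: str    # digest of the dataset produced at this step
    parent_hash: str  # record_hash of the previous step ("" for raw data)
    record_hash: str  # digest binding level, data_hash and parent_hash


def record_digest(level: str, data_hash: str, parent_hash: str) -> str:
    """Digest over the three fields that a record commits to."""
    payload = json.dumps([level, data_hash, parent_hash]).encode("utf-8")
    return sha256_hex(payload)


def append_step(chain: list, level: str, data: bytes) -> list:
    """Return a new chain with a record for data linked to the last record."""
    parent = chain[-1].record_hash if chain else ""
    data_hash = sha256_hex(data)
    digest = record_digest(level, data_hash, parent)
    return chain + [ProvenanceRecord(level, data_hash, parent, digest)]


def verify_chain(chain: list, datasets: list) -> bool:
    """Check that every dataset matches its record and every link is intact."""
    if len(chain) != len(datasets):
        return False
    parent = ""
    for record, data in zip(chain, datasets):
        if record.parent_hash != parent:
            return False  # link to the previous step is broken
        if record.data_hash != sha256_hex(data):
            return False  # dataset differs from what was recorded
        expected = record_digest(record.level, record.data_hash, record.parent_hash)
        if record.record_hash != expected:
            return False  # record fields were altered after the fact
        parent = record.record_hash
    return True


# Example with placeholder payloads standing in for real datasets.
raw = b"raw telemetry"
calibrated = b"calibrated radiances"
chain = append_step([], "L0", raw)
chain = append_step(chain, "L1", calibrated)
ok = verify_chain(chain, [raw, calibrated])
tampered = verify_chain(chain, [b"modified telemetry", calibrated])
```

Note that a chain like this only records the claimed lineage and detects later changes to a dataset or a record. It does not prove that a level 1 product was actually computed from the recorded level 0 data; that would need additional evidence, such as re-running the processing step.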


07 Dec 2017

IPAW 2018 CfP

Dear Provenance Interest Group,
For those who haven’t already seen it, the Call for Papers for ProvenanceWeek 2018 (a combination of IPAW, the International Provenance & Annotation Workshop, and TaPP, Theory and Practice of Provenance) is out:
Likely quite a few members of this group, including me, will be going to that conference.
Nicholas Car
Senior Experimental Scientist
CSIRO Land & Water

06 Dec 2017

For Info: Call for proposals for Plenary 11 closes on December 28

Dear Research Data Provenance group members,
I am your TAB liaison.
I want to remind you that the call for proposals closes on December 28.
More details at
I would encourage you to submit a proposal!
I will be on Annual Leave from December 15, but would be happy to work with
you on your proposal prior to this, if that would be useful.
I hope to see you in Berlin!

26 Oct 2017

Progress from the RDA Prov WG [SEC=UNCLASSIFIED]

Hi RDA Provenance IG and Aust. Research Data Prov. Group,
Just a note from the RDA's Provenance Patterns WG, which is consuming all of the group time that Davie Dubin, Paolo Missier and I have!
WG Charter
We are sticking to a Charter that we wrote for the WG, which you can see here:
From the Charter, our main work items are:
1. Common provenance Use Cases
2. Provenance design patterns

18 Sep 2017

“R programming language to manage metadata, data complying with OGC standards and controlled vocabularies: the case of Tuna Fisheries”

Dear IGs / WGs,

We would like to invite you to an RDA-related event that we believe is relevant to your activities: “R programming language to manage metadata and data by complying with OGC (EML, CF conventions) standards and controlled vocabularies: the case of Tuna Fisheries”.

When: Tuesday 19 September 2017, from 11:30 to 13:00 local time

Where: Room Mansfield 10, RDA Plenary Meeting, Montreal, Canada

14 Sep 2017

Joint Metadata & Provenance meeting at P10

The joint meeting of the Metadata IG, Metadata Standards Catalog WG,
Research Data Provenance IG, and Data in Context IG will be on Tuesday,
September 19 in Breakout Session 2 (14:00 - 15:30 EST/UTC-5). The agenda
for this session is here
If you are unable to attend the meeting in person, there will be remote
access for this session (see below).
Remote Access Information (Gotomeeting):

24 Jul 2017

Re: Meeting notes from 19/07 [SEC=UNCLASSIFIED]

Dear Provenance Group,
Here are the minutes from the meeting last week on the 18th or 19th of July:
As always, you can comment on the document, which will be reviewed at the next Group meeting.
Next meeting is at the "Australian Friendly" time:
Los Angeles, USA Tue, 1 Aug 2017 at 6:00 pm PDT
Chicago, USA Tue, 1 Aug 2017 at 8:00 pm CDT

17 Jul 2017

Meeting in 36 hours [SEC=UNCLASSIFIED]

Dear Provenance Group,
We have our regular, European Friendly, meeting on Tuesday/Wednesday this week, in 36 hours. Here's a quick agenda with teleconference details.
1. Review of last meeting's minutes
2. Review progress to date on Prov Patterns WG