MINDSWAP < Challenge

Challenge.MINDSWAP

This page in progress

Participating Team

Short team name: MINDSWAP
Participant names: Jennifer Golbeck
Project URL: http://provenance.mindswap.org
Project Overview: Using Semantic Web Technologies for Provenance
Provenance-specific Overview:
Relevant Publications:

Jennifer Golbeck. 2006. Combining Provenance with Trust in Social Networks for Semantic Web Content Filtering. Proceedings of the International Provenance and Annotation Workshop. Chicago, Illinois, May, 2006. ( http://trust.mindswap.org/papers/IPAW_Trust.pdf )

Christian Halaschek-Wiener, Jennifer Golbeck, Andrew Schain, Michael Grove, Bijan Parsia, and Jim Hendler. Annotation and provenance tracking in semantic web photo libraries.Proceedings of the International Provenance and Annotation Workshop (IPAW). Chicago, Illinois, May 2006. ( http://www.mindswap.org/papers/2006/IPAW_PhotoStuff.pdf )

Workflow Representation

The workflow has been encoded using an OWL ontology at http://provenance.mindswap.org/provenance.owl

Provenance Trace

The structure of the provenance trace can be extracted by following the logical and rule-based connections of instances of the ontology. Our ontology, instance data, and rules are all expressed in OWL. They can be browsed at http://provenance.mindswap.org (this site is still being constructed, so pardon any errors). A visualization shows some of the connections to a certain depth on each page. Transitive roles in the ontology allow provenance tracking back up to the original files. SPARQL queries are used for finding any more complex information.

Provenance Queries Matrix

Teams	Queries
Teams	Q1	Q2	Q3	Q4	Q5	Q6	Q7	Q8	Q9
MINDSWAP team

Provenance Queries

Queries can be run in real time at http://www.mindswap.org/~golbeck/queries.html

Write your own queries at http://provenance.mindswap.org/query/

We use SPARQL for all of our queries. Below is the syntax that we use for meeting each of the queries in the challenge. In some instances (for example, query 1), we give a full URI. Those URIs refer to a specific instance of a graphic or Service Execution. Those URIs can be replaced to perform the query about any other instance. They also can be substituted with a variable to query for the data related to any instance.

Find the process that led to Atlas X Graphic / everything that caused Atlas X Graphic to be as it is. This should tell us the new brain images from which the averaged atlas was generated, the warping performed etc.
Find the process that led to Atlas X Graphic, excluding everything prior to the averaging of images with softmean.
Find the Stage 3, 4 and 5 details of the process that led to Atlas X Graphic.
Find all invocations of procedure align_warp using a twelfth order nonlinear 1365 parameter model (see model menu describing possible values of parameter "-m 12" of align_warp) that ran on a Monday.
Find all Atlas Graphic images outputted from workflows where at least one of the input Anatomy Headers had an entry global maximum=4095. The contents of a header file can be extracted as text using the scanheader AIR utility.
Find all output averaged images of softmean (average) procedures, where the warped images taken as input were align_warped using a twelfth order nonlinear 1365 parameter model, i.e. "where softmean was preceded in the workflow, directly or indirectly, by an align_warp procedure with argument -m 12."
A user has run the workflow twice, in the second instance replacing each procedures (convert) in the final stage with two procedures: pgmtoppm, then pnmtojpeg. Find the differences between the two workflow runs. The exact level of detail in the difference that is detected by a system is up to each participant.

Since we reference files by URI, the outputs will be very different. Each file output will differ, as well as the services used and inputs.
A user has annotated some anatomy images with a key-value pair center=UChicago. Find the outputs of align_warp where the inputs are annotated with center=UChicago.
A user has annotated some atlas graphics with key-value pair where the key is studyModality. Find all the graphical atlas sets that have metadata annotation studyModality with values speech, visual or audio, and return all other annotations to these files.

Suggested Wokflow Variants

Suggest variants of the workflow that can exhibit capabilities that your system support.

Suggested Queries

Suggest significant queries that your system can support and are not in the proposed list of queries, and how you have implemented/would implement them. These queries may be with regards to a variant of the workflow suggested above.

Categorisation of queries

According to your provenance approach, you may be able to provide a categorisation of queries. Can you elaborate on the categorisation and its rationale.

Live systems

The system can be accessed and tested in its current form at http://provenance.mindswap.org

Further Comments

Provide here further comments.

Conclusions

Provide here your conclusions on the challenge, and issues that you like to see discussed at a face to face meeting.

-- LucMoreau - 31 May 2006

-- JenniferGolbeck - 28 Jun 2006
to top

End of topic
Skip to action links | Back to top

You are here: Challenge > FirstProvenanceChallenge > ParticipatingTeams > MINDSWAP

to top

Provenance Challenge