university of california, davis daniel zinn 1 university of california, davis daniel zinn 1 daniel...

Download University of California, Davis Daniel Zinn 1 University of California, Davis Daniel Zinn 1 Daniel Zinn Bertram Ludäscher University of California at Davis

If you can't read please download the document

Upload: jason-daniel

Post on 13-Dec-2015

212 views

Category:

Documents


0 download

TRANSCRIPT

  • Slide 1

University of California, Davis Daniel Zinn 1 University of California, Davis Daniel Zinn 1 Daniel Zinn Bertram Ludscher University of California at Davis Presented at IPAW 2010 Abstract Provenance Graphs: Anticipating and Exploiting Schema-Level Data Provenance Abstract Provenance Graphs: Anticipating and Exploiting Schema-Level Data Provenance Slide 2 University of California, Davis Daniel Zinn 2 Motivation Phylogenetic Tree of Primates Slide 3 University of California, Davis Daniel Zinn 3 Scientific Workflows Actors Channels Ports Tokens int, string, record{..}, array[..],.. SciWF = Executable specification of Scientific Method DrawTree AA-Sequences Clustal Aligned AA-Sequences Quicktree Newick Tree Slide 4 University of California, Davis Daniel Zinn 4 Virtual Data Assembly Lines (COMAD in Kepler) Data is organized as XML-like tree structures Encapsulate actor within a configurable shell XML Three configuration parameters: Scope to select scope of actor invocation Input assembler to create inputs for wrapped component Write expression to write results back into data stream Slide 5 University of California, Davis Daniel Zinn 5 Workflow execution and Provenance Slide 6 University of California, Davis Daniel Zinn 6 Ex1: Phylogenetics Workflow Configurations WorkflowGraph APG Slide 7 University of California, Davis Daniel Zinn 7 Ex2: BUG! Idle Actor No Input Configurations WorkflowGraph APG No input Slide 8 University of California, Davis Daniel Zinn 8 Ex3: Bug! Wrong input Configurations WorkflowGraph APG Slide 9 University of California, Davis Daniel Zinn 9 Bigger Picture Slide 10 University of California, Davis Daniel Zinn 10 Types and such Slide 11 University of California, Davis Daniel Zinn 11 In the paper Slide 12 University of California, Davis Daniel Zinn 12 APGs for the Three Examples Ex1: Desired result Ex2: Bug - No input Ex3: Bug - Too much input Slide 13 University of California, Davis Daniel Zinn 13 Time-Collapsed Flowgraph Only Show Collection-Structure at the End Collapse Provenance Slide 14 University of California, Davis Daniel Zinn 14 Structure-Collapsed Flowgraph Collapse the Collection edges Slide 15 University of California, Davis Daniel Zinn 15 Summary Abstract Provenance Graphs summarize potential Provenance graphs via graph homomorphisms are constructed via static analysis of the workflow without running the WF explain workflows data-flow make it easier to spot certain configuration bugs Slide 16 University of California, Davis Daniel Zinn 16 University of California, Davis Daniel Zinn 16