Seminars & Colloquia
Department of Computer Science, University of Chicago
"A Virtual Data Language and System for Scientific Workflow Management in Data Grid Environments"
Wednesday, February 28, 2007, 10:00 AM
Location: 3211, EBII, NCSU Centennial Campus
The description, composition, and execution of even logically simple scientific workflows are often complicated by the need to deal with 'messy' issues such as heterogeneous storage formats and ad hoc file system structures. We show how these difficulties can be overcome via a typed workflow notation, the virtual data language, in which issues of physical representation are cleanly separated from logical typing, and via an implementation of this notation in a runtime system that supports distributed execution. The resulting virtual data language and system can express complex workflows in a simple, compact form, enact those workflows in distributed environments, monitor and record their execution, and trace the derivation history of data products.
We describe the motivation, design, implementation, and evaluation of the virtual data language and system, and the application of the virtual data paradigm to several scientific disciplines, including astronomy, cognitive neuroscience, high-energy physics, and science education.
Host: Rada Chirkova, Computer Science, NCSU