WORKS 2024

19th Workshop on Workflows in Support of Large-Scale Science
November 18
Atlanta, GA, USA In conjunction with


Proceedings by

WORKS 2024 focuses on the many facets of scientific workflow composition, management, sustainability, and application to domain sciences in an increasingly diverse landscape. The workshop covers a broad range of topics in the scientific workflow lifecycle that include: reproducible research with workflows; workflow execution in distributed and heterogeneous environments; application of AI/ML in workflow management; workflow provenance; serverless workflows; exascale computing with workflows; stream-processing, interactive, adaptive and data-driven workflows; workflow scheduling and resource management; workflow fault-tolerance, debugging, performance analysis/modeling; big data and AI workflows, etc.

Workshop Program (Monday 18, 9am to 5:30pm, Room B312)

Time Event
9:00am-9:14am Welcome
Silvina Caino-Lores, Anirban Mandal
9:14am-9:37am Paper: A software Ecosystem for Multi-Level Provenance Management in Large-Scale Scientific Workflows for AI Applications
Padovani, Anantharaj, Sacco, Kurihana, Bunino, Tsolaki, Girone, Antonio, Sopranzetti, Fronza, Fiore
9:37am-10:00am Paper: Performance Characterization and Provenance of Distributed Task-based Workflows on HPC Platforms
Gueroudji, Phelps, Islam, Carns, Snyder, Ross, Pouchard
10:00am-10:30am Break
10:30am-11:21am Invited Talk: Assessing and Advancing the Potential of Quantum Computing: A NASA Case Study
Eleanor Rieffel
11:21am-11:44am Paper: Parsl+CWL: Experiences Combining the Python and CWL Ecosystems
Karle, Clifford, Babuji, Chard, Katz, Chard
11:44am-12:07pm Paper: Towards Generating Contracts for Scientific Data Analysis Workflows
Vu, Kehrer
12:07pm-12:30pm Paper: GNU Parallel: Enabling Low-Overhead HT-HPC Workflows at Extreme Scale
Maheshwari, Arndt, Karimi, Yin, Suter, Johnson, Ferreira da Silva
12:30pm-2:00pm Lunch
2:00pm-2:05pm Panel Lightning Talk: Towards a Cohesive Ecosystem of Workflows, Data, Artificial Intelligence, and Humans
Paine, Deshmukh, Gunter, O'Donnell, Poon, Ramakrishnan
2:05pm-2:10pm Panel Lightning Talk: Accelerating Operation of Complex Workflows through Standard Data Interfaces
Paul, Regli
2:10pm-2:15pm Panel Lightning Talk: Trust and Verification of AI-Based Decision Making for Future Scientific Workflows: Challenges and Solutions
Giannakou, Amusat, Ramakrishnan
2:15pm-2:20pm Panel Lightning Talk: Integrating Evolutionary Algorithms with Distributed Deep Learning for Optimizing Hyperparameters on HPC Systems
Coletti, Santos Souza, Skluzacek, Suter, Ferreira da Silva
2:20pm-2:30pm Invited Panel Lightning Talk: Worfklows on LUMI: Europe's most powerful supercomputer
Tomasz Malkiewicz
2:30pm-3:00pm Panel Discussion: Future of Scientific Workflows
TBD
3:00pm-3:30pm Break
3:30pm-3:53pm Paper: Managing Workflow Malleabillity in Urgent Computing for Earthquake Alerts
Ejarque, Monterrubio-Velasco, Bhihe, Pienkowska, de la Puente, Badia
3:53pm-4:16pm Paper: A Microservices Architecture Toolkit for Interconnected Science Ecosystems
Brim, Drane, McDonnell, Engelmann, Malviya Thakur
4:16pm-4:39pm Paper: Shepherd: Seamless Integration of Service Workflows into Task-Based Workflows through Log Monitoring
Islam, Thain
4:39pm-5:02pm Paper: Laminar 2.0: Serverless Stream Processing with Enhanced Code Search and Recommendations
Rotchford, Evans, Filgueira
5:02pm-5:25pm Paper: Serverless Computing for Dynamic HPC Workflows
Thurimella
5:25pm-5:30pm Closure
Silvina Caino-Lores, Anirban Mandal

Important Dates

  • Papers and Abstracts Submission: August 4 August 8, 2024 (Final extension)
  • Paper and Abstract Acceptance Notifications: September 6, 2024
  • Camera-ready Submissions: September 27, 2024
  • Workshop: November 18, 2024

All deadlines are Anywhere on Earth (AoE).

Call for Papers

Scientific workflows have underpinned some of the most significant discoveries of the past several decades. Workflow management systems (WMSs) provide abstraction and automation that enable researchers to easily define sophisticated computational processes, and to then execute them efficiently on parallel and distributed computing systems. As workflows have been adopted by multiple scientific communities, they are becoming more complex and require more sophisticated workflow management capabilities. A workflow can now analyze terabyte-scale data sets, execute millions of individual tasks; coordinate heterogeneous resources and tasks from edge to core; and process near real-time data streams, files, and data placed in different types of storage systems. The computations can be single core workloads, loosely-coupled tasks, or tightly-coupled computations, and can run in heterogeneous distributed computing platforms all within a single anirban@ Steering Committee

David Abramson

University of Queensland, Australia

Malcolm Atkinson

University of Edinburgh, UK

Ewa Deelman

University of Southern California, USA

Michela Taufer

University of Tennessee, USA

Program Committee

Ilkay Altintas

San Diego Supercomputing Center

Rosa M. Badia

Barcelona Supercomputing Center

Changxin Bai

Kettering University

Jesus Carretero

University Carlos III of Madrid

Henri Casanova

University of Hawaii at Manoa

Alberto Cascajo

University Carlos III of Madrid

Tainã Coleman

San Diego Supercomputing Center

Iacoppo Colonnelli

University of Torino

Alexandru Costan

Inria

Daniel de Oliveira

Fluminese Federal University

Ewa Deelman

University of Southern California

Jorge Ejarque

Barcelona Supercomputing Center

Rafael Ferreira da Silva

Oak Ridge National Laboratory

Rosa Filgueira

EPCC, University of Edinburgh

Sandro Fiore

University of Trento

Daniel Garijo

Universidad Politécnica de Madrid

Sandra Gesing

San Diego Supercomputing Center

William Godoy

Oak Ridge National Laboratory

Shantenu Jha

Rutgers University

Daniel S. Katz

University of Illinois Urbana-Champaign

Mariam Kiran

Oak Ridge National Laboratory

Daniel Laney

Lawrence Livermore National Laboratory

Jakob Luettgau

Inria

Ketan C. Maheshwari

Oak Ridge National Laboratory

Maciej Malawski

AGH UST

Marta Mattoso

UFRJ

Paolo Missier

Newcastle University

Raffaele Montella

University of Naples Parthenope

Bogdan Nicolae

Argonne National Laboratory

Paola Olaya

University of Tennessee

Tapasya Patki

Lawrence Berkeley National Laboratory

Loïc Pottier

Lawrence Livermore National Laboratory

Radu Prodan

University of Klagenfurt

Bruno Raffin

Inria

Lavanya Ramakrishnan

Lawrence Berkeley National Laboratory

Tong Shu

Southern Illinois University

Yogesh Simhan

Indian Institute of Science

Raul Sirvent

Barcelona Supercomputing Center

Tyler Skluzacek

Oak Ridge National Laboratory

Renan Souza

Oak Ridge National Laboratory

Frédéric Suter

Oak Ridge National Laboratory

Domenico Talia

University of Calabria

Francois Tessier

French Institute for Research in Digital Science and Technology (Inria)

Douglas Thain

University of Notre Dame

Paolo Trunfio

University of Calabria

Cong Wang

RENCI

Sean R. Wilkinson

Oak Ridge National Laboratory

Justin Wozniak

Argonne National Laboratory

Orcun Yildiz

Argonne National Laboratory

Pawel Zuk

University of Southern California

Contact

For information please direct your inquiries to the workshop chairs: