The 1st Worshop on Workflow Monitoring, Observability, and in situ Analytics
August 12th, 2024
Gotland, Sweden

WOWMON 2024

In cooperation with IEEE Computer Society and ACM

Held in conjunction with ICPP 24: International Conference in Parallel Processing

Topics

In recent years, there has been an emergence in high-performance workflow systems to address large-scale, complex scientific applications involving the interoperation of heterogeneous parallel tasks, massive data storage, and computational resources. From scientific workflows coupling HPC simulations to distributed data-driven analytics frameworks to many task ensemble applications, workflow execution environments place significant demands on the system platform for its scalable and efficient operation. In contrast to the performance optimization of traditional HPC applications that can rely on post-mortem performance analysis, dynamic performance monitoring and in situ analysis are crucial for evaluating and guiding resource management strategies for modern workflows. The workshop will focus on research topics related to deploying high-performance workflow systems and integrating state-of-the-art technologies in this field. Topics of interest include, but are not limited to, performance modeling, resource management, fault tolerance, programming models, observability, and other aspects of workflow systems coupling in-situ/near-real-time analytics with HPC simulations/experiments.

The workshop is relevant to multiple aspects of parallel processing – HPC, workflows, runtime systems, performance measurement and analysis, scalability, and scheduling – that come together in high-performance workflow-driven applications where dynamic behavior and performance variability present challenges to effective workflow execution, resource scheduling, and tuning. The target audience would encompass researchers and practitioners across the parallel processing spectrum, from HPC experts and workflow system developers to runtime system engineers, performance analysts, and scalability specialists.

The main goals of the workshop are to bring together researchers across these aspects to discuss challenges facing the development of observability, online monitoring, and in situ analytics capabilities for scalable workflow operation and optimization.

Call For Papers

Call for Papers

WOWMON, The 1st Worshop on Workflow Monitoring, Observability, and in situ Analytics will be held during ICPP 2024, the International Conference on Parallel Processing https://icpp2024.org. ICPP is one of the oldest computer science conferences; ICPP 2024 is the 53rd edition of ICPP.

Workshop Theme

In recent years, there has been an emergence in high-performance workflow systems to address large-scale, complex scientific applications involving the interoperation of heterogeneous parallel tasks, massive data storage, and computational resources. From scientific workflows coupling HPC simulations to distributed data-driven analytics frameworks to many task ensemble applications, workflow execution environments place significant demands on the system platform for its scalable and efficient operation. In contrast to the performance optimization of traditional HPC applications that can rely on post-mortem performance analysis, dynamic performance monitoring and in situ analysis are crucial for evaluating and guiding resource management strategies for modern workflows. The workshop will focus on research topics related to deploying high-performance workflow systems and integrating state-of-the-art technologies in this field. Topics of interest include, but are not limited to, performance modeling, resource management, fault tolerance, programming models, observability, and other aspects of workflow systems coupling in-situ/near-real-time analytics with HPC simulations/experiments.

Workshop Topic Relevance and Goals

The workshop is relevant to multiple aspects of parallel processing (HPC, workflows, runtime systems, performance measurement and analysis, scalability, and scheduling) that come together in high-performance workflow-driven applications where dynamic behavior and performance variability present challenges to effective workflow execution, resource scheduling, and tuning. The target audience would encompass researchers and practitioners across the parallel processing spectrum, from HPC experts and workflow system developers to runtime system engineers, performance analysts, and scalability specialists.

The main goals of the workshop are to bring together researchers across these aspects to discuss challenges facing the development of observability, online monitoring, and in situ analytics capabilities for scalable workflow operation and optimization.

ICPP 2024 will be held in Gotland, Sweden, from August 12 - 15, 2024. Topics of interest for the WOWMON workshop include, but are not limited to:

Description of Target Audience

The landscape for large HPC workflows is changing rapidly, and traditional workflow models relying on post-mortem analysis after completion often offer limited guidance for optimization. In situ analytics techniques embedded within the workflow are gaining traction, enabling real-time performance and intermediate results analysis. This allows for dynamic adjustments to scheduling, resource allocation, and execution strategies based on the unfolding data. The concept for the workshop grew out of two Dagstuhl seminars in 2023:

in which the workshop organizers participated. From our experiences there, we believe that the workshop will be of interest to a cross-section of people attending those seminars, as well as those interested in these topics.

Submission

Important Dates

Abstract submission deadline: May 24, 2024 (AoE)

Full Paper submission deadline: May 31, 2024(AoE)

Author notification: June 21, 2024

Camera-ready final papers submission deadline: TBD (AoE)

Submissions

• Paper submissions should not exceed 10 pages (including references) and all submissions must be made electronically through the ICPP conference submission portal (https://ssl.linklings.net/conferences/icpp/) in PDF format printable on US letter size (8.5" x 11") paper. Please use the ACM format located at: https://www.acm.org/publications/proceedings-template. More specifically, we recommend using \documentclass[sigconf,review,anonymous]{acmart} configuration for submissions prepared in LaTex. Changes to the template (e.g., margin, font size) could lead to automatic rejection.

• Submissions should represent original research results and cannot already be under review or accepted for publication in another venue.

• Paper submission should be in single-blind ACM format.

• Submitted papers will be evaluated by at least 3 reviewers based upon technical merits. The accepted papers will be published with IEEE/ACM.

• All accepted papers that are presented at the conference will be published in the ACM Digital Library.

• Accepted papers will also need to follow the conference registration policy to be included in the conference proceedings.

• Rejected ICPP submissions are welcome to submit to the workshop if the authors choose to do so.

Committee Members

Organizing Committee

Allen Malony Allen D. Malony, University of Oregon

Shantenu Jah Shantenu Jah, Rutgers University / Brookhaven National Laboratory

Ana Gainaru Ana Gainaru, Oak Ridge National Laboratory

Kevin Huck Kevin Huck, University of Oregon

Silvina Caino-Lores Silvina Caino-Lores, Inria

Technical Program Committee

Michael Ott, Leibniz Supercomputing Centre

Florina Ciorba, University of Basel

Srinivasan Ramesh, NVIDIA

Anthony Kougkas, Illinois Institute of Technology

Luan Teylo, Inria

Sean Wilkinson, Oak Ridge National Lab

Cyrus Harrison, Lawrence Livermore National Lab

Iacopo Colonnelli, Università di Torino

Douglas Thain, University of Notre Dame

David Marchant, University of Copenhagen

Tapasya Patki, Lawrence Livermore National Lab

Ulf Leser, Humboldt University of Berlin

Ivan Rodero, University of Utah

Jakob Luettgau, Inria

Raul Sirvent, Barcelona Supercomputing Center