Speaker: Dr. Christian Meesters, Johannes Gutenberg University Mainz
Date: January 14, 2025
Abstract:
This talk highlights the benefits of using workflow management systems, with a focus on Snakemake, for multistep data analysis on high-performance computing (HPC) clusters. It shows how workflows can streamline research by automating tasks, managing software environments (e.g., Conda, containers, module files), and handling HPC-specific requirements like resource allocation and job submission. We introduce the Snakemake workflow catalog, a resource for prebuilt workflows to save time and avoid reinventing the wheel. Parameterization enables workflow flexibility and scalability. Finally, the talk will explore how Snakemake facilitates reproducibility, from deployment to comprehensive workflow reports with execution statistics and publication-ready outputs. Material from past events is available at: https://hpc.fau.de/teaching/hpc-cafe/