May 5-8, 2025
Chicago, IL

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for the event to participate in the sessions. If you have not registered but would like to join us, please visit the event registration page for more information.

This schedule is automatically displayed in Central Daylight Time (UTC-5). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Monday, May 5
 

8:00am CDT

Registration & Badge Pick-Up
Monday May 5, 2025 8:00am - 6:00pm CDT
Ballroom Meeting Foyer

9:00am CDT

Welcome Remarks - Christian Trott, Sandia National Laboratories
Monday May 5, 2025 9:00am - 9:10am CDT
Welcome to the inaugural HPSF Conference! These remarks will provide an overview of the conference and what to expect. Join us for the opening of this new chapter of HPSF.
Speakers

Christian Trott

Distinguished Member of Technical Staff, Sandia National Laboratories
Christian Trott is a High Performance Computing expert at Sandia National Laboratories, where he co-leads the Kokkos core team, developing performance portability solutions for engineering and science applications. He heads Sandia's delegation to the ISO C++ committee and is a principal...
Chicago River Ballroom

9:10am CDT

HPSF State of the Project Overview - Todd Gamblin, Lawrence Livermore National Laboratory
Monday May 5, 2025 9:10am - 9:20am CDT
Join us for an engaging session that highlights the remarkable progress of the High Performance Software Foundation (HPSF) in its inaugural year. We will explore the expansion of its vibrant community, the increasing number of members, and the diverse projects that have emerged. Attendees will gain insights into HPSF's strategic goals and future initiatives aimed at advancing the landscape of high-performance software. Discover how HPSF is poised to shape the future of software development in high-performance computing and learn how you can contribute to this exciting journey.
Speakers

Todd Gamblin

Distinguished Member of Technical Staff, Lawrence Livermore National Laboratory
Chicago River Ballroom

9:20am CDT

Governing Board Introduction - Heidi Poxon, AWS
Monday May 5, 2025 9:20am - 9:35am CDT
Learn about the work of the HPSF governing board (GB). The GB is responsible for the overall governance of HPSF, including member admission, budget decisions, and general policies. This session will provide attendees with an overview of the GB's work, how it makes decisions, and what its priorities are for steering the foundation.
Speakers

Heidi Poxon

Principal Member of Technical Staff, AWS
Chicago River Ballroom

9:35am CDT

TAC Introduction - Bill Hoffman, Kitware
Monday May 5, 2025 9:35am - 9:50am CDT
Join us for an informative session on the role of the Technical Advisory Committee (TAC) in shaping our project's landscape. The TAC oversees project admissions, develops a comprehensive project life-cycle approach, and facilitates coordination among member projects. This presentation will provide an overview of the TAC's responsibilities and detail the onboarding process for new projects, offering valuable insights into how we ensure successful integration and collaboration within our community. Don't miss this opportunity to understand the vital functions of the TAC and how they contribute to our collective success.
Speakers

Bill Hoffman

CTO, Kitware
Mr. Hoffman is a founder of Kitware and currently serves as Chairman of the Board, Vice President, and Chief Technical Officer (CTO). He is the original author and lead architect of CMake, an open source, cross-platform build and configuration tool that is used by hundreds of projects...
Chicago River Ballroom

9:50am CDT

Working Groups Introduction - Damien Lebrun-Grandie, Oak Ridge National Laboratory
Monday May 5, 2025 9:50am - 10:00am CDT
HPSF conducts cross-cutting activities that span its projects in working groups. This presentation will give a short overview of the existing working groups. Attendees will get the information they need to decide which breakout session to attend on day two of the HPSF conference and how to get involved in the important work of the foundation.
Speakers

Damien Lebrun-Grandie

Senior Computational Scientist, Oak Ridge National Laboratory
Chicago River Ballroom

10:00am CDT

Coffee Break
Monday May 5, 2025 10:00am - 10:30am CDT

10:30am CDT

Project Updates
Monday May 5, 2025 10:30am - 12:30pm CDT
This session offers a fast-paced overview of HPSF's cutting-edge software projects, showcasing the innovative capabilities and the vibrant community driving these initiatives. Attendees will gain insights into the diverse range of projects and their potential applications. Join us to explore new avenues for collaboration and discover how you can leverage HPSF software projects to enhance your own work and contribute to the community's growth.

Project Updates: System Tools
10:25 - 10:40 AM: HPCToolkit - Jonathon Anderson
10:40 - 10:55 AM: Apptainer - Dave Dykstra, Fermi National Accelerator Laboratory
10:55 - 11:10 AM: Charliecloud - Reid Priedhorsky, Los Alamos National Laboratory
11:10 - 11:25 AM: Spack - Greg Becker, Lawrence Livermore National Laboratory
11:25 - 11:40 AM: E4S - Sameer Shende, Paratools

Project Updates: Applications
11:40 - 11:55 AM: WarpX - Edoardo Zoni, Lawrence Berkeley National Laboratory

Project Updates: Scientific Libraries
12:00 - 12:15 PM: Viskores
12:15 - 12:30 PM: Trilinos
Speakers

Greg Becker

Software Developer, Lawrence Livermore National Laboratory

Dave Dykstra

Fermi National Accelerator Laboratory

Ken Moreland

Oak Ridge National Laboratory

Curtis Ober

Distinguished Member of Technical Staff, Sandia National Laboratories
Curt has been with Sandia for 30 years and is currently a Distinguished Member of the Technical Staff. His career has spanned many projects and missions at the laboratory, including computational fluid dynamics (CFD) for re-entry vehicles, shock hydrodynamics, time integration (Trilinos...

Reid Priedhorsky

Scientist, Los Alamos National Laboratory
I am a staff scientist at Los Alamos National Laboratory. Prior to Los Alamos, I was a research staff member at IBM Research. I hold a Ph.D. in computer science from the University of Minnesota and a B.A., also in computer science, from Macalester College. My work focuses on large-scale...

Sameer Shende

Research Professor and Director, Performance Research Lab, University of Oregon
Sameer Shende serves as a Research Professor and the Director of the Performance Research Lab at the University of Oregon and the President and Director of ParaTools, Inc. (USA) and ParaTools, SAS (France). He serves as the Technical Lead of the Extreme-scale Scientific Software Stack...

Edoardo Zoni

Research Software Engineer, Lawrence Berkeley National Laboratory
Chicago River Ballroom

12:30pm CDT

Lunch (Provided for Attendees)
Monday May 5, 2025 12:30pm - 2:00pm CDT
Atrium

2:00pm CDT

Project Updates
Monday May 5, 2025 2:00pm - 3:00pm CDT
This session offers a fast-paced overview of HPSF's cutting-edge software projects, showcasing the innovative capabilities and the vibrant community driving these initiatives. Attendees will gain insights into the diverse range of projects and their potential applications. Join us to explore new avenues for collaboration and discover how you can leverage HPSF software projects to enhance your own work and contribute to the community's growth.

Project Updates: Libraries / Programming Systems
1:55 - 2:10 PM: AMReX - Andrew Myers, Lawrence Berkeley National Laboratory
2:10 - 2:25 PM: Kokkos - Christian Trott, Sandia National Laboratories


Project Updates: New and Prospective Projects
2:25 - 2:35 PM: Chapel - Brad Chamberlain, HPE
2:35 - 2:45 PM: HPX - Hartmut Kaiser, LSU
2:45 - 2:55 PM: OpenHPC - Adrian Reber, Red Hat
Speakers

Hartmut Kaiser

STE||AR Group, LSU
Hartmut is a member of the faculty at the CS department at Louisiana State University (LSU) and a senior research scientist at LSU's Center for Computation and Technology (CCT). He received his doctorate from the Technical University of Chemnitz (Germany) in 1988. He is probably best...

Adrian Reber

Senior Principal Software Engineer, Red Hat
Adrian is a Senior Principal Software Engineer at Red Hat and has been migrating processes since at least 2010. He started to migrate processes in a high performance computing environment, and at some point he had migrated so many processes that he got a PhD for it. Most of the time he is...

Brad Chamberlain

Distinguished Technologist, HPE
Brad Chamberlain is a Distinguished Technologist at Hewlett Packard Enterprise (formerly Cray Inc.) who has spent his career focused on user productivity for high-performance computing (HPC) systems, particularly through the design and development of the Chapel parallel programming...

Andrew Myers

Computer Systems Engineer, Lawrence Berkeley National Laboratory

Christian Trott

Distinguished Member of Technical Staff, Sandia National Laboratories
Christian Trott is a High Performance Computing expert at Sandia National Laboratories, where he co-leads the Kokkos core team, developing performance portability solutions for engineering and science applications. He heads Sandia's delegation to the ISO C++ committee and is a principal...
Chicago River Ballroom

3:00pm CDT

Coffee Break
Monday May 5, 2025 3:00pm - 3:30pm CDT

3:30pm CDT

Panel Discussion: Status and Trends in the HPC Landscape - Moderated by Todd Gamblin, Lawrence Livermore National Laboratory
Monday May 5, 2025 3:30pm - 5:00pm CDT
High-performance computing (HPC) systems have undergone significant evolution over the years. Today, HPC professionals utilize not only traditional on-premise large clusters but also cloud-based solutions and appliance-like platforms that bridge the gap between workstations and larger clusters. Join this session to hear from experts who are at the forefront of deploying the largest exascale systems, along with representatives from leading cloud vendors. Gain insights into the current landscape of HPC platforms and learn about the systems you should prepare for in the near future to stay ahead in this rapidly changing field.
Moderators

Todd Gamblin

Distinguished Member of Technical Staff, Lawrence Livermore National Laboratory
Speakers

Dan Stanzione

Director, Texas Advanced Computing Center
Dr. Dan Stanzione, Associate Vice President for Research at The University of Texas at Austin since 2018 and Executive Director of the Texas Advanced Computing Center (TACC) since 2014, is a nationally recognized leader in high performance computing. He serves on the National Artificial...

Heidi Poxon

Principal Member of Technical Staff, AWS

Sara Campbell

Program Manager, DOE/NNSA

Jayesh Badwaik

HPC Software Engineer, Jülich Supercomputing Centre

Nur Fadel

Head of Scientific Computing Unit, CSCS

Andrew Jones

Engineering Leader, Future AI & HPC Capabilities, Microsoft

Doug Jacobsen

HPC Software Engineer, Google Cloud
Chicago River Ballroom

5:00pm CDT

Attendee Reception
Monday May 5, 2025 5:00pm - 7:00pm CDT
Ballroom Meeting Foyer

5:00pm CDT

Poster Sessions
Monday May 5, 2025 5:00pm - 7:00pm CDT
Featured Posters:

  • Porting Legacy Codes to Kokkos - Trévis Morvany, CEA
  • Designing a Usable Architecture for Geometric Particle-In-Cell Methods with AMReX - Emil Poulsen & Nils Schild, Max Planck Institute for Plasma Physics
  • Kubernetes Turbo Charging Productivity in Era of Accelerated Computing - Tobi Knaup, Nutanix
  • The Evolution of Virtualization: Adding some Xen to HPC - Cody Zuschlag, Vates
  • High-performance Phase-field Solver Based on AMReX Software Framework - Akash Shinde & Nasir Attar, Centre for Development of Advanced Computing 
  • HPSF Project Resource: Center for Open-Source Research Software Advancement (CORSA) - Daniel S. Katz, University of Illinois Urbana-Champaign & Elaine M. Raybourn, Sandia National Laboratories
  • PESO: Partnering for Scientific Software Ecosystem Stewardship Opportunities - James Willenbring, Sandia National Laboratories
  • Introduction to Charliecloud and its Weirdness - Reid Priedhorsky, Angelica Loshak & Megan Phinney, Los Alamos National Laboratory
  • Boosting ANUGA Performance by GPU Porting - Samir Shaikh & Harsha Ugave, Centre for Development of Advanced Computing (C-DAC), India
  • AI Governance in High-Performance Computing: Ensuring Compliance, Efficiency, and Security - Rohith Vangalla, Optum Services
  • DaggerMPI: Seamless MPI Operations and Scheduling - Felipe De Alcantara Tome, MIT
  • Compiler Dependencies in Spack v1.0 - Gregory Becker, LLNL 
  • The Kokkos Performance Portability EcoSystem - Christian Trott, Sandia National Laboratories; Damien Lebrun-Grandie, Oak Ridge National Laboratory; Luc Berger-Vergiat & Siva Rajamanickam, Sandia National Laboratories


Speakers

Daniel S. Katz

Chief Scientist, NCSA, University of Illinois Urbana-Champaign
Dan's interest is in the development and use of advanced cyberinfrastructure to solve challenging problems at multiple scales. This includes applications, algorithms, fault tolerance, and programming in parallel and distributed computing, including HPC, Grid, Cloud, etc., as well...

Reid Priedhorsky

Scientist, Los Alamos National Laboratory
I am a staff scientist at Los Alamos National Laboratory. Prior to Los Alamos, I was a research staff member at IBM Research. I hold a Ph.D. in computer science from the University of Minnesota and a B.A., also in computer science, from Macalester College. My work focuses on large-scale...

Elaine M. Raybourn

Principal Member of the Technical Staff, Sandia National Laboratories
Elaine M. Raybourn is a social scientist at Sandia National Laboratories. She has worked in the UK (British Telecom), Germany (Fraunhofer FIT), and France (INRIA) as a Fellow of the European Research Consortium in Informatics and Mathematics (ERCIM). She supports the DOE Office of...

Cody Zuschlag

Developer Relations Evangelist, Vates
With a clear focus on open-source solutions, Cody is deeply committed to shaping technology for the greater good. Cody has championed the benefits of full-stack and decentralized applications, underscoring the significance of open-source technologies. Through his work, presentations...

Megan Phinney

Scientist, Los Alamos National Laboratory

Angelica Loshak

Student, Los Alamos National Laboratory

Samir Shaikh

Scientist, Centre for Development of Advanced Computing (C-DAC)
Samir Shaikh is an HPC specialist at C-DAC, Pune, optimizing large-scale workloads, parallel computing, and system architecture. As a Scientist C, he enhances HPC performance for AI/ML, scientific computing, and NSM supercomputers. An IIT Guwahati M.Tech graduate, he has contributed...

Akash Shinde

Project Engineer, Center for Development of Advanced Computing
Akash Shinde is a Project Engineer at C-DAC, working as a scientific software developer.

Felipe De Alcantara Tome

Research Software Engineer, MIT
Felipe Tomé is a Brazilian Research Software Engineer passionate about high-performance computing (HPC), parallel computing, and scalable algorithms. At MIT, he contributed to Dagger.jl and DLA.jl, optimizing MPI, GPU acceleration, and numerical linear algebra. His research, published...

Harsha Ugave

HPC Engineer, Centre for Development of Advanced Computing (C-DAC), India
Harsha Ugave is an HPC Engineer at C-DAC Pune, specializing in performance portability, parallel computing, and system optimization. She plays a key role in deploying and tuning HPC applications under the National Supercomputing Mission (NSM). Her work ensures efficient execution...

James Willenbring

Senior Member of Technical Staff, Sandia National Laboratories
James M. Willenbring is a senior member of R&D Technical Staff in the Software Engineering and Research department at Sandia National Laboratories. His research interests include software sustainability and the application of software engineering methodologies for high-performance...

Nasir Attar

Project Engineer, Centre for Development of Advanced Computing
I’m Nasir Attar, and for the past three years, I’ve been with the Centre for Development of Advanced Computing (C-DAC), Pune, India. I lead the development of a high-performance phase-field solver using the AMReX software framework as part of a collaboration with leading Indian...

Nils Schild

PhD student, Max Planck Institute for Plasma Physics
After studying physics and working on solvers for sparse eigenvalue problems in quantum mechanics at the University of Bayreuth, he moved to the Max Planck Institute for Plasma Physics in Garching (Germany). During his Ph.D., he started implementing the software BSL6D, a solver for...

Rohith Vangalla

Lead Software Engineer, Optum Technologies (UnitedHealth Group)
I am Dr. Rohith Vangalla, a Lead Software Engineer at Optum Technologies, a subsidiary of UnitedHealth Group. I specialize in high-performance computing (HPC) and AI-driven healthcare solutions, focusing on scalable, cloud-native architectures and regulatory compliance. With a PhD...

Tobi Knaup

VP & GM Cloud Native, Nutanix
As GM Cloud Native at Nutanix, Tobi leads development of the company’s Kubernetes and Cloud Native portfolio, used by the likes of AWS and Microsoft Azure. A cloud-native pioneer, Tobi co-founded D2iQ – the leading independent Kubernetes company relied on by 30% of the Fortune...

Greg Becker

Software Developer, Lawrence Livermore National Laboratory

Damien Lebrun-Grandie

Senior Computational Scientist, Oak Ridge National Laboratory

Christian Trott

Distinguished Member of Technical Staff, Sandia National Laboratories
Christian Trott is a High Performance Computing expert at Sandia National Laboratories, where he co-leads the Kokkos core team, developing performance portability solutions for engineering and science applications. He heads Sandia's delegation to the ISO C++ committee and is a principal...

Trévis Morvany

Research engineer, CEA
Holder of a Master’s Degree in High Performance Computing and Simulation from Paris-Saclay University, Trévis Morvany joined the CExA team as a developer in January 2025.

Emil Poulsen

Post Doc, Max Planck Institute for Plasma Physics
Dr. Poulsen has more than six years of experience with scientific high performance computing in topics as diverse as quantum many-body physics, micromagnetism and plasma physics using mainly C++ and Fortran in combination with CUDA, MPI and OpenMP.

Luc Berger-Vergiat

Sandia National Laboratories

Siva Rajamanickam

Sandia National Laboratories
Chicago River Ballroom
 
Tuesday, May 6
 

8:30am CDT

Registration & Badge Pick-Up
Tuesday May 6, 2025 8:30am - 5:00pm CDT
Ballroom Meeting Foyer

9:00am CDT

Welcome & Overview of Day
Tuesday May 6, 2025 9:00am - 9:05am CDT
Chicago River Ballroom

9:05am CDT

Performance, Usability and Issues on Current Systems - Kevin Huck, University of Oregon & Chris Siefert, Sandia National Laboratories
Tuesday May 6, 2025 9:05am - 10:30am CDT
This session offers a comprehensive overview of the current landscape of exascale supercomputing technologies from the user perspective. Attendees will gain valuable insights into the nuances of transitioning between various system architectures and software stacks, focusing on usability, performance, and the common challenges faced by users. By exploring real-world experiences and best practices, participants will be better equipped to navigate the complexities of exascale computing environments, ultimately enhancing their ability to leverage these advanced technologies for their research and applications.
Speakers

Chris Siefert

R&D Staff, Sandia National Laboratories

Kevin Huck

Senior Research Associate, University of Oregon
Kevin Huck is a Senior Research Associate in the Oregon Advanced Computing Institute for Science and Society (OACISS) at the University of Oregon. He is interested in the unique problems of performance analysis of large HPC applications as well as automated methods for diagnosing...
Chicago River Ballroom

10:30am CDT

Coffee Break
Tuesday May 6, 2025 10:30am - 11:00am CDT

11:00am CDT

Panel Discussion: Processor Trends and What They Mean for Software - Speakers to be Announced
Tuesday May 6, 2025 11:00am - 12:30pm CDT
The rapid acceleration of processor innovation in recent years has introduced both opportunities and challenges for developers of high-performance software. This expert panel will delve into emerging hardware trends that are poised to shape the future of software development. Join us for an insightful discussion as we explore the implications of these advancements, the new challenges that lie ahead, and how our community can collaboratively address them. Gain valuable perspectives on what to expect in the evolving landscape of high-performance computing and how to adapt our strategies for success.
Moderators

Christian Trott

Distinguished Member of Technical Staff, Sandia National Laboratories
Christian Trott is a High Performance Computing expert at Sandia National Laboratories, where he co-leads the Kokkos core team, developing performance portability solutions for engineering and science applications. He heads Sandia's delegation to the ISO C++ committee and is a principal...
Chicago River Ballroom

12:30pm CDT

Lunch (Provided for Attendees)
Tuesday May 6, 2025 12:30pm - 2:00pm CDT
Atrium

2:00pm CDT

Working Group Breakouts
Tuesday May 6, 2025 2:00pm - 3:00pm CDT
Join our working group breakout sessions for an interactive opportunity to engage in face-to-face discussions with fellow community members. Discover the ongoing activities of each working group and contribute to shaping future initiatives. Topics include continuous integration, benchmarking, community outreach, and efforts to enhance software project interoperability. Brief reports from each breakout will summarize key insights and activities, ensuring that the entire community stays informed and connected. Your participation is vital in driving collaboration and innovation within our community!
Chicago River Ballroom

3:00pm CDT

Working Group Breakout Reports
Tuesday May 6, 2025 3:00pm - 3:25pm CDT
Chicago River Ballroom

3:25pm CDT

Coffee Break
Tuesday May 6, 2025 3:25pm - 3:50pm CDT

3:50pm CDT

HPSF Community BOF
Tuesday May 6, 2025 3:50pm - 5:20pm CDT
Join us for an engaging Birds of a Feather (BoF) session focused on the current state and future direction of the HPSF community. Key community leaders will discuss successes and identify areas for improvement, fostering an open dialogue about our collective vision. This interactive session will provide ample opportunities for audience questions and feedback, allowing you to contribute your insights. Together, let’s shape the HPSF community into a valuable resource that meets the needs of all its members!
Chicago River Ballroom

5:20pm CDT

Closing Remarks
Tuesday May 6, 2025 5:20pm - 5:35pm CDT
Chicago River Ballroom
 
Wednesday, May 7
 

8:30am CDT

Registration & Badge Pick-Up
Wednesday May 7, 2025 8:30am - 4:30pm CDT
Ballroom Meeting Foyer

9:00am CDT

Welcome and Overview - Todd Gamblin, Lawrence Livermore National Laboratory
Wednesday May 7, 2025 9:00am - 9:10am CDT
Speakers

Todd Gamblin

Distinguished Member of Technical Staff, Lawrence Livermore National Laboratory
Salon E-G

9:00am CDT

Numerical Relativity with AMReX - Miren Radia, University of Cambridge, Lawrence Berkeley National Laboratory
Wednesday May 7, 2025 9:00am - 9:20am CDT
Einstein’s theory of General Relativity revolutionised Physics over a century ago. Despite this, the number of known analytical solutions to the equations, particularly in the dynamical strong-field case, is very small. Numerical relativity is often the only tool that can be used to investigate this regime.
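For reference (standard background, not taken from this session's materials): the equations in question are the Einstein field equations, which in geometrized units (G = c = 1) read

G_{\mu\nu} + \Lambda g_{\mu\nu} = 8\pi T_{\mu\nu}

where G_{\mu\nu} is the Einstein tensor, \Lambda the cosmological constant, and T_{\mu\nu} the stress-energy tensor. Numerical relativity recasts these as a constrained initial-value problem and evolves the spacetime metric numerically on a grid.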
Speakers

Miren Radia

Research Software Engineer, University of Cambridge
Rock River 1 & 2

9:00am CDT

The History of Apptainer: One Dave's Perspective - Dave Godlove, CIQ
Wednesday May 7, 2025 9:00am - 9:20am CDT
Mississippi River 1 & 2

9:00am CDT

Introduction to Containers and Charliecloud - Reid Priedhorsky, Los Alamos National Laboratory
Wednesday May 7, 2025 9:00am - 9:40am CDT
Charliecloud is a lightweight, fully unprivileged, open source container implementation for HPC applications. It can handle the entire container workflow, including building images, pushing/pulling to registries, and dealing with accelerators like GPUs. We take a unique approach to containerization while remaining compatible with the broader container ecosystem where it matters.

Notable distinctions include: fully unprivileged end-to-end workflow, small code base with minimal dependencies, delegation of all security boundaries, layer-free build cache, zero-consistency root emulation based on seccomp, and a spirit of openness and public service.

This talk will motivate containers for HPC applications, then explain Charliecloud’s design philosophy and how we address these needs. We’ll discuss Charliecloud’s “weirdness” when compared to other implementations (i.e., its design philosophy and key distinctions), provide a brief overview of the code’s structure, and summarize recent and upcoming changes of interest.

LA-UR-25-22140
Speakers

Reid Priedhorsky

Scientist, Los Alamos National Laboratory
I am a staff scientist at Los Alamos National Laboratory. Prior to Los Alamos, I was a research staff member at IBM Research. I hold a Ph.D. in computer science from the University of Minnesota and a B.A., also in computer science, from Macalester College. My work focuses on large-scale...
Illinois River

9:00am CDT

Updates from the Kokkos Team
Wednesday May 7, 2025 9:00am - 10:20am CDT
1. Welcome and overview, Damien Lebrun-Grandie, Oak Ridge National Laboratory (10 minutes)

2. Update on the Ecosystem and Community, Christian Trott, Sandia National Laboratories (20 minutes)

3. Kokkos Core update, Damien Lebrun-Grandie, Oak Ridge National Laboratory (30 minutes)

4. Kokkos-Kernels update, Luc Berger-Vergiat, Sandia National Laboratories (20 minutes)
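For attendees new to the ecosystem, here is a minimal sketch of the core Kokkos pattern that these updates build on (standard Kokkos Core API; the array size and kernel labels are arbitrary choices for illustration):

#include <Kokkos_Core.hpp>
#include <cstdio>

int main(int argc, char* argv[]) {
  Kokkos::initialize(argc, argv);
  {
    const int N = 1000;
    // A View is a multidimensional array allocated in the default memory space.
    Kokkos::View<double*> x("x", N);

    // parallel_for dispatches the lambda to the default execution space
    // (e.g. CUDA, HIP, SYCL, OpenMP), chosen when Kokkos is configured.
    Kokkos::parallel_for("fill", N, KOKKOS_LAMBDA(const int i) {
      x(i) = 2.0 * i;
    });

    // parallel_reduce performs a portable reduction into sum.
    double sum = 0.0;
    Kokkos::parallel_reduce("sum", N, KOKKOS_LAMBDA(const int i, double& acc) {
      acc += x(i);
    }, sum);

    std::printf("sum = %f\n", sum);
  }
  Kokkos::finalize();
  return 0;
}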
Speakers

Luc Berger-Vergiat

Sandia National Laboratories

Damien Lebrun-Grandie

Senior Computational Scientist, Oak Ridge National Laboratory

Christian Trott

Distinguished Member of Technical Staff, Sandia National Laboratories
Christian Trott is a High Performance Computing expert at Sandia National Laboratories, where he co-leads the Kokkos core team, developing performance portability solutions for engineering and science applications. He heads Sandia's delegation to the ISO C++ committee and is a principal...
Salon A-C

9:10am CDT

State of the Spack Community, Todd Gamblin, Lawrence Livermore National Laboratory
Wednesday May 7, 2025 9:10am - 9:45am CDT
Speakers

Todd Gamblin

Distinguished Member of Technical Staff, Lawrence Livermore National Laboratory
Salon E-G

9:20am CDT

Cosmological Discoveries Enabled by the Nyx Code - Zarija Lukic, Lawrence Berkeley National Laboratory
Wednesday May 7, 2025 9:20am - 9:40am CDT
Speakers

Zarija Lukic

Staff Scientist and Group Lead, Lawrence Berkeley National Laboratory
Rock River 1 & 2

9:20am CDT

Apptainer in Lima - Anders Björklund, Cloud Native Computing Foundation
Wednesday May 7, 2025 9:20am - 9:40am CDT
Speakers

Anders Björklund

Cloud Native Computing Foundation
Mississippi River 1 & 2

9:40am CDT

Quokka: A New Multiphysics Code for Star Formation and Astrophysics - Ben Wibking, Michigan State University
Wednesday May 7, 2025 9:40am - 10:00am CDT
I will describe Quokka, a new AMR code for astrophysics, aimed at problems in star formation and galaxy formation. Quokka is a Newtonian radiation hydrodynamics code with support for particles, chemistry, self-gravity, and (soon) magnetic fields based on a method-of-lines formulation. We use AMReX's unique capabilities to enable many of these physics features, from AMReX's OpenBC Poisson solver for self-gravity, to AMReX-Astro Microphysics for the cell-by-cell chemistry network integration with symbolic code generation. I will talk about our current code development focused on particles and a constrained-transport implementation of magnetohydrodynamics, and briefly mention applications in simulating galactic winds and the formation of the first stars.
Speakers

Ben Wibking

Research Associate, Michigan State University
Rock River 1 & 2

9:40am CDT

Using Apptainer in a Pilot-based Distributed Workload - Marco Mambelli, Fermi National Accelerator Laboratory
Wednesday May 7, 2025 9:40am - 10:00am CDT
GlideinWMS is a pilot- and pressure-based workload manager for distributed scientific computing. Many experiments, such as CMS and Fermilab’s neutrino experiments, use it to provision elastic clusters for their analysis and simulations, split into close to a million concurrent jobs. The majority of the user jobs require containers, and the pilots use Apptainer to set up the desired platform. For the pilots, which run as regular batch jobs, Apptainer is safer, lighter, and easier to use than other containerization solutions. Many images used by the pilots are expanded SIF images distributed via CernVM-FS; this combination is very efficient.
Speakers

Marco Mambelli

Fermi National Accelerator Laboratory
Mississippi River 1 & 2

9:40am CDT

Charliecloud Workshop: Key Workflow Operation - Pull - Megan Phinney, Los Alamos National Laboratory
Wednesday May 7, 2025 9:40am - 10:00am CDT
This workshop will provide participants with background and hands-on experience to use basic Charliecloud containers for HPC applications. Participants will build toy containers and a real HPC application, and then run them in parallel on a supercomputer. This will be a highly interactive workshop with lots of Q&A.

This section will walk through the key workflow operation, pull.

LA-UR-25-22140
Speakers

Megan Phinney

Scientist, Los Alamos National Laboratory
Illinois River

9:45am CDT

Spack v1.0 - Greg Becker, Lawrence Livermore National Laboratory
Wednesday May 7, 2025 9:45am - 10:20am CDT
Speakers

Greg Becker

Software Developer, Lawrence Livermore National Laboratory
Salon E-G

10:00am CDT

Leveraging SUNDIALS to Accelerate AMReX Simulations - Andy Nonaka, Lawrence Berkeley National Laboratory
Wednesday May 7, 2025 10:00am - 10:20am CDT
In this talk, I discuss the role of the SUNDIALS (SUite of Nonlinear and DIfferential/ALgebraic equation Solvers) package in three different AMReX-based applications. In this context, SUNDIALS provides support for the overall temporal integration scheme of the model equations. We are particularly interested in the multirate infinitesimal (MRI) schemes, where different physical processes can be advanced using different time steps. Our initial application is in the context of combustion, where we show that we are able to achieve increased accuracy over our traditional spectral deferred corrections approach with greater computational efficiency. Our second application is in micromagnetic memory and storage devices, where we demonstrate efficiency by leveraging the non-stiff nature of the computationally expensive demagnetization processes. Our final application is low-power ferroelectric transistors, where we also effectively leverage the non-stiff nature of the computationally expensive Poisson process. In each case we show significant computational savings over our baseline codes.
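As brief background (the standard multirate formulation, not taken from the talk itself): MRI schemes partition the right-hand side into slow and fast components,

y'(t) = f^S(y) + f^F(y)

advancing f^S with a large step H while an inner method integrates f^F with substeps h << H, so that expensive but non-stiff processes can be placed in the slow partition and evaluated far less often.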
Speakers

Andy Nonaka

Staff Scientist and CCSE Group Lead, Lawrence Berkeley National Laboratory
Rock River 1 & 2

10:00am CDT

Creating Apptainer Workflows with Docker-Compose-like Utilities - Brandon Biggs, Idaho National Laboratory
Wednesday May 7, 2025 10:00am - 10:20am CDT
Speakers

Brandon Biggs

Idaho National Laboratory
Mississippi River 1 & 2

10:00am CDT

Charliecloud Workshop: Containers Are Not Special - Alpine via Tarball - Megan Phinney, Los Alamos National Laboratory
Wednesday May 7, 2025 10:00am - 10:20am CDT
This workshop will provide participants with background and hands-on experience to use basic Charliecloud containers for HPC applications. Participants will build toy containers and a real HPC application, and then run them in parallel on a supercomputer. This will be a highly interactive workshop with lots of Q&A.

This section will explain that containers are not special by running an Alpine container via tarball.

LA-UR-25-22140
Speakers

Megan Phinney

Scientist, Los Alamos National Laboratory
Illinois River

10:20am CDT

Coffee Break
Wednesday May 7, 2025 10:20am - 10:45am CDT

10:45am CDT

ExaEpi: Agent-Based Modeling for Epidemiology using AMReX - Andrew Myers, Lawrence Berkeley National Laboratory
Wednesday May 7, 2025 10:45am - 11:05am CDT
The adaptive mesh and particle capabilities of AMReX have been used to implement a wide range of numerical methods, targeting combustion, plasma physics, earth systems modeling, cosmology, and more. This talk will show how they can also be used to implement a quite different type of algorithm: an agent-based model (ABM) for the spread of respiratory diseases called ExaEpi. ABMs are valuable because they provide a fundamental and natural description of the system and are able to capture emergent phenomena. However, their use in forecasting and control is limited by the difficulty in calibrating and quantifying the uncertainty associated with a large number of parameters. By leveraging AMReX, ExaEpi can help address these limitations by enabling many large ensembles to run quickly on exascale compute facilities. 
Speakers

Andrew Myers

Computer Systems Engineer, Lawrence Berkeley National Laboratory
Rock River 1 & 2

10:45am CDT

Breaking Barriers in HPC: How Apptainer is Changing Software Deployment in NSM - Parikshit Ardhapurkar & Samir Shaikh, Centre for Development of Advanced Computing
Wednesday May 7, 2025 10:45am - 11:05am CDT
Managing software on supercomputers is challenging due to diverse architectures, dependency complexities, and security constraints. Traditional tools like Spack help with software compilation, but they lack portability and require system-specific configurations. Apptainer (formerly Singularity) solves these issues by enabling secure, portable, and reproducible containerized environments tailored for HPC. Unlike Docker, Apptainer runs without root access, ensuring security and compliance in multi-user clusters. It supports MPI, GPUs, and Slurm, allowing high-performance workloads to run seamlessly across systems. By integrating Apptainer into NSM supercomputers, researchers benefit from faster deployments, reproducible results, and reduced administrative overhead. This session explores Apptainer’s role in transforming HPC workflows, its advantages over traditional software management, and best practices for adopting containers in large-scale computing environments.
Speakers

Parikshit Ardhapurkar

HPC Engineer, Centre for Development of Advanced Computing, Pune
Parikshit Ardhapurkar is an HPC Engineer at C-DAC, India, specializing in parallel computing, software optimization, and performance tuning. He plays a key role in deploying Apptainer for secure and portable HPC environments under the National Supercomputing Mission (NSM). With expertise...

Samir Shaikh

Scientist, Centre for Development of Advanced Computing (C-DAC)
Samir Shaikh is an HPC specialist at C-DAC, Pune, optimizing large-scale workloads, parallel computing, and system architecture. As a Scientist C, he enhances HPC performance for AI/ML, scientific computing, and NSM supercomputers. An IIT Guwahati M.Tech graduate, he has contributed...
Mississippi River 1 & 2

10:45am CDT

Charliecloud Workshop: Key Workflow Operation - Build from Dockerfile - Megan Phinney, Los Alamos National Laboratory
Wednesday May 7, 2025 10:45am - 11:05am CDT
This workshop will provide participants with background and hands-on experience to use basic Charliecloud containers for HPC applications. Participants will build toy containers and a real HPC application, and then run them in parallel on a supercomputer. This will be a highly interactive workshop with lots of Q&A.

This section will walk through the key workflow operation, building from a Dockerfile.

LA-UR-25-22140
Speakers

Megan Phinney

Scientist, Los Alamos National Laboratory
Illinois River

10:45am CDT

Optimizing Spack: Multi-Package Parallel Builds for Faster Installation - Kathleen Shea, Lawrence Livermore National Laboratory
Wednesday May 7, 2025 10:45am - 11:05am CDT
Spack 1.0 will feature faster builds through multi-package parallelism. In this talk, I’ll describe how I accelerated Spack’s package installation process by parallelizing its main installer loop. By enabling Spack to spawn multiple package builds concurrently, this feature increases available parallelism and significantly reduces overall build times for multi-package installations. I'll talk about the design decisions, tradeoffs, and performance gains achieved, providing valuable lessons for optimizing package builds on platforms from laptops to large-scale HPC environments. I’ll also talk about how, even as a relatively new Spack user, I was able to come up to speed quickly and contribute meaningful improvements to the project.
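To illustrate the idea (a conceptual sketch only, with invented package names and a stand-in build step — not Spack's actual installer code): once a package's dependencies have finished, its build can be launched without waiting for unrelated packages.

#include <chrono>
#include <cstdio>
#include <future>
#include <map>
#include <string>
#include <thread>
#include <vector>

struct Package {
  std::string name;
  std::vector<std::string> deps;
};

// Stand-in for compiling and installing one package.
void build(const std::string& name) {
  std::printf("building %s...\n", name.c_str());
  std::this_thread::sleep_for(std::chrono::milliseconds(100));
}

int main() {
  // Toy dependency graph: openssl and curl each depend only on zlib,
  // so they can build in parallel once zlib is done.
  std::vector<Package> pkgs = {
      {"zlib", {}},
      {"openssl", {"zlib"}},
      {"curl", {"zlib"}},
      {"cmake", {"openssl", "curl"}},
  };

  std::map<std::string, std::shared_future<void>> done;
  for (const auto& p : pkgs) {  // pkgs is listed in dependency order
    std::vector<std::shared_future<void>> waits;
    for (const auto& d : p.deps) waits.push_back(done.at(d));
    done[p.name] = std::async(std::launch::async,
                              [name = p.name, waits] {
                                for (auto w : waits) w.wait();  // wait for deps only
                                build(name);
                              })
                       .share();
  }
  for (auto& [pkg, fut] : done) fut.wait();  // wait for all builds to finish
  return 0;
}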

Speakers

Kathleen Shea

Software Developer, Lawrence Livermore National Laboratory
Kathleen Shea graduated from Colorado College with a Bachelor of Arts in Computer Science in 2024. She then started her career at Lawrence Livermore National Lab, contributing to both the Center for Applied Scientific Computing and Livermore Computing. She specializes in core feature development...
Salon E-G

10:45am CDT

Kokkos in Applications
Wednesday May 7, 2025 10:45am - 12:05pm CDT
1. FleCSI Applications, Ben Bergen & Hyun Lim, Los Alamos National Laboratory (10 minutes)
The Flexible Computational Science Infrastructure (FleCSI) programming system provides a clutter-free environment that allows developers to focus on the arithmetic operations of their methods without the distraction of computer science details that are often visible in legacy simulation codes. To this end, FleCSI provides lightweight wrappers over the raw Kokkos interface that resemble native C++ keywords, e.g., forall. Using this design philosophy, we have been able to evolve our support to cover various Kokkos policies and execution spaces. HARD is a FleCSI-based application for radiation hydrodynamics that is performance portable across a variety of systems, e.g., El Capitan, Venado, and Crossroads, and inherits FleCSI’s support for multiple distributed-memory and tasking backends, e.g., Legion, HPX, and MPI. In this talk, we will demonstrate the basic data-parallel interface with implementation and usage examples. We will also present results for several test problems in inertial confinement fusion with comparisons between different backends and performance assessments in different heterogeneous computing environments.

2. DDC: A Performance Portable Library Abstracting Computation on Discrete Domains, Thomas Padioleau, CEA Paris-Saclay (10 minutes)
The Discrete Domain Computation (DDC) library is a modern C++ library that aims to offer the C++ world an equivalent of Python's xarray.DataArray. The Xarray library introduces labeled multidimensional arrays, enabling more intuitive data manipulation by associating dimensions with user-provided names rather than relying on positional indexing. This approach simplifies indexing, slicing, and broadcasting while reducing common indexing errors. Inspired by these ideas, DDC extends the Kokkos library, providing zero-overhead dimension labeling for multidimensional arrays along with performance-portable multidimensional algorithms. This labeling mechanism enables compile-time detection of indexing and slicing errors, ensuring safer and more expressive array operations in C++ (see the sketch after this list). In this presentation, we will introduce the core concepts of DDC and demonstrate its usage through a simple example that highlights its key features.

3. TChem-atm - A Performance Portable Chemistry Solver for Atmospheric Chemistry, Oscar Diaz-Ibarra, Sandia National Laboratories (20 minutes)
TChem-atm (https://github.com/PCLAeroParams/TChem-atm) is a performance-portable software library designed to support atmospheric chemistry applications, specifically computing source term Jacobian matrices. The software utilizes Kokkos as its portability layer, preparing it for next-generation computing architectures. The software interface employs a hierarchical parallelism design to leverage the massive parallelism available on modern computing platforms, including model parallelism, batch parallelism, and nested parallelism for each problem instance. Additionally, TChem-atm is designed to be coupled with third-party libraries that may be used to advance the state of gas and particle species over time, notably interfacing with the Tines, Kokkos-kernels, and Sundials libraries. We have tested TChem-atm in two scenarios: using a typical reaction mechanism in atmospheric science and an example involving multiple aerosol particles. This testing framework allows us to evaluate our code by varying the number of evaluations and the size of the source term (right-hand side). Finally, we report performance measurements using the CUDA, HIP, and OpenMP back ends.

4. GPU Porting of the TRUST CFD Platform with Kokkos, Rémi Bourgeois, French Atomic Energy Commission (CEA) (20 minutes)
TRUST is a High Performance Computing thermohydraulic platform for Computational Fluid Dynamics developed at the French Atomic Energy Commission (CEA). This software is designed for massively parallel (MPI) simulations of conduction, incompressible single-phase, and Low Mach Number (LMN) flows with a Weakly-Compressible multi-species solver and compressible multi-phase flows. It is used as the basis for many specialised applications in the nuclear and new energy fields across CEA. The code is being progressively ported to support GPU acceleration (Nvidia/AMD/Intel) thanks to the Kokkos library, as it is one of the demonstrators of the CExA project. In this talk, we will go over our experience using Kokkos to progressively port our large code base. We will cover the GPU features we have enabled and their performance. We will mention some of the difficulties we encountered, as well as the strategies we had to adopt, which sometimes differ from standard good practices due to the specifics of our application.

5. Omega: Towards a Performance-portable Ocean Model using Kokkos, Maciej Waruszewski, Sandia National Laboratories (20 minutes)
High-resolution simulations of the Earth system require resources available only on the world's largest supercomputers, which are increasingly based on GPUs. However, CPU-based systems are still frequently used to conduct simulations at coarse resolutions. To be able to take advantage of all compute platforms, we are developing Omega: the Ocean Model for E3SM Global Applications, a new ocean model written in C++ using Kokkos for performance portability. Omega will replace MPAS-Ocean to become the new ocean component of the DOE’s Energy Exascale Earth System Model (E3SM). Omega is an unstructured mesh ocean model based on the same finite-volume scheme as the current ocean component. Work on Omega began in 2023. Currently, Omega is a layered shallow water model with passive tracers. While still simple, this initial version can run on realistic size meshes and contains computational kernels representative of the full model horizontal numerics. After briefly describing Omega, this talk will go into our experiences with Kokkos and present initial performance results from a variety of compute platforms.
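To make the dimension-labeling idea from the DDC talk above concrete, here is a tiny self-contained sketch (types invented for illustration; this is not DDC's actual API): labels carried in the type system turn index-order mistakes into compile-time errors.

#include <array>
#include <cstddef>
#include <iostream>

struct DimX {};  // label for the x dimension
struct DimY {};  // label for the y dimension

// An index that carries its dimension as a compile-time label.
template <class Dim>
struct Index {
  std::size_t value;
};

// A 2D field whose accessor demands correctly labeled indices.
template <std::size_t NX, std::size_t NY>
class Field {
  std::array<double, NX * NY> data_{};

 public:
  double& operator()(Index<DimX> ix, Index<DimY> iy) {
    return data_[ix.value * NY + iy.value];
  }
};

int main() {
  Field<4, 3> f;
  f(Index<DimX>{1}, Index<DimY>{2}) = 42.0;
  // f(Index<DimY>{2}, Index<DimX>{1}) = 0.0;  // swapped labels: does not compile
  std::cout << f(Index<DimX>{1}, Index<DimY>{2}) << "\n";
  return 0;
}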
Speakers

Ben Bergen

Scientist, Los Alamos National Laboratory
Ben Bergen is a computational scientist working on runtime systems, data structures, and applications development.

Hyun Lim

Scientist, Los Alamos National Laboratory
Hyun Lim is a staff scientist in CCS-7. Hyun has a background in theoretical and computational astrophysics, gravitational physics, and numerical methods.

Maciej Waruszewski

R&D Computer Science, Sandia National Laboratories
Maciej is a computer scientist at Sandia National Laboratories. He is one of the developers of the DOE’s Energy Exascale Earth System Model (E3SM). He holds a PhD in atmospheric physics from the University of Warsaw.

Oscar Diaz-Ibarra

Senior member of the technical staff, Sandia National Laboratories
Oscar is a senior member of the technical staff at Sandia National Laboratories, specializing in high-performance applications for atmospheric chemistry using Kokkos and modern C++. He holds a Ph.D. in chemical engineering from the University of Utah and has over 7 years of experience...

Rémi Bourgeois

Researcher / Engineer, French Atomic Energy Commission (CEA)
Rémi Bourgeois is a French researcher/engineer at CEA Saclay, specializing in HPC and numerical analysis for the TRUST platform, a massively parallel thermo-hydraulic simulation tool. He earned his PhD at CEA, focusing on MHD convection, developing finite-volume methods and GPU-based...

Thomas Padioleau

Engineer-Researcher, CEA
Dr. Thomas Padioleau is a CEA Engineer-Researcher at Maison de la Simulation. He leads the DDC project and also works on Voice++.
Salon A-C

11:05am CDT

Updates on the WarpX Project: Experiences with AMReX, Lessons Learned, and Future Challenges - Edoardo Zoni, Lawrence Berkeley National Laboratory
Wednesday May 7, 2025 11:05am - 11:25am CDT
The WarpX project is advancing the modeling and simulation of a wide range of physics applications, including particle accelerators and nuclear fusion, through high-performance scientific computing. This talk will provide a comprehensive update on WarpX, focusing on our experiences with the AMReX software framework. We will highlight key achievements, share valuable lessons learned, and discuss the future challenges we anticipate. The presentation will cover the critical role of AMReX in enabling high performance and portability, showcase significant milestones, and provide insights into the challenges faced and solutions developed.
Speakers

Edoardo Zoni

Research Software Engineer, Lawrence Berkeley National Laboratory
Rock River 1 & 2

11:05am CDT

Leveraging Apptainer for Scalable and Secure Distributed Computing - Om Jadhav & Vamshi Krishna, C-DAC India
Wednesday May 7, 2025 11:05am - 11:25am CDT
This session delves into the advanced utilization of Apptainer (formerly Singularity) for distributed computing in high-performance computing (HPC) environments. It will cover containerized execution methodologies, security enhancements, and scalability optimizations critical for scientific computing. The discussion will emphasize MPI integration, multi-node orchestration, and performance considerations for HPC workloads within Apptainer containers. Researchers and scientists will gain a comprehensive understanding of deploying, managing, and optimizing containerized scientific applications across distributed HPC infrastructures.
Speakers

Om Jadhav

Scientist, C-DAC, India
Mr. Om Jadhav is positioned as Scientist-D, C-DAC, Ministry of Electronics and Information Technology, Government of India. He is associated with the HPC-Technologies team, C-DAC Pune. His areas of expertise include HPC application optimization and management on HPC clusters. He has...

Vamshi Krishna

HPC Application Expert, C-DAC India
Dr. Vamshi Krishna, an HPC Application Expert & Scientist, has 20+ years of experience in HPC, Embedded Systems, Robotics, IoT, and AI. He plays a key role in deploying supercomputers across India under the NSM initiative, with 15 systems deployed and 9 more planned using indigenous...
Mississippi River 1 & 2

11:05am CDT

Charliecloud Workshop: Key Workflow Operation - Push - Megan Phinney, Los Alamos National Laboratory
Wednesday May 7, 2025 11:05am - 11:25am CDT
This workshop will provide participants with background and hands-on experience to use basic Charliecloud containers for HPC applications. Participants will build toy containers and a real HPC application, and then run them in parallel on a supercomputer. This will be a highly interactive workshop with lots of Q&A.

This section will walk through the key workflow operation, push.

LA-UR-25-22140
Speakers

Megan Phinney

Scientist, Los Alamos National Laboratory
Illinois River

11:05am CDT

Fast Binary Installation with Spack Splicing - John Gouwar, Northeastern University
Wednesday May 7, 2025 11:05am - 11:25am CDT
Binary package managers allow for the fast installation of binary artifacts, but limit configurability to ensure compatibility between binaries due to rigid ABI requirements. Source package managers allow for more flexibility in building software, since binaries are compiled on demand, but compilation can take a considerable amount of time. Spack has an existing mechanism for mixing source and precompiled packages; however, because Spack does not model ABI compatibility between packages, all transitive dependencies of a binary package must have been built at the same time as that package in order to maintain ABI compatibility. We present an extension to Spack, which we call splicing, that models ABI compatibility in the package ecosystem and allows seamless mixing of source and binary distribution of packages. This extension augments both the packaging language and dependency resolution engine of Spack in order to maximize binary reuse while maintaining the flexibility of source-based management. Through empirical evaluation, we show that our extension incurs minimal performance overhead to dependency resolution while greatly extending the modeling capability of Spack.
Speakers

John Gouwar

Doctoral Student, Northeastern University
John Gouwar is a doctoral student at the Khoury College of Computer Sciences at Northeastern University, advised by Arjun Guha. His doctoral research, which he began in 2021 and expects to complete in 2026, focuses on programming languages and package management. Gouwar is broadly...
Salon E-G

11:25am CDT

Using AMReX’s Embedded Boundaries to Support MFIX-Exa’s Geometry Capabilities - Deepak Rangarajan, Battelle Memorial Institute
Wednesday May 7, 2025 11:25am - 11:45am CDT
Speakers

Deepak Rangarajan

Data Scientist, Battelle Memorial Institute
Rock River 1 & 2

11:25am CDT

Recent Apptainer Developments - Dave Dykstra, Fermi National Accelerator Laboratory
Wednesday May 7, 2025 11:25am - 11:45am CDT
The lead developer of Apptainer will discuss recent changes to Apptainer.
Speakers

Dave Dykstra

Fermi National Accelerator Laboratory
Mississippi River 1 & 2

11:25am CDT

Charliecloud Workshop: MPI Hello World - Megan Phinney, Los Alamos National Laboratory
Wednesday May 7, 2025 11:25am - 11:45am CDT
This workshop will provide participants with background and hands-on experience to use basic Charliecloud containers for HPC applications. Participants will build toy containers and a real HPC application, and then run them in parallel on a supercomputer. This will be a highly interactive workshop with lots of Q&A.

This section will include a multi-node MPI hello world example.
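For reference, the classic MPI hello world (standard MPI C API; not necessarily the exact example used in the workshop):

#include <mpi.h>
#include <stdio.h>

int main(int argc, char* argv[]) {
  MPI_Init(&argc, &argv);

  int rank, size;
  MPI_Comm_rank(MPI_COMM_WORLD, &rank);  // this process's rank
  MPI_Comm_size(MPI_COMM_WORLD, &size);  // total number of ranks

  char host[MPI_MAX_PROCESSOR_NAME];
  int len;
  MPI_Get_processor_name(host, &len);    // node name, showing multi-node spread

  printf("hello from rank %d of %d on %s\n", rank, size, host);

  MPI_Finalize();
  return 0;
}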

LA-UR-25-22140
Speakers

Megan Phinney

Scientist, Los Alamos National Laboratory
Illinois River

11:25am CDT

Spack on Windows - John Parent, Kitware, Inc.
Wednesday May 7, 2025 11:25am - 11:45am CDT
While primarily run, tested, and developed on Unix-like operating systems, in particular HPC systems, scientific software is often deployed on Windows. Spack's recent expansion to Windows marks a significant milestone, enabling its powerful package management capabilities on a new platform. This talk will provide technical insight into the process of adapting Spack to support a platform orthogonal in design to anything Spack has supported in the past. We cover the path to initial support, the current state of Spack on Windows, and look at the roadmap for future Windows development. We’ll explore unique challenges supporting Windows and their solutions. This talk will address design, new features introduced to support Windows development, and how Spack brings needed robust package management to the Windows ecosystem. We’ll cover best practices for porting new or existing packages to Windows, deploying and managing Spack environments in real-world scenarios, and standardizing your Windows development workflows with a focus on common pitfalls for Windows developers. This talk will provide a pathway for attendees interested in supporting Windows to go forth and Spack!

Speakers

John Parent

Senior R&D Engineer, Kitware, Inc.
John Parent is a senior research and development engineer on the Software Solutions Team at Kitware, Inc., where he is the primary developer of the Spack package manager’s Windows support. His other work covers contributions to CMake, establishing complex CI systems, and C++/Python...
Salon E-G

11:45am CDT

Solid Mechanics in Alamo/AMReX: From Optimized Trusses to Burning Propellants - Brandon Runnels, Iowa State University
Wednesday May 7, 2025 11:45am - 12:05pm CDT
The phase field (PF) method is used to simulate a wide range of problems in mechanics, ranging from crack propagation to topology optimization. As a diffuse interface method, PF requires AMR to be computationally feasible, but few PF methods leverage block-structured AMR. Alamo is an AMReX-based code designed to solve phase field equations with implicit elastic solves. It features a variety of phase field methods, material models, and a strong-form nonlinear elastic solver based on AMReX's MLMG. In this talk, we give a high-level overview of some of the applications of Alamo, including deflagration of solid rocket propellant, topology optimization of mechanical structures, phase field fracture, microstructure evolution, and solid-fluid interaction.
Speakers
BR

Brandon Runnels

Associate Professor of Aerospace Engineering, Iowa State University
Wednesday May 7, 2025 11:45am - 12:05pm CDT
Rock River 1 & 2

11:45am CDT

Session to be Announced
Wednesday May 7, 2025 11:45am - 12:05pm CDT
Wednesday May 7, 2025 11:45am - 12:05pm CDT
Mississippi River 1 & 2

11:45am CDT

Charliecloud Workshop: Wrap-up - Megan Phinney, Los Alamos National Laboratory
Wednesday May 7, 2025 11:45am - 12:05pm CDT
This workshop will provide participants with background and hands-on experience to use basic Charliecloud containers for HPC applications. Participants will build toy containers and a real HPC application, and then run them in parallel on a supercomputer. This will be a highly interactive workshop with lots of Q&A.

This section will wrap-up the workshop with final Q&A.

LA-UR-25-22140
Speakers

Megan Phinney

Scientist, Los Alamos National Laboratory
Wednesday May 7, 2025 11:45am - 12:05pm CDT
Illinois River

11:45am CDT

Spack CI: Past, Present, and Future - Ryan Krattiger, Kitware, Inc.
Wednesday May 7, 2025 11:45am - 12:05pm CDT
Spack's Continuous Integration (CI) system is essential for building and distributing reliable HPC software packages. It has dramatically scaled from building hundreds to hundreds of thousands of packages weekly. Leveraging GitLab, Spack CI has evolved from a Linux-only build system to orchestrating runners across diverse providers, architectures, and platforms, supporting multiple domain stacks like E4S, HEP, and RADIUSS. Through enhanced monitoring and data-driven methodologies, including machine learning, Spack CI has gained insights into optimizing resource allocation and analyzing failure modes. Improvements to binary caching reduce storage and prevent race conditions. The core goal is to maintain a resilient and efficient CI ecosystem, ensuring the reliability of Spack's HPC software for its expanding community.
Speakers

Ryan Krattiger

Senior Research Engineer, Kitware, Inc.
I am a CFD researcher turned software solutions engineer. I started at a private CFD company, building out HPC frameworks for handling multi-physics and FSI simulations to run on heterogeneous systems, built CI and testing workflows out of necessity, and became an HPC build systems... Read More →
Wednesday May 7, 2025 11:45am - 12:05pm CDT
Salon E-G

12:05pm CDT

Lunch (Provided for Attendees)
Wednesday May 7, 2025 12:05pm - 1:35pm CDT
Wednesday May 7, 2025 12:05pm - 1:35pm CDT
Atrium

1:35pm CDT

In Situ Integration of AMReX with Python Ecosystem via pyAMReX - Bhargav Siddani, Lawrence Berkeley National Laboratory
Wednesday May 7, 2025 1:35pm - 1:55pm CDT
The past decade has seen a rapid increase in the development and use of Python-based open-source libraries for data-driven methods and machine learning. The widespread adoption of these libraries across scientific and engineering disciplines highlights their growing prominence in accelerating simulations. This talk presents an in situ (in-memory/online) workflow, requiring minimal modifications, designed for mature AMReX codes to leverage the rich Python ecosystem. The workflow enables language-interoperable data transfer through runtime coupling of AMReX and pyAMReX via the Multiple Program Multiple Data (MPMD) mode of the Message Passing Interface (MPI). This capability is demonstrated through multiscale modeling of granular flows, which involves coupling low-fidelity continuum and high-fidelity particle-based methods. The computational intractability of straightforward coupling between low- and high-fidelity methods is addressed using adaptively (on-the-fly) evolving neural network ensembles, implemented with PyTorch Distributed Data Parallel, as a surrogate for the expensive high-fidelity solver. The scalability of the current approach across multiple GPUs will also be discussed.
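The MPMD mechanism underneath this coupling can be illustrated independently of pyAMReX: an MPMD launch places both programs in a single MPI world, which each side then splits into its own communicator. The mpi4py sketch below shows only that generic MPI pattern; pyAMReX's own coupling utilities are not shown.

```python
# Generic illustration (not the pyAMReX API) of MPMD coupling: a launch
# such as `mpiexec -n 8 ./amrex_solver : -n 2 python ml_side.py` starts
# two programs in one MPI world.
from mpi4py import MPI

world = MPI.COMM_WORLD
# MPI_APPNUM tells each rank which command of the MPMD launch it belongs
# to (0 for the first program, 1 for the second, ...).
appnum = world.Get_attr(MPI.APPNUM) or 0
local = world.Split(color=appnum, key=world.Get_rank())
print(f"world rank {world.Get_rank()}: program {appnum}, "
      f"local rank {local.Get_rank()} of {local.Get_size()}")
```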
Speakers
BS

Bhargav Siddani

Postdoc, Lawrence Berkeley National Laboratory
Wednesday May 7, 2025 1:35pm - 1:55pm CDT
Rock River 1 & 2

1:35pm CDT

Key Charliecloud Innovation - Build Cache - Reid Priedhorsky, Los Alamos National Laboratory
Wednesday May 7, 2025 1:35pm - 1:55pm CDT
Container images are built by interpreting instructions in a machine-readable recipe, typically a Dockerfile; builds are often faster with a build cache that stores instruction results for re-use. The standard approach (used, e.g., by Docker and Podman) is a many-layered union filesystem, encoding differences between layers as tar archives.

This talk describes a new approach, as implemented in Charliecloud: store changing images in a Git repository. Our experiments show this performs similarly to layered caches on both build time and disk usage, with a considerable advantage for many-instruction recipes. Our approach also has structural advantages: better diff format, lower cache overhead, and better file de-duplication. These results show that a Git-based cache for layer-free container implementations is not only possible but may outperform the layered approach on important dimensions.
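As a rough illustration of the caching idea (not Charliecloud's implementation), each build state can be keyed by the parent state plus the instruction, with Git refs marking cached states; all names below are invented for the sketch.

```python
# Toy sketch: key each build state by (parent state, instruction), so an
# unchanged prefix of a recipe is a cache hit; Git tags stand in for
# cached image states.
import hashlib
import subprocess

def cache_key(parent: str, instruction: str) -> str:
    return hashlib.sha256(f"{parent}\n{instruction}".encode()).hexdigest()

def cached(key: str, repo: str = ".") -> bool:
    # A tag per key marks a committed image tree for that build state.
    res = subprocess.run(
        ["git", "-C", repo, "rev-parse", "-q", "--verify", f"refs/tags/{key}"],
        capture_output=True)
    return res.returncode == 0

def build(instructions: list[str]) -> str:
    state = "ROOT"
    for ins in instructions:
        key = cache_key(state, ins)
        if not cached(key):
            pass  # miss: execute `ins`, commit the image tree, tag it `key`
        state = key  # hit or miss, this is the new build state
    return state
```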

LA-UR-25-22140
Speakers
RP

Reid Priedhorsky

Scientist, Los Alamos National Laboratory
I am a staff scientist at Los Alamos National Laboratory. Prior to Los Alamos, I was a research staff member at IBM Research. I hold a Ph.D. in computer science from the University of Minnesota and a B.A., also in computer science, from Macalester College.My work focuses on large-scale... Read More →
Wednesday May 7, 2025 1:35pm - 1:55pm CDT
Illinois River

1:35pm CDT

E4S and Spack - Sameer Shende, University of Oregon
Wednesday May 7, 2025 1:35pm - 1:55pm CDT
This talk will describe how the Extreme-scale Scientific Software Stack (E4S) is developed and maintained. E4S, an Ecosystem for Science, is layered upon the Spack package manager. The talk will focus on the maintenance of E4S and its continuous integration (CI) on the Frank system at the University of Oregon.
Speakers

Sameer Shende

Research Professor and Director, Performance Research Lab, University of Oregon
Sameer Shende serves as a Research Professor and the Director of the Performance Research Lab at the University of Oregon and the President and Director of ParaTools, Inc. (USA) and ParaTools, SAS (France). He serves as the Technical Lead of the Extreme-scale Scientific Software Stack... Read More →
Wednesday May 7, 2025 1:35pm - 1:55pm CDT
Salon E-G

1:35pm CDT

Trilinos Introduction and Overview - Curtis Ober, Sandia National Laboratories
Wednesday May 7, 2025 1:35pm - 1:55pm CDT
In this presentation, we will offer a concise introduction and overview of the Trilinos project. Established in 2001, Trilinos is a collaborative initiative focused on developing algorithms and technologies to address complex multi-physics engineering and scientific challenges on high-performance computing (HPC) architectures. This includes a range of capabilities such as linear and nonlinear solvers, optimization techniques, and sensitivity analysis. A key mission of Trilinos is to cultivate a vibrant community of developers and users who contribute to a modular software framework comprised of diverse computational packages. This collaborative environment facilitates resource sharing among library developers, enhancing software compatibility, improving overall efficiency, and amortizing development costs.
Speakers

Curtis Ober

Distinguished Member of Technical Staff, Sandia National Laboratories
Curt has been with Sandia for 30 years and is currently a Distinguished Member of the Technical Staff.  His career has spanned many projects and missions at the laboratory, including computational fluid dynamics (CFD) for re-entry vehicles, shock hydrodynamics, time integration (Trilinos... Read More →
Wednesday May 7, 2025 1:35pm - 1:55pm CDT
Mississippi River 1 & 2

1:35pm CDT

Adopting Kokkos
Wednesday May 7, 2025 1:35pm - 3:15pm CDT
1. A Brief Overview of LANL's use of Kokkos - Daniel Holladay, Los Alamos National Laboratory (20 minutes)
Since the commissioning of the first petascale machine, Roadrunner, in 2009 at the Los Alamos National Laboratory (LANL), the ability of physics codes at LANL to take advantage of accelerators has provided utility and productivity improvements for code users. The ability to take advantage of an accelerator, and more specifically general-purpose graphics processing units (GPGPUs), will quickly move from a productivity enhancement to an absolute necessity: more than 90% of the compute capability of the El Capitan supercomputer at Lawrence Livermore National Laboratory (LLNL) can only be accessed through effective use of its GPGPUs, a task which has traditionally been accomplished with vendor-specific software extensions such as CUDA or HIP. Many projects, with code bases ranging from large and established Fortran codes to new C++-based projects, have made the decision to use Kokkos as the tool that will enable effective use of LLNL's El Capitan compute resources as well as future machines likely to benefit from Kokkos's capabilities. In this talk I will give an overview of several physics code projects at LANL and their usage of Kokkos.

2. Enhancing Fortran Code for Operational Weather Forecasting with Kokkos: Results and Lessons Learned - Timothy Sliwinski, Cooperative Institute for Research in the Atmosphere (CIRA) (20 minutes)
At NOAA, much of the code for numerical weather prediction (NWP) and operational weather forecasting is built upon Fortran, into which decades of scientific research knowledge and expertise have been invested. Therefore, moving away from Fortran and potentially breaking what has been a highly reliable system for many years is a significant challenge.
To demonstrate new methods to modernize NOAA's NWP models, Kokkos was selected due to its ability to work across multiple GPUs and CPUs with a single source code and the presence of the Fortran Language Compatibility Layer (FLCL), which eases development of the interface between Fortran and C++ Kokkos kernels. As a first step, the YSU Planetary Boundary Layer (PBL) scheme was chosen as the target, and a Kokkos prototype was developed, tested, and performance benchmarked. In this presentation, we report the performance of this new Kokkos-enhanced Fortran code on a CPU and an Nvidia GPU, the challenges of the C/Fortran interface, potential future prospects for the use of Kokkos at NOAA, and overall lessons learned from this project for anyone else interested in using Kokkos with existing Fortran source codes.

3. Using Umpire's Memory Management Capabilities with Kokkos - Kristi Belcher, LLNL (20 minutes)
Umpire is an open-source data and memory management library created at Lawrence Livermore National Laboratory (LLNL). Although Umpire is part of the RAJA Portability Suite, it was made to be modular and can therefore be used with Kokkos and other performance portability abstractions. Umpire provides memory pools that avoid expensive calls to the underlying device-specific API, making allocations, large or small, performant in HPC environments. Umpire provides numerous types of memory resources and allocators (e.g., Device, Host, Unified Memory, and IPC Shared Memory). In this talk, I will discuss key Umpire features and capabilities and showcase a Kokkos example with Umpire.

4. Early Experiences Using Kokkos for Multi-Resolution Analysis - Joseph Schuchart, Stony Brook University (20 minutes)
MADNESS is a framework for multi-resolution analysis with application in quantum chemistry. In this talk, we will present some early experiences in using Kokkos in a port of MADNESS to the TTG data-flow programming model, which includes both a restructuring of the existing program flow and a port to accelerators.
Speakers

Daniel Holladay

Computational Physicist, Los Alamos National Laboratory
Daniel Holladay is the deputy project leader for computer science for the project that maintains the FLAG Lagrangian multi-physics code at the Los Alamos National Laboratory (LANL). He received a Ph.D. in Nuclear Engineering from Texas A&M University in 2018 while working as a LANL... Read More →

Joseph Schuchart

Senior Research Scientist, Stony Brook University
Joseph Schuchart is a Senior Research Scientist at the Institute for Advanced Computational Science at Stony Brook University. He has been working on distributed data flow programming models and communication models, currently working at the intersection with computational chemistry... Read More →

Kristi Belcher

Software Developer, LLNL
Kristi is a Software Developer at Lawrence Livermore National Laboratory working primarily on Umpire, an open source library that supports parallel data and memory management on HPC platforms, and MARBL, a large multi-physics simulation code. Kristi also works on the RADIUSS project... Read More →

Timothy Sliwinski

HPC Software Developer, Cooperative Institute for Research in the Atmosphere (CIRA)
Dr. Timothy Sliwinski is an atmospheric scientist with the Cooperative Institute for Research in the Atmosphere at Colorado State University. Working directly with NOAA Global System Laboratory federal scientists in the Scientific Computing Branch, Dr. Sliwinski has worked on multiple... Read More →
Wednesday May 7, 2025 1:35pm - 3:15pm CDT
Chicago River Ballroom

1:55pm CDT

A High-Performance, GPU-Enabled, Data-Driven Micromagnetics Solver for Spintronic Systems - Yingheng Tang, Lawrence Berkeley National Laboratory
Wednesday May 7, 2025 1:55pm - 2:15pm CDT
To comprehensively investigate multiphysics coupling in spintronic devices, GPU acceleration is essential to address the spatial and temporal disparities inherent in micromagnetic simulations. Beyond traditional numerical methods, machine learning (ML) offers a powerful approach to replace and accelerate computationally expensive routines, particularly in evaluating demagnetization fields. Leveraging AMReX and Python-based ML workflows, we developed an open-source micromagnetics tool that integrates ML-driven surrogate models to enhance computational efficiency. By replacing costly demagnetization field calculations with neural network-based approximations, the framework significantly accelerates simulations while maintaining accuracy. In addition to supporting key magnetic interactions—including Zeeman, demagnetization, anisotropy, exchange, and Dzyaloshinskii-Moriya interactions—it is validated on µMAG standard problems, widely accepted DMI benchmarks, and Skyrmion-based applications. This ML-accelerated approach improves computational performance and enables large-scale, data-driven micromagnetics simulations, advancing the study of spintronic and electronic systems.
Speakers
YT

Yingheng Tang

Postdoc, Lawrence Berkeley National Laboratory
Wednesday May 7, 2025 1:55pm - 2:15pm CDT
Rock River 1 & 2

1:55pm CDT

Key Charliecloud Innovation - SIF - Krishna Chilleri, Los Alamos National Laboratory
Wednesday May 7, 2025 1:55pm - 2:15pm CDT
The Singularity Image Format (SIF) is a compressed, read-only SquashFS filesystem that includes everything needed to run a containerized application. Because SIF is built on SquashFS, Charliecloud has integrated support for reading and executing this image format. This talk will provide details on how ch-run identifies SIF files, analyzes the file header structure, and retrieves the SquashFS partition offset needed for container image execution.
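As a simplified illustration of the mechanism (not Charliecloud's code, which reads the offset from SIF's descriptor table), one can locate an embedded SquashFS partition by scanning for the SquashFS superblock magic at aligned offsets:

```python
# Simplified sketch, not Charliecloud's implementation: find the offset
# of the SquashFS partition inside an image file by scanning for the
# SquashFS superblock magic b"hsqs"; the alignment value is illustrative.
SQUASHFS_MAGIC = b"hsqs"
ALIGN = 4096

def squashfs_offset(path: str) -> int:
    with open(path, "rb") as f:
        data = f.read()
    for off in range(0, len(data) - 4, ALIGN):
        if data[off:off + 4] == SQUASHFS_MAGIC:
            return off  # hand this offset to the SquashFS mount
    raise ValueError(f"{path}: no SquashFS partition found")
```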

Additionally, the talk is designed for anyone looking to gain a clearer understanding of SIF, especially if you find the upstream documentation challenging to navigate.

LA-UR-25-22166
Speakers
KC

Krishna Chilleri

Student, Los Alamos National Laboratory
Wednesday May 7, 2025 1:55pm - 2:15pm CDT
Illinois River

1:55pm CDT

Spack-Stack: An Interagency Collaboration and Spack Extension - Dom Heinzeller, NRL / UCAR
Wednesday May 7, 2025 1:55pm - 2:15pm CDT
Numerical weather prediction (NWP) systems are complex, coupled models with several distinct components. They are often developed by different organizations: national agencies, research institutes, universities, and the community. Naturally, they require extensive third-party libraries and run on a wide range of computational platforms, from the largest high-performance computing (HPC) systems to low-spec GitHub Actions runners.

To address this challenge of ever-increasing complexity, in 2021 the Joint Center for Satellite Data Assimilation (JCSDA) and NOAA's Environmental Modeling Center (EMC) created spack-stack, a novel, collaborative effort geared towards universal and portable NWP software environments, based on the Spack package manager developed at LLNL. The spack-stack collaboration has grown since its inception and, as of 2025, includes the United States Naval Research Laboratory (NRL) and NOAA's Earth Prediction Innovation Center (EPIC). The spack-stack software environments are also increasingly used at NASA's Global Modeling and Assimilation Office (GMAO), making spack-stack a core component of some of the Nation's flagship NWP systems.
Speakers

Dom Heinzeller

Computational Scientist, NRL / UCAR
Dom Heinzeller graduated with a PhD in Astronomy from the University of Heidelberg, Germany. Following a postdoctoral position on the evolution of protoplanetary disks at Kyoto University, Japan, he moved from Astrophysics to Numerical Weather Prediction. In his NWP career, he worked... Read More →
Wednesday May 7, 2025 1:55pm - 2:15pm CDT
Salon E-G

1:55pm CDT

An Overview of the Trilinos Core Products - Roger Pawlowski, Sandia National Laboratories
Wednesday May 7, 2025 1:55pm - 2:15pm CDT
The Trilinos core area includes packages that form the basic building blocks for Trilinos capabilities. This talk will briefly introduce each package and demonstrate how they can be pulled together into an application. We will cover the following packages. The Kokkos performance portability library provides data structures and parallel algorithms that are performant on both CPU and GPU architectures. The Kokkos Kernels library provides performance-portable BLAS and LAPACK routines. The Tpetra library provides MPI-based parallel distributed linear algebra data structures built on Kokkos. The Teuchos library provides utilities for a common look and feel across Trilinos packages, including a reference-counted smart pointer, print utilities, timers, and a parameter list for user input. The Zoltan library provides parallel distributed load-balancing tools. The PyTrilinos package provides Python wrappers for Trilinos capabilities. The Thyra package provides abstraction layers for writing linear algebra, linear solvers, and nonlinear solvers. The talk will conclude with examples of Trilinos capabilities integrated into the EMPIRE electromagnetic particle-in-cell code.

Speakers
RP

Roger Pawlowski

Computational Scientist R&D, Sandia National Laboratories
Wednesday May 7, 2025 1:55pm - 2:15pm CDT
Mississippi River 1 & 2

2:15pm CDT

Exascale Modeling with ML-Based Surrogates for Magnon-Photon Dynamics in Hybrid Quantum System - Zhi Jackie Yao, Lawrence Berkeley National Laboratory
Wednesday May 7, 2025 2:15pm - 2:35pm CDT
We introduce a hybrid HPC–ML framework for efficient modeling of magnon–photon interactions. The HPC component uses an explicit FDTD leap-frog Maxwell–LLG solver (second-order accurate), solving Maxwell’s equations in nonmagnetic regions and adding the LLG equation where ferromagnets are present. Parallelization leverages AMReX’s MPI+X model for multicore CPUs or GPUs, partitioning the domain among MPI ranks. Data collected from nine points in the ferromagnet feed a Long Expressive Memory (LEM) encoder–decoder, trained with a composite loss function (reconstruction, prediction, and physics) and guided by Curriculum Learning. During training, we begin with shorter sequences, no physics enforcement, and a higher learning rate, then move to longer sequences, physics constraints, and a lower rate. Using just 1 ns of high-fidelity simulation data, the ML surrogate accurately predicts the magnetic-field evolution and matches frequency responses (13–18 GHz) under various DC biases. With physics constraints included, errors remain low even for longer sequences. The model reproduces transmission spectra and captures both primary and dual resonances (1800–2200 Oe) with high precision, achieving errors below 2.5% and demonstrating robust spatial and spectral generalization.
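A hedged sketch of the composite loss described above, with the physics weight acting as the curriculum knob; the model and physics-residual callables are assumed stand-ins for illustration, not the authors' code.

```python
# Composite training loss: reconstruction + prediction + physics.
# `model` maps a past window to (reconstruction, prediction);
# `physics_residual` evaluates an assumed governing-equation residual.
import torch.nn.functional as F

def composite_loss(model, past, future, physics_residual, physics_weight=0.0):
    recon, pred = model(past)                # encoder-decoder outputs
    loss = F.mse_loss(recon, past)           # reconstruct the input window
    loss = loss + F.mse_loss(pred, future)   # predict the next window
    # Curriculum: begin with physics_weight=0, short sequences, and a
    # higher learning rate; later raise the weight and lengthen sequences.
    loss = loss + physics_weight * physics_residual(pred).pow(2).mean()
    return loss
```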
Speakers
ZJ

Zhi Jackie Yao

Research Scientist, Lawrence Berkeley National Laboratory
Wednesday May 7, 2025 2:15pm - 2:35pm CDT
Rock River 1 & 2

2:15pm CDT

Key Charliecloud Innovation - squashfs - Megan Phinney, Los Alamos National Laboratory
Wednesday May 7, 2025 2:15pm - 2:35pm CDT
The typical filesystem image formats for Charliecloud are SquashFS and tar archives. SquashFS is a compressed, read-only filesystem that unprivileged users can mount in user space with SquashFUSE; it is the preferred image format due to its various efficiencies. The previous SquashFS workflow was non-ideal due to user complexity and difficulties with HPC job schedulers.

We have designed a new workflow that links SquashFUSE into Charliecloud, so that the mount/unmount procedure needs only a single user command. A persistent process, the FUSE loop, services the FUSE requests; once the containerized application process finishes, Charliecloud unmounts the SquashFS and ends the FUSE loop. Our SquashFS workflow is now in production: it is more user friendly, cleans up after itself, and is more compatible with HPC job schedulers. We reduced user commands from three to one, increased reliability, and decreased mount/unmount time by more than 50%.
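A toy orchestration of the single-command pattern is sketched below using the standalone squashfuse and fusermount tools; Charliecloud itself links SquashFUSE directly into ch-run in C, so this is illustrative only.

```python
# Toy mount/run/unmount orchestration: start the FUSE process, run the
# containerized app, then tear the mount down when the app exits.
import subprocess
import tempfile

def run_from_squashfs(image: str, command: list[str]) -> int:
    mnt = tempfile.mkdtemp(prefix="ch-mnt-")
    subprocess.run(["squashfuse", image, mnt], check=True)  # FUSE loop starts
    try:
        return subprocess.run(["ch-run", mnt, "--", *command]).returncode
    finally:
        # Application finished: unmount the SquashFS, ending the FUSE loop.
        subprocess.run(["fusermount", "-u", mnt], check=True)
```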

LA-UR-25-22140
Speakers

Megan Phinney

Scientist, Los Alamos National Laboratory
Wednesday May 7, 2025 2:15pm - 2:35pm CDT
Illinois River

2:15pm CDT

Packaging for HPC and HTC in High Energy and Nuclear Physics: Comparing Spack to Other Solutions - Wouter Deconinck, University of Manitoba
Wednesday May 7, 2025 2:15pm - 2:35pm CDT
High Energy Physics (HEP) and Nuclear Physics (NP) experiments at accelerator facilities use modular software stacks which evolve over the decades-long lifetimes of these efforts. These software stacks have complex dependency trees, often reaching depths of O(50) levels and containing O(1000) nodes, with detailed versioning constraints. Reproducibility requirements demand that previous versions and their dependencies can be recalled on newer hardware architectures. The software stacks are deployed on computing centers around the world, as part of computing grids for high-throughput computing and (increasingly) high-performance computing. Current tools for managing these stacks have been around for years, but do not always have support for newer computing practices (containerization, integration in development workflows, heterogeneous architectures, deployment to shared file systems such as CernVM-FS). In this session, I will give an overview of the packaging solutions used in HEP and NP, and compare them with Spack in terms of functionality, adaptability, and usability.
Speakers

Wouter Deconinck

Associate Professor, University of Manitoba
Wouter Deconinck is an Associate Professor of Physics at the University of Manitoba. His research activities focus on experimental nuclear physics, in particular precision measurements of quantities that test our current best theory of fundamental particles and their interactions... Read More →
Wednesday May 7, 2025 2:15pm - 2:35pm CDT
Salon E-G

2:15pm CDT

Why Your Science Application Should Be Using Trilinos Linear Solvers - Jonathan Hu, Sandia National Laboratories
Wednesday May 7, 2025 2:15pm - 2:35pm CDT
Trilinos provides a variety of high-performance sparse iterative and direct linear solvers that are portable across CPU and GPU architectures. In this talk, I will provide motivation for why scientists and engineers developing a high-performance application should consider using Trilinos solvers. I'll give an overview of current solver capabilities across a spectrum of science applications, and discuss ongoing research as well as future directions.
Speakers
JH

Jonathan Hu

Computational Scientist R&D, Sandia National Laboratories
Wednesday May 7, 2025 2:15pm - 2:35pm CDT
Mississippi River 1 & 2

2:35pm CDT

AMReX for Renewable Energy Applications at NREL - Marc Day, National Renewable Energy Laboratory
Wednesday May 7, 2025 2:35pm - 2:55pm CDT
Speakers
MD

Marc Day

Group Manager, National Renewable Energy Laboratory
Wednesday May 7, 2025 2:35pm - 2:55pm CDT
Rock River 1 & 2

2:35pm CDT

Key Charliecloud Innovation - seccomp - Reid Priedhorsky, Los Alamos National Laboratory
Wednesday May 7, 2025 2:35pm - 2:55pm CDT
Do Linux distribution package managers need the privileged operations they request to actually happen? Apparently not, at least when building container images for HPC applications. Charliecloud uses this observation to implement a root emulation mode using a Linux seccomp filter that intercepts some privileged system calls, does nothing, and returns success to the calling program. This approach provides no consistency whatsoever but is sufficient to build a wide selection of Dockerfiles, including some that Docker itself cannot build, simplifying the fully unprivileged workflows needed for HPC application containers. This talk will detail the approach along with its advantages and disadvantages.
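The trick can be reproduced in a few lines with libseccomp's Python bindings; the production filter is installed by ch-run in C and covers many more syscalls, so treat this as a sketch of the idea only.

```python
# Root-emulation sketch: intercepted syscalls are never executed, and the
# caller sees success, so package managers proceed as if they were root.
import os
import seccomp

f = seccomp.SyscallFilter(defaction=seccomp.ALLOW)
for call in ("chown", "fchown", "lchown", "fchownat"):
    # ERRNO(0): skip the syscall and return 0, i.e., report success.
    f.add_rule(seccomp.ERRNO(0), call)
f.load()

os.chown("/tmp", 0, 0)  # now a successful no-op, even unprivileged
print("chown intercepted and faked")
```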

LA-UR-25-22140
Speakers
RP

Reid Priedhorsky

Scientist, Los Alamos National Laboratory
I am a staff scientist at Los Alamos National Laboratory. Prior to Los Alamos, I was a research staff member at IBM Research. I hold a Ph.D. in computer science from the University of Minnesota and a B.A., also in computer science, from Macalester College.My work focuses on large-scale... Read More →
Wednesday May 7, 2025 2:35pm - 2:55pm CDT
Illinois River

2:35pm CDT

Developing and Distributing HEP Software Stacks with Spack - Kyle Knoepfel & Marc Paterno, Fermi National Accelerator Laboratory
Wednesday May 7, 2025 2:35pm - 2:55pm CDT
The Computational Science and AI Directorate at Fermilab is using Spack to support the development efforts of a large number of scientific programmers, in many independent projects and experiments. While independent, these projects share many dependencies. They are typically under continuous and fairly rapid development. They have to support deployment on diverse hardware. This is a different context than is typical for the management of HPC software, where Spack was born. To support our community, we have created a model that enables users to develop code with greater efficiency than is possible with Spack’s current development facilities.

In this talk we will present:
- a brief introduction to the science we support (particle physics)
- how the code we work with is naturally organized into several layers of packages
- how we are using Spack to manage those layers
- how we leverage the layering to provide efficient support for developers, using our Spack extension "MPD"
- some suggestions for changes or additions to Spack to make such work easier.
Speakers

Kyle Knoepfel

Senior software developer, Fermi National Accelerator Laboratory
I am a senior software developer at Fermilab, responsible for developing and leading computing framework efforts to meet the data-processing needs of many of Fermilab's experiments.

Marc Paterno

Computer Science Researcher, Fermi National Accelerator Laboratory
Marc Paterno is a Computer Science Researcher at the Fermi National Accelerator Laboratory. His research interests include the design and development of frameworks and libraries for large-scale collection, simulation, and analysis of data in High Energy Physics. He has a Ph.D. in... Read More →
Wednesday May 7, 2025 2:35pm - 2:55pm CDT
Salon E-G

2:35pm CDT

Introduction to Trilinos Discretization and Analysis Capabilities - Mauro Perego, Sandia National Laboratories
Wednesday May 7, 2025 2:35pm - 2:55pm CDT
In this presentation, we introduce the discretization and analysis tools available in the Trilinos software suite. Using real-world scientific applications as exemplars, we briefly explain how to efficiently implement portable high-order finite element discretizations on unstructured grids for a wide variety of problems, including leveraging time integration schemes for transient problems. Additionally, we describe how to utilize the automatic differentiation and adjoint capabilities for solving nonlinear problems and computing sensitivities. Finally, we present the simulation-constrained optimization capabilities available through the ROL package, highlighting how different Trilinos tools work together to efficiently solve large-scale inference problems.
Speakers
MP

Mauro Perego

Computational Scientist R&D, Sandia National Laboratories
Wednesday May 7, 2025 2:35pm - 2:55pm CDT
Mississippi River 1 & 2

2:55pm CDT

Advancing High-speed Compressible Flow Simulations: Moving Bodies and Multiphase Flows with AMReX - Mahesh Natarajan, Lawrence Berkeley National Laboratory
Wednesday May 7, 2025 2:55pm - 3:15pm CDT
High-fidelity simulations of high-speed, compressible flows require accurately capturing complex features for precise results. AMReX, an exascale, block-structured adaptive mesh refinement (AMR) framework, enables high-resolution simulations for a range of applications. This talk explores two key challenges in high-speed flows: moving bodies and liquid-gas flows. For moving bodies, a ghost-cell method is developed within the Compressible Navier-Stokes (CNS) framework of AMReX to compute fluxes on moving embedded boundary (EB) faces. A third-order least-squares formulation improves wall velocity gradient accuracy, enhancing skin friction coefficient estimation. The method is validated using inviscid and viscous test cases. For liquid-gas flows, an all-Mach multiphase algorithm solves the compressible flow equations using an unsplit volume-of-fluid (VOF) method with piecewise linear interface calculation (PLIC) for liquid-gas interface reconstruction. Simulations include a liquid jet in supersonic crossflow and spray atomization with acoustic excitation.
Speakers
MN

Mahesh Natarajan

Computer Systems Engineer III, Lawrence Berkeley National Laboratory
Wednesday May 7, 2025 2:55pm - 3:15pm CDT
Rock River 1 & 2

2:55pm CDT

Key Charliecloud Innovation - CDI - Reid Priedhorsky, Los Alamos National Laboratory
Wednesday May 7, 2025 2:55pm - 3:15pm CDT
An ongoing challenge for HPC containers is how to make host resources such as GPU devices and proprietary interconnects performantly available inside a container. In Charliecloud, the key requirement is placing shared libraries (.so files) into the container and then running ldconfig(8) to update its cache. (Other implementations also have to deal with device files and their associated permissions and symlinks, but because Charliecloud bind-mounts host /dev into the container, this is not needed).

Charliecloud has done this for some time using ch-fromhost(1), which is a large, reverse-engineered shell script that copies the needed files into a writeable image. It is difficult to maintain, does not support SquashFS or other read-only images, and adds a workflow step.

Other implementations typically use “OCI hooks”, which are arbitrary vendor-provided or custom programs run at various phases during container setup. These also present maintainability / bit-rot problems, can be opaque, and because their interface is solely “do it for me”, any invalid assumptions that hooks make can be difficult or impossible to work around.

A different approach is the emerging Container Device Interface (CDI) standard, with contributors from NVIDIA, Intel, Los Alamos, and others. This gives prescriptive JSON/YAML descriptions of what is needed. Charliecloud has implemented CDI in its runtime ch-run(1), bind-mounting requested files and using an unprivileged tmpfs overlay (available since Linux 5.11, released in February 2021) to avoid modifying the image. This is a considerably simpler and more maintainable way to make host resources available inside a container.
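For orientation, a CDI device specification looks roughly like the following, built as a Python dict mirroring the CDI standard's JSON structure; the device and library paths are illustrative, not a real vendor spec.

```python
# Rough shape of a CDI device specification: a runtime such as ch-run
# reads this and performs the listed bind mounts and device-node edits.
import json

cdi_spec = {
    "cdiVersion": "0.6.0",
    "kind": "nvidia.com/gpu",
    "devices": [{
        "name": "gpu0",
        "containerEdits": {
            "deviceNodes": [{"path": "/dev/nvidia0"}],
            "mounts": [{
                "hostPath": "/usr/lib64/libcuda.so.1",       # illustrative
                "containerPath": "/usr/lib64/libcuda.so.1",
                "options": ["ro", "bind"],
            }],
        },
    }],
}
print(json.dumps(cdi_spec, indent=2))
```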

This talk will provide an overview of CDI, adaptations of the standard to our fully unprivileged workflow, and our C implementation. We will also demonstrate the functionality for NVIDIA GPUs and the HPE Cray Slingshot interconnect.

LA-UR-25-22140
Speakers
RP

Reid Priedhorsky

Scientist, Los Alamos National Laboratory
I am a staff scientist at Los Alamos National Laboratory. Prior to Los Alamos, I was a research staff member at IBM Research. I hold a Ph.D. in computer science from the University of Minnesota and a B.A., also in computer science, from Macalester College.My work focuses on large-scale... Read More →
Wednesday May 7, 2025 2:55pm - 3:15pm CDT
Illinois River

2:55pm CDT

GAIA: A Software Deployment Strategy, Ordeals, Success, and General Applicability - Etienne Malaboeuf, CEA
Wednesday May 7, 2025 2:55pm - 3:15pm CDT
As High-Performance Computing (HPC) transitions from petascale to exascale, we have observed significant changes and new challenges in HPC software management. This presentation details the choices made for building and deploying HPC applications and tooling on the HPE Cray EX Adastra machine at the French National Computer Science Center for Higher Education (CINES). We present the GAIA project, a home-grown solution on top of Spack that allows us to deploy software stacks for each of our partitions. We expose our requirements, the trade-offs we faced, the choices we made, and the strategy employed to address them. We explain how our users can leverage the Spack configuration provided by CINES to build their own software, and the limitations of the approach. We conclude with a look at the approach's general applicability and at open-sourcing it to our site's user base.
Speakers

Etienne Malaboeuf

HPC Engineer, CINES/CEA
I focus on improving the performance of projects related to real-time and high-performance computing, while providing various forms of support to researchers using French supercomputers. I have worked on numerical simulation software in an HPC context, on supercomputers and on game... Read More →
Wednesday May 7, 2025 2:55pm - 3:15pm CDT
Salon E-G

2:55pm CDT

Trilinos CI Testing/Contribution Overview - Samuel E. Browne, Sandia National Laboratories
Wednesday May 7, 2025 2:55pm - 3:15pm CDT
The Trilinos project is one of many in the scientific software community that employ Continuous Integration and thus has a testing process that ensures code functionality and quality as part of its contribution process. This talk is aimed at software developers and researchers interested in enhancing their understanding of Trilinos' CI practices. We will examine the current processes that contributors must navigate, alongside emerging paradigms shaping the evolution of these practices. Key topics will include the implementation of containerized software testing environments, requirements for running tests on Trilinos compute resources, and objective standards that contributions must meet to guarantee software quality. Attendees will also gain insights into the challenges of providing a highly configurable set of software packages while maintaining user-friendly build processes. By the end of the session, participants will have a clearer understanding of how to effectively contribute to Trilinos, and the direction that DevSecOps efforts in Trilinos are heading in the future.
Speakers
SE

Samuel E. Browne

Principal R&D Computer Scientist, Sandia National Laboratories
Wednesday May 7, 2025 2:55pm - 3:15pm CDT
Mississippi River 1 & 2

3:15pm CDT

Coffee Break
Wednesday May 7, 2025 3:15pm - 3:40pm CDT
Wednesday May 7, 2025 3:15pm - 3:40pm CDT

3:40pm CDT

Closing Gaps in Spack for Software Application DevOps Infrastructure - Phil Sakievich, Sandia National Laboratories
Wednesday May 7, 2025 3:40pm - 3:50pm CDT
As the demand for efficient DevOps infrastructure for software applications continues to grow, key components such as build processes, build-time tests, regression testing, and deployment mechanisms have become critical to successful project delivery. Spack is increasingly being adopted as an orchestration architecture for DevOps, offering a framework that can adapt to various project needs. However, while the overarching patterns among projects may be similar, the specific implementation details can differ significantly, leading to challenges in achieving seamless integration. In this talk, we will present the gaps identified by researchers at Sandia National Laboratories in Spack's current services and our ongoing efforts to address these challenges. Key topics will include advancements in binary caches, binary provenance, reporting test results, enhancing build performance, and improving the overall developer user experience. Attendees will gain valuable insights into successful initiatives that have effectively closed certain gaps, as well as ongoing issues that remain open for the community to tackle.
Speakers

Phil Sakievich

Senior Computer Scientist R&D, Sandia National Laboratories
Phil comes from a high-performance computing and fluid mechanics background. He became involved with Spack during the ExaScale computing project and author of the Spack-Manager project. Phil is an active member of the Spack technical steering committee and currently leads several... Read More →
Wednesday May 7, 2025 3:40pm - 3:50pm CDT
Salon E-G

3:40pm CDT

A Mixed Formulation Vorticity-velocity Solver using AMReX Framework for Wind Farm Modeling - Balaji Muralidharan, Continuum Dynamics
Wednesday May 7, 2025 3:40pm - 4:00pm CDT
Wind energy plays a crucial role in meeting the electricity demands of the U.S.; however, high maintenance costs highlight the need for accurate predictions of unsteady loading caused by turbine layout and off-design wind conditions. Existing design tools often neglect fluid-structure interactions that drive costly fatigue loads, prompting the research community to leverage high-performance computing (HPC) solvers to study these effects. Unfortunately, such tools remain too complex and costly for industrial applications, particularly due to challenges in grid generation and setup. To address this, CDI is developing a Cartesian-based hybrid solver that integrates an incompressible vorticity-based far-field formulation with a compressible primitive variable solver in the near field. The framework is built on the AMReX library, enabling block-structured mesh refinement for efficient computation. This talk will explore both the computational and mathematical aspects of coupling these two solvers, highlighting advancements in predictive modeling for wind turbine aerodynamics.
Speakers
BM

Balaji Muralidharan

Continuum Dynamics
Wednesday May 7, 2025 3:40pm - 4:00pm CDT
Rock River 1 & 2

3:40pm CDT

The Role of Trilinos in 4C: Advancing Coupled Multiphysics Simulations - Matthias Mayr, University of the Bundeswehr Munich
Wednesday May 7, 2025 3:40pm - 4:00pm CDT
The 4C (Comprehensive Computational Community Code) multiphysics simulation framework has been developed to address complex physical phenomena across various scientific and engineering domains. From its inception, 4C has relied on the Trilinos project, an open-source software library for scalable numerical computations, as its backbone for sparse linear algebra and MPI-parallel computing. This integration enhances 4C's computational capabilities and, more importantly, allows the 4C developers to focus on their core research interest: the numerical modeling of multiphysics systems. The synergy between 4C's physics models and Trilinos' numerical solvers facilitates the simulation of coupled multiphysics systems with improved accuracy and performance. Over the years, this synergy has — in parts — evolved to a co-development of both software frameworks. This presentation will delve into the methodologies employed to incorporate Trilinos into the 4C framework, discuss software and development challenges, and showcase application case studies that demonstrate the practical benefits of this integration in simulating complex multiphysics systems such as fluid-solid interactions, contact mechanics or beam-solid interaction.
Speakers

Matthias Mayr

Head of Data Science & Computing Lab, University of the Bundeswehr Munich
Wednesday May 7, 2025 3:40pm - 4:00pm CDT
Mississippi River 1 & 2

3:40pm CDT

Charliecloud Office Hours - Reid Priedhorsky, Los Alamos National Laboratory
Wednesday May 7, 2025 3:40pm - 5:00pm CDT
Members of the Charliecloud team will be available for office hours to listen to feedback/suggestions, answer questions, and/or help debug issues.

LA-UR-25-22140
Speakers
RP

Reid Priedhorsky

Scientist, Los Alamos National Laboratory
I am a staff scientist at Los Alamos National Laboratory. Prior to Los Alamos, I was a research staff member at IBM Research. I hold a Ph.D. in computer science from the University of Minnesota and a B.A., also in computer science, from Macalester College.My work focuses on large-scale... Read More →
Wednesday May 7, 2025 3:40pm - 5:00pm CDT
Illinois River

3:40pm CDT

Lightning Talks
Wednesday May 7, 2025 3:40pm - 5:00pm CDT
1. Experience Porting a Scientific Code from YAKL to Kokkos - James Foucar, Sandia National Labs (10 minutes)
The DoE climate code E3SM recently ported a medium-sized scientific code, RRTMGP (which computes radiative fluxes in planetary atmospheres), from a kernel launcher called YAKL to Kokkos. We'd like to share tips and pain points from this effort, particularly the struggle to reach performance parity with YAKL. We found that a 1:1 port (the YAKL API is very similar to Kokkos) was not nearly sufficient to achieve good performance. The main issues were how to allocate temporary views and dealing with MDRangePolicy.

2. Benchmarking Lattice QCD Staggered Fermion Kernel Written in Kokkos - Simon Schlepphorst, Forschungszentrum Juelich GmbH (10 minutes)
Lattice quantum chromodynamics (QCD) is a numerical approach to studying the interactions of quarks and gluons, where the fundamental equations governing their interactions are discretized onto a four-dimensional spacetime lattice. One of the most costly computations is the inversion of the lattice Dirac operator, a large sparse matrix. Calculating this inversion with iterative solvers leads to many applications of that operator. This study builds on previous work where we implemented the staggered fermion Dirac operator as a benchmark in Kokkos. We investigate the effects of the tiling size in combination with the use of a 4D MDRangePolicy and 7D Views.

3. Leveraging Liaisons in Your Network for Software Sustainability - Elaine M. Raybourn, Sandia National Laboratories (10 minutes)
Open source software project sustainability is a sociotechnical endeavor that often extends beyond the efforts of individual projects. HPSF and the Linux Foundation offer rich resources of expertise across communities in industry, academia, and agencies. Leveraging this collective knowledge and experience is vital to enhancing project practices, especially in the early identification of challenges and potential issues. This lightning talk explores the value of leveraging liaisons, key individuals who actively participate in cross-team networks, to accelerate project sustainability. Liaisons can bridge gaps, share tacit knowledge, incentivize collaborative efforts across communities, and assist in breaking down silos. The value of leveraging liaisons was identified during the DOE Exascale Computing Project as a way to foster strategic project alignment and outreach. Whether in a small team or a larger network of teams of teams, identifying liaisons early on can foster trust and transparency both within and across teams.

4. Vertex-CFD: A Multi-Physics Solver for Fusion Applications - Marc Olivier Delchini & Daniel Arndt, Oak Ridge National Laboratory (10 minutes)
In this talk we will introduce Vertex-CFD, a multiphysics solver being developed in response to Oak Ridge National Laboratory's (ORNL) need for accurate simulation software for modeling a fusion blanket problem. Vertex-CFD is built upon the Trilinos and Kokkos libraries for compatibility with CPU and GPU platforms. It is designed to generate high-fidelity solutions of multiphysics problems in complex geometries by leveraging state-of-the-art computing methods and technologies. We will describe how we leverage Kokkos and Trilinos to solve the governing equations by employing a finite element method and high-order implicit temporal integrators.

5. Toucan: Revolutionizing Microstructure Prediction - Benjamin Stump, ORNL (10 minutes)
I will describe my code: what it does physically, what I need it to do computationally, and how I achieved that using Kokkos and optimized it algorithmically.

6. Performance-Portable Spectral Ewald Summation with PyKokkos - Gabriel K Kosmacher, Oden Institute, The University of Texas at Austin (10 minutes)
We present a performance-portable implementation of the Spectral Ewald method, employing shared-memory and streaming parallelism to rapidly evaluate periodic two-body potentials in Stokes flow. The method splits dense particle evaluation into near-field and far-field components, where the near-field is singular and the far-field decays rapidly in Fourier space. Far-field interactions resemble a Nonuniform Fast Fourier Transform: source potentials are interpolated onto a uniform grid (p2g), an ndFFT is applied, Fourier potentials are scaled, an ndIFFT is applied, and the potentials are interpolated back (g2p). The p2g, g2p, and near-field (p2p) interactions use Kokkos hierarchical parallelism with scratch-pad memory and thread-vector range reductions.

7. Empowering NSM Supercomputers with Kokkos for Scalable HPC - Harsha Ugave & Samir Shaikh, Centre for Development of Advanced Computing (C-DAC) (10 minutes)
Kokkos is transforming how high-performance applications run on National Supercomputing Mission (NSM) systems. With NSM deploying a mix of CPUs, GPUs, and other accelerators, ensuring software runs efficiently across all these platforms can be challenging. Kokkos simplifies this by providing a single, flexible programming model that adapts to different hardware without requiring major code changes. It supports multiple backends like CUDA, HIP, SYCL, and OpenMP, making it easier for developers to write performance-portable applications. For NSM’s large-scale supercomputers, Kokkos ensures better performance and scalability, allowing applications to make full use of processors, GPUs, and memory hierarchies. It also optimizes energy efficiency by improving memory access and reducing unnecessary data movement, helping to make supercomputing more sustainable. Since Kokkos is open-source and backed by an active community, it keeps up with emerging technologies, ensuring seamless adoption of next-generation NSM systems and preparing them for the future of exascale computing.

8. Real-Time Performance Characterization of the ADIOS2 Library When Kokkos Is Enabled - Ana Gainaru, Oak Ridge National Laboratory (10 minutes)
Modern performance analysis tools are increasingly capable of capturing a high volume of metrics at ever-finer granularity. This abundance of information presents an opportunity to move beyond post-mortem analysis and leverage data streaming for real-time performance monitoring and decision-making. By streaming performance data, applications can provide immediate feedback, enabling dynamic adjustments and optimizations during execution. Furthermore, this streamed data can be directed to individual scientist workstations, facilitating on-the-fly health checks and user-driven interventions to steer the application's behavior. We will demonstrate the practical application of these concepts within the ADIOS2 library, showcasing how data streaming enables detailed monitoring and analysis of an HPC application during large-scale runs.

9. Cabana: Particles, Structured Grids, and Extensions to Unstructured with Kokkos - Sam Reeve, ORNL (10 minutes)
We discuss updates to Cabana, a Kokkos+MPI library for building particle applications. Cabana was created through the U.S. Department of Energy Exascale Computing Project to enable particle simulation across methods on current and future exascale supercomputers. Cabana includes particle and structured grid parallelism, data structures, algorithms, communication, and interfaces to additional libraries, all extending and working alongside Kokkos. We focus in particular on recent efforts to integrate Cabana particles within Trilinos unstructured grids for broader support of scientific applications. We will highlight further recent Cabana development, performance and portability, and application-level demonstrations.
Speakers

Elaine M. Raybourn

Principal Member of the Technical Staff, Sandia National Laboratories
Elaine M. Raybourn is a social scientist at Sandia National Laboratories. She has worked in the UK (British Telecom), Germany (Fraunhofer FIT), and France (INRIA) as a Fellow of the European Research Consortium in Informatics and Mathematics (ERCIM). She supports the DOE Office of... Read More →

Daniel Arndt

Large Scale Computational Scientist, Oak Ridge National Laboratory
Daniel Arndt is a computational scientist at Oak Ridge National Laboratory. He is also a mathematician by training, specializing in finite element simulations. His research focuses on supporting new backends in Kokkos.

Samir Shaikh

Scientist, Centre for Development of Advanced Computing (C-DAC)
Samir Shaikh is an HPC specialist at C-DAC, Pune, optimizing large-scale workloads, parallel computing, and system architecture. As a Scientist C, he enhances HPC performance for AI/ML, scientific computing, and NSM supercomputers. An IIT Guwahati M.Tech graduate, he has contributed... Read More →

Ana Gainaru

Computer Scientist, Oak Ridge National Laboratory
Ana Gainaru is a computer scientist in the CSM division at Oak Ridge National Laboratory, working on performance optimization for large scale scientific applications and on profiling, managing, and analyzing large-scale data. She received her PhD from the University of Illinois at... Read More →

Benjamin Stump

Technical Staff, ORNL
Benjamin Stump works at the Oak Ridge National Laboratory's Manufacturing Demonstration Facility on Additive Manufacturing problems.

Gabriel K Kosmacher

Graduate Student, Oden Institute, The University of Texas at Austin
Gabriel is a PhD student at the Oden Institute for Computational Engineering & Sciences, where he is advised by George Biros. His research interests lie at the intersection of numerical analysis and scientific computing and is particularly interested in fast numerical methods for... Read More →

Harsha Ugave

HPC Engineer, Centre for Development of Advanced Computing (C-DAC), India
Harsha Ugave is an HPC Engineer at C-DAC Pune, specializing in performance portability, parallel computing, and system optimization. She plays a key role in deploying and tuning HPC applications under the National Supercomputing Mission (NSM). Her work ensures efficient execution... Read More →

James Foucar

Software Engineer, Sandia National Labs
I've been a software developer for Sandia for nearly 20 years. For the last 10 years, I've been doing software-focused tasks for E3SM (the DoE climate model).

Marc Olivier Delchini

CFD developer and analyst, Oak Ridge National Laboratory
CFD analyst and developer at Oak Ridge National Laboratory for 10 years. He obtained his PhD in nuclear engineering from Texas A&M University.

Sam Reeve

Staff Scientist, ORNL
Sam Reeve is a staff scientist at ORNL, working at the intersection of materials and computational science. Current focuses include performance portability and software development for physics applications and simulation of mesoscale material phenomena. He leads the development of... Read More →

Simon Schlepphorst

Research Software Engineer, Forschungszentrum Juelich GmbH
After graduating with a Master's degree in physics from the University of Bonn, Simon became a Research Software Engineer at the Juelich Supercomputing Centre developing Lattice QCD codes for current and upcoming accelerators.
Wednesday May 7, 2025 3:40pm - 5:00pm CDT
Chicago River Ballroom

3:50pm CDT

Development of Complex Software Stacks with Spack - Cedric Chevalier, CEA
Wednesday May 7, 2025 3:50pm - 4:00pm CDT
In this presentation, we will describe how we develop multi-physics applications with a software stack deployed with Spack.

We will describe how we designed a workflow, first using Spack features for development such as "setup," "dev-build," and "develop," and how we ended up creating a Spack plugin to generate custom CMake presets.

CMake presets are a portable way to set up CMake configurations. They can describe several configurations and can be exploited by different tools, from the command line to IDEs.
Generating these presets from Spack concretizations allows users to keep their usual environment, benefiting both from a correct installation of their dependencies and from the advanced features of an IDE that does not need to integrate explicitly with Spack.

We will present our journey through the different Spack solutions and the development of a CMake "preload cache"-based answer, and we will illustrate the use cases that ultimately led us to switch to CMake presets.
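A hedged sketch of what generating such a preset might look like, with invented helper names; the actual plugin derives the prefix paths and cache variables from the concretized Spack environment.

```python
# Emit a CMake configure preset whose CMAKE_PREFIX_PATH points at the
# Spack-installed dependencies, so IDEs and the CLI share one configuration.
import json

def preset_from_spack(name: str, prefix_paths: list[str]) -> dict:
    return {
        "version": 6,
        "configurePresets": [{
            "name": name,
            "generator": "Ninja",
            "binaryDir": "${sourceDir}/build/" + name,
            "cacheVariables": {
                "CMAKE_BUILD_TYPE": "Release",
                "CMAKE_PREFIX_PATH": ";".join(prefix_paths),
            },
        }],
    }

with open("CMakePresets.json", "w") as fp:
    # "/spack/view" is a placeholder for the environment's view or prefixes.
    json.dump(preset_from_spack("spack-dev", ["/spack/view"]), fp, indent=2)
```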
Speakers

Cedric Chevalier

Research Scientist, CEA
Cédric Chevalier is a research scientist at CEA in France. He is interested in developing libraries for HPC simulation codes, particularly in Linear Algebra and Mesh/Graph partitioning. His work at CEA is led by providing practical ways to exploit newer hardware, use new programming... Read More →
Wednesday May 7, 2025 3:50pm - 4:00pm CDT
Salon E-G

4:00pm CDT

Feedback on Using Spack to Deploy a Development Environment for the Gyselalibxx Library - Thomas Padioleau, CEA
Wednesday May 7, 2025 4:00pm - 4:10pm CDT
In this presentation, I will share feedback from packaging and deploying the dependencies of the open source library `gyselalibxx` using Spack, on a local cluster and on the national supercomputer Adastra. I will explain the challenges we faced in deploying the library and why we chose Spack.
Speakers

Thomas Padioleau

Engineer-Researcher, CEA
Dr. Thomas Padioleau is a CEA Engineer-Researcher at Maison de la Simulation. He leads the DDC project and also works on Voice++.
Wednesday May 7, 2025 4:00pm - 4:10pm CDT
Salon E-G

4:00pm CDT

ExaWind: Leveraging AMReX for High Fidelity Wind Farm Simulations - Marc Henry de Frahan, National Renewable Energy Laboratory
Wednesday May 7, 2025 4:00pm - 4:20pm CDT
AMR-Wind is a high-fidelity computational-fluid-dynamics solver for simulating wind farm flow physics. The solver enables predictive simulations of the atmospheric boundary layer and wind plants by leveraging a block-structured, adaptive-mesh, incompressible-flow solver. AMR-Wind is designed for scalability on high performance computing systems, with an emphasis on performance portability for graphics processing units (GPUs). These flow solver capabilities and performance characteristics were enabled using the AMReX library. In this talk, we present AMR-Wind, its capabilities, and its performance characteristics. We detail the numerical implementation and the verification and validation efforts, and demonstrate AMR-Wind for large eddy simulations of wind farm physics. A demonstration simulation is presented for a 12-turbine wind farm operating in a turbulent atmospheric boundary layer with realistic wake interactions. We also discuss AMR-Wind in the wider context of the ExaWind suite of codes, including (1) its integration as a background solver to Nalu-Wind with overset methods to perform geometry-resolved simulations of wind turbines and (2) its use coupled to a mesoscale weather simulation code, ERF.
Speakers
Marc Henry de Frahan
Computational Scientist, National Renewable Energy Laboratory
Rock River 1 & 2

4:00pm CDT

Automated Preconditioner Design in the Trilinos/Teko Package - Malachi Phillips, Sandia National Laboratories
Wednesday May 7, 2025 4:00pm - 4:20pm CDT
The solution of multiphysics problems depends on the construction of robust preconditioners which significantly influence the convergence rate and computational performance of linear solvers. We employ physics-based block preconditioning strategies from the Trilinos/Teko package developed by Cyr, Shadid, and Tuminaro [SIAM Journal on Scientific Computing, 38(5):S307–S331, 2016]. Teko preconditioner parameter selection, however, remains a difficult task for users. This talk presents an automated approach for the selection of multiphysics preconditioner parameters in the Trilinos/Teko package. We discuss design principles for preconditioners, emphasizing the importance of physics blocking, subblock ordering, and subblock solver choices based on equation types and coupling strengths. Finally, we present an extension to the Trilinos/Teko package in the form of a user-friendly software tool for automatic multiphysics preconditioner design with minimal user input.
Speakers
Malachi Phillips
Postdoctoral Appointee, Sandia National Laboratories
Mississippi River 1 & 2

4:10pm CDT

Spack at the Linac Coherent Light Source: Progress, Success, and Challenges - Valerio Mariani, SLAC National Accelerator Laboratory
Wednesday May 7, 2025 4:10pm - 4:20pm CDT
The Linac Coherent Light Source (LCLS) at the SLAC National Accelerator Laboratory takes X-ray snapshots of atoms and molecules in action, revealing fundamental processes in materials, technology and living things. Advanced data and computing systems are playing an increasingly vital role in LCLS’s operation: data acquisition software developed at LCLS allows the collection of very high throughput data, and makes sure that all the collected data is stored to disk. Additionally, LCLS provides scientists with sophisticated data analysis frameworks and helps them run their own bleeding-edge scientific analysis tools. With new detectors and analysis techniques being introduced all the time, new software is constantly being developed and maintaining compatibility with old hardware while supporting new frameworks and programs is an everyday challenge. Spack seems to offer a solid foundation for the software development process and is starting to be used at LCLS. This talk will discuss how Spack is making the development process easier and what challenges the LCLS Data Analysis group is still facing in making Spack a useful tool for everyday work.
Speakers
Valerio Mariani
LCLS Data Analysis Department Head, SLAC National Accelerator Laboratory
Dr. Valerio Mariani received his PhD in Biophysics at the University of Basel in Switzerland, and is currently Data Analysis Department Head at Linac Coherent Light Source (LCLS), part of the SLAC National Accelerator Laboratory. He has in the past collaborated with world-scale user... Read More →
Salon E-G

4:20pm CDT

Towards a Zero-Install Programming Environment - Mike Kiernan & Victor Gamayunov, Microsoft
Wednesday May 7, 2025 4:20pm - 4:30pm CDT
So, we got bored with installing stuff. Our project aims to accelerate time to results, improve reproducibility, and reduce reliance on proprietary programming environments and manual installs. Built on Spack, our tooling enables the rapid deployment of versioned programming environments to globally distributed HPC clusters, ensuring consistency across clusters and regions. This talk will present our solution and the problems it solves for us, discuss its benefits for HPC productivity, and invite community feedback on its broader applicability.
Speakers
Mike Kiernan
Principal Technical Program Manager, Microsoft
Mike Kiernan leads the Public Sector HPC and AI Customer Solutions and Innovation Team at Microsoft, and is based in Cambridge, UK. Joining Mike are Victor Gamayunov and Trevor Cooper-Chadwick, both Technical Program Managers in Mike's team, also based in the UK.
Victor Gamayunov
Senior Technical Program Manager, Microsoft
Victor is a Senior TPM on the Azure HPC and AI Customer Solutions and Innovation team at Microsoft. Prior to that, he spent two decades at Intel in HPC and application engineering.
Salon E-G

4:20pm CDT

Load Balancing in AMReX: Advancements and Discussing Future Goals - Kevin Gott, Lawrence Berkeley National Laboratory - NERSC
Wednesday May 7, 2025 4:20pm - 4:40pm CDT
Load balancing is an evergreen topic in HPC mesh-and-particle codes like AMReX. Appropriate, fast, and effective load balancing strategies are critical to ensure AMReX codes make the best use of available computational resources and maximize the size and type of simulations that can be performed. I have begun investigating potential short- and long-term advancements in load balancing that may be of use to the AMReX community. In this talk, I will give an overview of the current load balancing options in AMReX, plus an algorithmic investigation performed last summer by NERSC summer interns, which found potential improvements in our current Knapsack and SFC algorithms (a sketch of a knapsack-style assignment follows this abstract). Then, I will present an overview of possible next steps, long-term advancements, and where I believe these advances would be most helpful. Hopefully, this will generate a discussion around which investigations would be most helpful to the AMReX community, to determine the best targets for this summer's interns and beyond.
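
For readers unfamiliar with the strategy names, the following is a minimal, hedged sketch of a knapsack-style (greedy, longest-processing-time) assignment of boxes to ranks, assuming per-box cost estimates are available. It illustrates the general idea only, not AMReX's actual implementation.

```cpp
// Hedged sketch of a greedy knapsack-style load-balancing pass: sort boxes by
// estimated cost, then repeatedly give the heaviest unassigned box to the
// currently least-loaded rank. Illustrative only.
#include <algorithm>
#include <cstdio>
#include <functional>
#include <queue>
#include <vector>

std::vector<int> knapsack_assign(const std::vector<double>& box_cost, int nranks) {
  std::vector<int> order(box_cost.size());
  for (std::size_t i = 0; i < order.size(); ++i) order[i] = static_cast<int>(i);
  // Heaviest boxes first, so small boxes fill in the gaps at the end.
  std::sort(order.begin(), order.end(),
            [&](int a, int b) { return box_cost[a] > box_cost[b]; });
  // Min-heap of (accumulated load, rank).
  using Entry = std::pair<double, int>;
  std::priority_queue<Entry, std::vector<Entry>, std::greater<Entry>> heap;
  for (int r = 0; r < nranks; ++r) heap.push({0.0, r});
  std::vector<int> owner(box_cost.size());
  for (int b : order) {
    auto [load, rank] = heap.top();
    heap.pop();
    owner[b] = rank;                       // assign box to least-loaded rank
    heap.push({load + box_cost[b], rank}); // update that rank's load
  }
  return owner;
}

int main() {
  std::vector<double> costs{5.0, 3.0, 2.0, 7.0, 1.0, 4.0};
  auto owner = knapsack_assign(costs, 2);
  for (std::size_t b = 0; b < costs.size(); ++b)
    std::printf("box %zu -> rank %d\n", b, owner[b]);
}
```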
Speakers
Kevin Gott
NERSC Computational Engineer, Lawrence Berkeley National Laboratory - NERSC
Rock River 1 & 2

4:20pm CDT

Keeping Trilinos Running Performantly Everywhere Every Night - Chris Siefert, Sandia National Laboratories
Wednesday May 7, 2025 4:20pm - 4:40pm CDT
The Trilinos scientific software library is a key enabling technology for application codes in a variety of physics and engineering areas across the US Department of Energy and beyond. This presentation will "pull back the curtain" and describe how the Trilinos Tpetra/Performance team (a) performs nightly testing across five DOE laboratories, (b) identifies and remedies performance regressions, (c) makes the validated build scripts available to Trilinos developers, and (d) interfaces with early users and vendors to ensure that vendor library updates work as advertised.
Speakers
Chris Siefert
R&D Staff, Sandia National Laboratories
Mississippi River 1 & 2

4:30pm CDT

Deploying Software on Frontier with NCCS Software Provisioning (NSP) - Fernando Posada, Oak Ridge National Laboratory
Wednesday May 7, 2025 4:30pm - 4:40pm CDT
Managing software stacks on high-performance computing (HPC) systems is a complex challenge, particularly when working within vendor-provided programming environments. To address this, the National Center for Computational Sciences (NCCS) at Oak Ridge National Laboratory has developed NCCS Software Provisioning (NSP), a unified framework for deploying and managing software stacks while monitoring their usage.

NSP leverages Spack and Ansible to automate the deployment of software environments. Using templated configurations, it streamlines Spack-based installations while also managing non-Spack software through custom Ansible roles. Additionally, NSP enhances LMOD-based module environments by incorporating hooks and Spack module projections, enabling a dynamic and responsive software layout that adapts seamlessly to changes in the programming environment.

This presentation will discuss the motivation behind NSP, its strategies for managing software complexity in vendor-provided environments, and its implementation on Frontier, the world’s fastest supercomputer for open science.
Speakers
Fernando Posada
Group Lead, System Acceptance and User Environment, Oak Ridge National Laboratory
Fernando Posada is the group leader of the Systems Acceptance and User Environment group at Oak Ridge National Laboratory. He earned his Ph.D. in Chemistry from the Universidad Nacional de Colombia. During his Ph.D., Fernando developed an interest in High-Performance Computing and... Read More →
Salon E-G

4:40pm CDT

Organizational Approach to Spack Engagement: A Case Study - Phil Sakievich, Sandia National Laboratories
Wednesday May 7, 2025 4:40pm - 4:50pm CDT
The rapid growth in features, capabilities, and adoption of Spack has transformed it into a vital tool for managing software dependencies in scientific computing. However, one of the significant challenges faced by large organizations is the effective communication and information sharing across diverse teams. This talk will present a case study from Sandia National Laboratories, detailing the evolution and impact of strategies implemented to address these communication hurdles. We will explore the formation of a self-organized working group dedicated to fostering collaboration and knowledge exchange around Spack, as well as a recently funded initiative aimed at enhancing Spack collaboration and development efforts. By examining the successes and lessons learned from these organizational approaches, we aim to provide insights into best practices for engaging teams in the adoption of Spack, ultimately promoting a more cohesive and efficient development environment. Attendees will leave with a deeper understanding of how structured engagement strategies can facilitate the integration of Spack within large organizations and enhance collaborative software development efforts.
Speakers
Phil Sakievich
Senior Computer Scientist R&D, Sandia National Laboratories
Phil comes from a high-performance computing and fluid mechanics background. He became involved with Spack during the Exascale Computing Project and is the author of the Spack-Manager project. Phil is an active member of the Spack technical steering committee and currently leads several... Read More →
Salon E-G

4:40pm CDT

Code Coupling of PeleLM and IRL for Liquid Fuel Combustion Simulation - Hyoungwoo Kim, Korea Advanced Institute of Science and Technology
Wednesday May 7, 2025 4:40pm - 5:00pm CDT
This research develops a framework to simulate multiphase flows with surface tension. Two open-source codes are coupled to achieve this goal: the Interface Reconstruction Library (IRL), a volume of fluid (VOF) library, and PeleLM, a reacting Navier-Stokes equation solver. In addition to the coupling, surface tension is implemented using the continuum surface force (CSF) model along with the improved height function technique. The coupling produces spurious errors in the volume fraction field; these are corrected by geometric consideration, which improves the numerical stability and accuracy. With the developed framework, multiple validation simulations are conducted: (i) translations and rotations of Zalesak's disk, (ii) three-dimensional deformations of a spherical droplet, (iii) a stationary circular droplet with surface tension, and (iv) an oscillating elliptical droplet. Quantitative comparisons show that the shape errors for Zalesak's disk and the 3D deformation case are comparable to those of other solvers. The pressure inside the stationary droplet is maintained within 3.6% of the theoretical value, and the oscillation period of the elliptical droplet is within 6.7% of linear theory.
Speakers
Hyoungwoo Kim
Ph.D. Student, Korea Advanced Institute of Science and Technology
Rock River 1 & 2

4:40pm CDT

PyTrilinos2: Using Trilinos from Python - Christian Glusa, Sandia National Laboratories
Wednesday May 7, 2025 4:40pm - 5:00pm CDT
PyTrilinos2 is an auto-generated Python interface for several Trilinos packages. I will explain how additional C++ functionality can be exposed to Python and demonstrate existing solver capabilities.
Speakers
Christian Glusa
Principal Member of the Technical Staff, Sandia National Laboratories
Mississippi River 1 & 2

4:50pm CDT

Dynamic Resource Allocation for Continuous Integration Build Pipelines - Caetano Melone, Lawrence Livermore National Laboratory
Wednesday May 7, 2025 4:50pm - 5:00pm CDT
Spack team members manage a complex continuous integration (CI) infrastructure to compile and test software distributed via a centralized build cache. The workflows are executed in virtualized cloud environments, an approach that offers considerable flexibility in resource provisioning, resulting in quick feedback on pull requests submitted by the Spack community. However, fixed resource allocations for build jobs lead to job failures, slowdowns, and suboptimal resource usage.

spack-gantry aims to address these challenges by providing resource allocation predictions to build pipelines. We use historical build data to forecast future utilization and set resource limits. We will present strategies used to find optimal predictions, as well as experiments to create an online prediction service modeled after a genetic algorithm. The results of this work include improvements in usage efficiency and cost-per-job metrics.

Because spack-gantry involves modifications to the current CI system, we will discuss ramifications for users and developers and solicit feedback about how to make the infrastructure more user-friendly, reliable, and functional for the Spack community.
Speakers
Caetano Melone
Software Developer, Lawrence Livermore National Laboratory
Caetano Melone is a software developer at Lawrence Livermore National Laboratory working on open-source tools for HPC developer productivity.
Salon E-G
 
Thursday, May 8
 

8:30am CDT

Registration & Badge Pick-Up
Thursday May 8, 2025 8:30am - 4:00pm CDT
Ballroom Meeting Foyer

9:00am CDT

Breaking Charliecloud News - Reid Priedhorsky, Los Alamos National Laboratory
Thursday May 8, 2025 9:00am - 9:20am CDT
This session will cover late-breaking developments in Charliecloud, such as recent/upcoming new features, notable bugs, and requests for feedback. The specific agenda is TBD.

LA-UR-25-22140
Speakers
Reid Priedhorsky
Scientist, Los Alamos National Laboratory
I am a staff scientist at Los Alamos National Laboratory. Prior to Los Alamos, I was a research staff member at IBM Research. I hold a Ph.D. in computer science from the University of Minnesota and a B.A., also in computer science, from Macalester College. My work focuses on large-scale... Read More →
Illinois River

9:00am CDT

Creating Reproducible Performance Optimization Pipelines with Spack and Ramble - Doug Jacobsen, Google LLC
Thursday May 8, 2025 9:00am - 9:20am CDT
In this talk, I will present how we use Spack at Google, both with customers and internally. This portion will include an overview of our public cache, along with our use of Spack for benchmarking.

Additionally, I will discuss Ramble, an open-source tool we developed on top of Spack's infrastructure. I will show how we use Ramble and Spack together to construct complex end-to-end performance benchmarking studies for HPC and AI applications, and how this process can create reproducible experiments that are shareable with external users.
Speakers
Doug Jacobsen
HPC Software Engineer, Google Cloud
Salon E-G

9:00am CDT

Broader Kokkos Ecosystem
Thursday May 8, 2025 9:00am - 10:20am CDT
1. kokkos-fft Updates – Yuuichi Asahi, CEA (10 minutes)
kokkos-fft implements local interfaces between Kokkos and the de facto standard FFT libraries, including fftw, cufft, hipfft (rocfft), and oneMKL. We aim to provide numpy.fft-like interfaces adapted for Kokkos; the key concept is "as easy as numpy, as fast as vendor libraries" (see the first sketch after this talk list). In the talk, we will introduce the basic APIs and typical use cases. We will also present future development plans.

2. Fortran Porting Wish List for Kokkos – Matthew Norman, Oak Ridge National Laboratory (10 minutes)
This presentation covers the beginnings of the Yet Another Kernel Launcher (YAKL) C++ portability library, its evolution alongside Kokkos, the use of Kokkos in its current form, and the remaining issues before YAKL can be retired in favor of Kokkos. The primary outstanding issues are the inclusion of arbitrary lower bounds for Fortran-like View behavior, and the ability to back Views with a pool allocator for cheap, frequent device allocation and deallocation, so that Views can be created and destroyed locally where needed rather than existing for the global lifetime of a simulation. This may improve readability and reduce the memory high-water mark in simulations. A few performance-related issues will be covered as well, mainly limited to MDRangePolicy and parallel_for register usage.

3. Custom Layout and Tiling for Multi-Dimensional Data – Cedric Chevalier & Gabriel Dos Santos, CEA (10 minutes)
Performance optimizations for exascale HPC applications primarily rely on fine-tuning implementations, requiring comprehensive knowledge of heterogeneous hardware architectures that domain experts often lack. One of Kokkos' biggest successes is tying the memory layout of multi-dimensional arrays to the execution backend. It allows the exploitation of coalescence or cache, depending on the hardware. Here, we propose to go further and design custom tiled layouts that are generic for C++23's std::mdspan. Instead of running tile algorithms on flat data, like Kokkos' mdrange, we want to explore how running flat algorithms on tiled data performs. On CPU, the first experimental results with std::mdspan on a naive dense matrix multiplication demonstrate that, by replacing standard layouts with our proposed solution, we achieve an average speedup of over 2.2x, with peak performance improvements of up to 7.8x. Then, we will discuss how external indexing can improve efficiency. We will present how to exploit it with Kokkos' mdrange algorithm, and how it can behave on GPU.

4. Runtime Auto-Tuning for Kokkos Applications with APEX – Kevin Huck, University of Oregon (10 minutes)
Traditional GPU programming with libraries like CUDA or HIP requires tuning parameters exposed to the user, for example block sizes or number of teams. Kokkos also exposes portable parameters to the Kokkos user. How can Kokkos application programmers easily tune these Kokkos parameters for their application’s deployment when using any given Kokkos backend, without incurring large overheads? In particular, how do we ensure the tuning itself is portable across platforms? We propose using online, i.e., runtime, autotuning, utilizing the APEX Kokkos Tools connector to tune exposed parameters. Specifically, we discuss the Kokkos Tools Tuning Interface, tuning contexts, variable definition, the APEX runtime auto-tuning library utilizing Kokkos Tools, and distributed Kokkos auto-tuning. Applying our auto-tuning approaches to Kokkos sample kernels on Perlmutter and Frontier, we have obtained promising performance results. These results suggest Kokkos online auto-tuning is beneficial for production applications, and we invite Kokkos users to try these features and for Kokkos developers to contribute.

5. Unifying the HPC Ecosystem with std::execution – Mikael Simberg, Swiss National Supercomputing Centre (20 minutes)
Asynchronous programming models are becoming increasingly essential for fully leveraging modern hardware. In the C++ ecosystem, projects typically provide ad-hoc and varying interfaces, making interoperability difficult. Recently approved for C++26, the std::execution library promises to unify the ecosystem by providing a standard, composable interface for asynchronous operations. This talk briefly introduces the motivation and design principles of std::execution, and shares our experiences using it prior to standardization at CSCS in various projects, including Kokkos, HPX, and more. We'll discuss challenges, successes, and opportunities encountered while adopting std::execution.

6. PyKokkos: Performance Portability for Python Developers – Milos Gligoric, The University of Texas at Austin (20 minutes)
Kokkos is a programming model for writing performance portable applications for all major high performance computing platforms. It provides abstractions for data management and common parallel operations, allowing developers to write portable high performance code with minimal knowledge of architecture-specific details. Kokkos is implemented as a heavily-templated C++ library. However, C++ is not ideal for rapid prototyping and quick algorithmic exploration. An increasing number of developers use Python for scientific computing, machine learning, and data analytics. In this talk, I will present a new Python framework, PyKokkos, for writing performance portable applications entirely in Python. PyKokkos provides Kokkos-like abstractions that are easier to use and more concise than the C++ interface. We implemented PyKokkos by building a translator from a subset of Python to C++ Kokkos and bridging necessary function calls via automatically generated Python bindings. I will also cover our recent work on automatic kernel fusion with the goal to optimize PyKokkos applications. The talk will also cover our experience on developing PyKokkos, its current limitations, and future plans.
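
To make the numpy.fft analogy in the first talk concrete, here is a minimal, hedged sketch of what such an interface can look like; the header and function names (KokkosFFT.hpp, KokkosFFT::rfft) are taken from the kokkos-fft project's public examples but should be treated as assumptions that may differ from the released API.

```cpp
// Hedged sketch of a numpy.fft-like call on Kokkos Views: a real-to-complex
// transform that dispatches to fftw/cufft/hipfft/oneMKL under the hood.
#include <Kokkos_Core.hpp>
#include <KokkosFFT.hpp> // header name assumed

int main(int argc, char* argv[]) {
  Kokkos::initialize(argc, argv);
  {
    using exec_space = Kokkos::DefaultExecutionSpace;
    const int n = 128;
    Kokkos::View<double*> x("x", n);                                  // input signal
    Kokkos::View<Kokkos::complex<double>*> x_hat("x_hat", n / 2 + 1); // spectrum
    // Mirrors numpy.fft.rfft(x); the backend FFT library is selected by the
    // execution space, which is the "as fast as vendor libraries" part.
    KokkosFFT::rfft(exec_space(), x, x_hat);
  }
  Kokkos::finalize();
}
```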
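
Likewise, for the std::execution talk, the following is a small sketch of the sender/receiver composition style it describes, written against the stdexec reference implementation since standard-library shipments are still pending; the header and namespace names are assumptions.

```cpp
// Hedged sketch of P2300/C++26-style std::execution composition using the
// stdexec reference implementation: describe work lazily, then run it.
#include <stdexec/execution.hpp> // reference-implementation header (assumed)
#include <cstdio>

int main() {
  namespace ex = stdexec;
  // A sender describes work without running it; composition is lazy.
  auto work = ex::just(21)
            | ex::then([](int x) { return 2 * x; });
  // sync_wait drives the sender to completion and yields its result.
  auto [result] = ex::sync_wait(std::move(work)).value();
  std::printf("%d\n", result); // prints 42
}
```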
Speakers
Kevin Huck
Senior Research Associate, University of Oregon
Kevin Huck is a Senior Research Associate in the Oregon Advanced Computing Institute for Science and Society (OACISS) at the University of Oregon. He is interested in the unique problems of performance analysis of large HPC applications as well as automated methods for diagnosing... Read More →
Cedric Chevalier
Research Scientist, CEA
Cédric Chevalier is a research scientist at CEA in France. He is interested in developing libraries for HPC simulation codes, particularly in Linear Algebra and Mesh/Graph partitioning. His work at CEA is driven by providing practical ways to exploit newer hardware, use new programming... Read More →
Gabriel Dos Santos
PhD Student, CEA
PhD student working on the management of data structure representations on heterogeneous architectures for exascale-class HPC workloads, with a strong background in performance optimization, CPU microarchitectures, and vectorization.
Matthew Norman
Climate Scientist, Oak Ridge National Laboratory
Matt Norman leads the Advanced Computing for Life Sciences and Engineering group in the Oak Ridge Leadership Computing Facility (OLCF). He works with weather and climate simulation, urban and wind turbine simulation, PDE discretizations for the Navier-Stokes Equations, GPU acceleration... Read More →
Mikael Simberg
HPC Application Engineer, Swiss National Supercomputing Centre
Mikael Simberg holds a master's degree in operations research and computer science from Aalto University in Finland. He joined the Swiss National Supercomputing Centre in 2017, where he works as a software developer helping scientific projects make the best use of modern hardware through... Read More →
Milos Gligoric
Associate Professor, The University of Texas at Austin
Milos Gligoric is an Associate Professor in Electrical and Computer Engineering at The University of Texas at Austin, where he holds the Archie W. Straiton Endowed Faculty Fellowship in Engineering. His research interests are in software engineering, especially in designing techniques... Read More →
Yuuichi Asahi
Research Scientist, CEA
His recent interests are HPC and AI with NVIDIA, AMD, and Intel GPUs. He has rich experience in GPU programming models including CUDA, HIP, SYCL, Kokkos, OpenMP, OpenACC, thrust, stdpar, and senders/receivers. For exascale computing, he is highly interested in improving performance... Read More →
Salon A-C

9:20am CDT

Democratizing Access to Optimized HPC Software Through Build Caches - Stephen Sachs & Heidi Poxon, AWS
Thursday May 8, 2025 9:20am - 9:40am CDT
This talk presents our implementation of a build cache of pre-optimized HPC applications using Spack. By implementing architecture-specific enhancements for both x86 and ARM platforms during the build process, we created a set of stacks of optimized software accessible through build caches. Using application builds from the cache, users can reduce compute resource requirements without requiring specialized tuning expertise.
We'll demonstrate how teams can quickly deploy HPC clusters using these stacks and discuss the substantial advantages compared to building from source. We'll present comparisons to traditional builds, showing significant time-to-solution improvements. This work represents a step toward enabling the HPC community to focus on scientific discovery rather than software compilation and tuning.
Speakers
Heidi Poxon
Principal Member of Technical Staff, AWS
Stephen Sachs
Principal HPC Application Engineer, AWS
Dr. Stephen Sachs is a Principal HPC Application Engineer on the HPC Performance Engineering team at AWS. With over 15 years of domain specific experience, he specializes in application optimization and cloud-based HPC solutions. Previously, he worked as an Application Analyst at... Read More →
Salon E-G

9:20am CDT

Deploying AI Chatbot Assistants with Charliecloud - Jemma Stachelek, Los Alamos National Laboratory
Thursday May 8, 2025 9:20am - 10:00am CDT
Additional Authors: Tolulope Olatunbosun, Phil Romero & Mike Mason, Los Alamos National Laboratory

Retrieval Augmented Generation (RAG) systems improve the response relevance of LLMs (Large Language Models) by limiting the context to a document corpus. RAG systems have seen broad deployment as document summarization engines and AI chatbots. However, deploying these systems often assumes a privileged and "cloudy" environment with multi-container orchestration (i.e., docker compose) and unfettered internet access to pull resources (e.g., software, data, and models) on the fly. As an alternative, we leveraged Charliecloud's NVIDIA GPU support to deploy a RAG chatbot in an unprivileged HPC environment where resources are pre-staged. We demonstrate the deployment of AI chatbots using Charliecloud on a variety of hardware and software versions.

LA-UR-25-21968
Speakers
Jemma Stachelek
Scientist, Los Alamos National Laboratory
Illinois River

9:40am CDT

Spack, Containers, CMake: The Good, The Bad & The Ugly in the CI & Distribution of the PDI Library - Julien Bigot, CEA
Thursday May 8, 2025 9:40am - 10:00am CDT
The PDI data interface is a library that supports loose coupling of simulation codes with data handling libraries: the simulation code is annotated in a library-agnostic way, and data management through external libraries is described in a YAML "data handling specification tree". Access to each data handling tool or library (HDF5, NetCDF, Python, compiled functions, Dask/Deisa, libjson, MPI, etc.) is provided through a dedicated plugin. Testing, packaging, and distributing PDI is a complex problem, as each plugin comes with its own dependencies, some of which are typically not provided by supercomputer administrators. In the last five years, we have devised solutions to test & validate and to package & distribute the library and its plugins, largely based on Spack.

In this talk, we will describe PDI, the specific problems we encounter, and how we tackled them with a mix of CMake, Spack, and containers. We specifically focus on the creation of a large family of Spack-based container images used as test environments for the library, and on the efforts deployed to ensure easy installation on the wide range of supercomputers our downstream applications rely on.
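
As an illustration of the "annotated in a library-agnostic way" model described above, here is a minimal, hedged sketch of PDI-style instrumentation; the entry points (PDI_init, PDI_expose, PDI_finalize, paraconf's PC_parse_path) and the spec-file name follow PDI's documented C API but should be read as assumptions that may differ in detail.

```cpp
// Hedged sketch: the code only names and exposes its data; which library
// (HDF5, NetCDF, Python, ...) actually consumes it is decided by the YAML
// specification tree, keeping the simulation code library-agnostic.
#include <pdi.h>       // PDI C API (assumed available)
#include <paraconf.h>  // YAML tree parsing used by PDI

int main() {
  // Load the "data handling specification tree" from a YAML file.
  PDI_init(PC_parse_path("spec.yml"));
  double energy = 42.0;
  // Expose a named value to whatever plugin the spec tree routes it to.
  PDI_expose("energy", &energy, PDI_OUT);
  PDI_finalize();
  return 0;
}
```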
Speakers
Julien Bigot
Permanent Research Scientist, CEA
Julien is a permanent computer scientist at Maison de la Simulation at CEA. He leads the Science of Computing team. His research focuses on programming models for high-performance computing. He is especially interested in the question of separation of concerns between the simulated... Read More →
Salon E-G

10:00am CDT

Using Charliecloud to Wrap HTCondor Worker Nodes - Oliver Freyermuth, University of Bonn (Germany)
Thursday May 8, 2025 10:00am - 10:20am CDT
This talk will present a setup using Charliecloud to spawn virtual HTCondor compute nodes inside of jobs submitted to a SLURM cluster. The actual containers are distributed via CernVM-FS and mounted without privileges at the HPC site using cvmfsexec. The spawned HTCondor nodes integrate into a larger overlay batch system to run High-Throughput compute jobs from the Worldwide LHC Computing Grid community.

Charliecloud makes this setup very portable thanks to its lightweight design, minimal system dependencies, and simplicity of use. CernVM-FS, which is optimized for distributing large numbers of small files of which only a few might be accessed, proves an ideal fit for distributing directory-format container images. In combination with HTCondor, which focuses on optimizing total throughput and easily handles large numbers of jobs, compute resources can be used opportunistically by integrating them, fully unprivileged, into an overlay batch system. The workloads themselves can again use unprivileged containers to enable user-defined software stacks.
Speakers
Oliver Freyermuth
Research Scientist for IT Operations and High Throughput Computing, University of Bonn (Germany)
Illinois River

10:00am CDT

Spack Deployment Story at LBNL/UC Berkeley - Abhiram Chintangal, Lawrence Berkeley National Lab
Thursday May 8, 2025 10:00am - 10:20am CDT
The High-Performance Computing Services group at Lawrence Berkeley National Laboratory delivers extensive computing resources to Berkeley Lab and the University of California at Berkeley, supporting approximately 4,000 users and nearly 600 research projects across diverse scientific disciplines.

Over the past year and a half, we have modernized our primarily manual software build process using Spack, enabling us to meet the growing application and workflow demands of the HPC software stack.

This presentation will highlight how we leverage Spack's features—such as environments, views, and module sets—to meet our specific needs and requirements. Additionally, we will discuss how, over the past year, our Spack pipeline, integrated with ReFrame (a testing framework), has enabled our larger infrastructure team to efficiently plan and execute large-scale OS migrations across multiple scientific clusters in a short timeframe.
Speakers
Abhiram Chintangal
Site Reliability Engineer, Lawrence Berkeley National Lab
Abhiram is a Systems Engineer with over nine years of experience specializing in meeting the computational and IT demands of scientific labs. He has a deep understanding of the complexities of software in the data-driven landscape of modern science and recognizes its critical role... Read More →
Salon E-G

10:20am CDT

Coffee Break
Thursday May 8, 2025 10:20am - 10:45am CDT

10:45am CDT

Lessons Learned from Developing and Shipping Advanced Scientific Compressors with Spack - Robert Underwood, Argonne National Laboratory
Thursday May 8, 2025 10:45am - 11:05am CDT
Modern scientific applications increasingly produce extremely large volumes of data while the scalability of I/O systems has not increased at the same rate. Lossy data compression has helped many applications address these limitations, but to meet the needs of the most demanding applications, specialized compression pipelines are needed. The FZ project helps users and compression scientists collaborate to meet the I/O needs of exascale applications by making it easier to implement custom compression tools and integrate them with applications. However, to fulfill the complex needs of this diverse ecosystem of software and systems, the FZ project uses Spack to manage the complexity of developing, distributing, and deploying specialized compression pipelines to meet the needs of its developers and users.

This talk is given from the perspective of someone who has tried nearly every new Spack feature in the last five years and who maintains over 50 packages. It tells the story of how the FZ project tackled that complexity with Spack and where Spack can grow to meet its future challenges, coupled with tips and tricks we've learned along the way.
Speakers
Robert Underwood
Assistant Computer Scientist, Argonne National Laboratory
Assistant Computer Scientist in the Mathematics and Computer Science Division at Argonne National Laboratory focusing on data and I/O for large-scale scientific apps including AI for Science using lossy compression techniques and data management. Robert developed LibPressio, which... Read More →
Salon E-G

10:45am CDT

Charliecloud + Gitlab-CI: Building and Using System-Representative Base Containers - Nick Sly, Lawrence Livermore National Laboratory
Thursday May 8, 2025 10:45am - 11:25am CDT
Charliecloud is used in conjunction with Gitlab-CI to build out a matrix of system-representative containers that can be used to build target-system-compatible binaries for the automated building and testing of production codes on NNSA lab machines. This presentation covers the method of generating the base containers as well as a couple of use cases where they have proven helpful.
Speakers
Nick Sly
Scientist, Lawrence Livermore National Laboratory
Illinois River

10:45am CDT

Tuning and Performance
Thursday May 8, 2025 10:45am - 12:05pm CDT
1. Leveraging the C Configuration Space and Tuning Library (CCS) in Kokkos Tools - Brice Videau, Argonne National Laboratory (20 minutes)
Online autotuning of runtimes and applications presents untapped opportunities to increase HPC application performance and efficiency. During ECP, in order to exploit this potential, the autotuning working group at Argonne National Laboratory and the Kokkos team co-designed the Kokkos Tools tuning API and the C Configuration Space and Tuning Library (CCS). The Kokkos Tools tuning API creates a framework to plug tuners into Kokkos and expose tuning regions to them, while the CCS library offers an API both to capture Kokkos configuration spaces and to implement tuners that optimize them. This effort led to the creation of the CCS Kokkos connector, a Kokkos tool that leverages both APIs to offer a baseline tuner for Kokkos regions. In this presentation, we will present the results of this collaboration from the perspective of CCS, the abstractions it offers, and how they map to the Kokkos tuning model. We will describe the capabilities of the CCS library and how it fulfills the goal of offering a standard interface to bridge the gap between tuners and applications/runtimes. We will also discuss perspectives and future work around the CCS Kokkos connector.

2. Bottlenecks in High-Dimensional Simulations - Nils Schild, Max Planck Institute for Plasma Physics (20 minutes)
The Vlasov-Maxwell system, which describes the motion of charged particles of matter in a plasma state using a particle distribution function, is based on a 6-D phase space defined through configuration and velocity coordinates.
Considering an Eulerian grid for this system with only 32^6 degrees of freedom, the distribution function already requires 8.5 GB of memory (32^6 ≈ 1.07 × 10^9 grid points, at 8 bytes per double). This implies that high-resolution simulations can only be executed on large compute clusters.
In this talk, we focus on two aspects of the open-source code BSL6D, which solves a reduced version of the Vlasov-Maxwell system. The shared-memory parallelization based on Kokkos applies a stencil algorithm to data that is non-contiguous in memory in order to reduce memory requirements. The inter-node communication bottleneck poses a challenge due to the large ratio of halo domain to compute domain. Finally, we discuss the advantages of RAII-managed MPI communicators for distributed domains, which simplify the implementation of parallel algorithms with distributed memory concepts.

3. Accelerating SPECFEM++ with Explicit SIMD and Cache-Optimized Layouts - Rohit Kakodkar, Princeton University (20 minutes)
SPECFEM++ is a suite of computational tools based on the spectral element method, used to simulate wave propagation through heterogeneous media. The project aims to unify the legacy SPECFEM codes - three separate Fortran packages (SPECFEM2D, SPECFEM3D, and SPECFEM3D_globe) - into a single C++ package. This new package aims to deliver optimal performance across different architectures by leveraging the Kokkos library. In this presentation, I will outline our efforts to enhance CPU performance using explicit SIMD types (Kokkos::Experimental::simd; see the first sketch after this talk list). Achieving high vectorization throughput can be challenging, particularly because the data involved in spectral element assembly is not always organized in a cache-friendly way. To address this, we have implemented a strategy that prefetches the data into cache-optimized scratch views of SIMD types before executing the SIMD operations. Additionally, we have optimized data layouts using custom-defined tiled layouts that improve cache locality. As a result of these optimizations, we have achieved approximately a 2.5x speed-up compared to auto-vectorized implementations.

4. Managing Kokkos Callbacks for Benchmarking, Profiling, and Unit Testing - Maarten Arnst & Romin Tomasetti, University of Liège (20 minutes)
Many Kokkos functions have instrumentation hooks defined within the framework of Kokkos::Tools. These instrumentation hooks allow Kokkos::Tools, as well as third-party tracing, profiling, and testing tools, to register callbacks to monitor and interact with the runtime behavior of the program (see the second sketch after this talk list). In this presentation, we will describe several utilities that we have designed to help manage such callbacks. We have implemented a manager class that can register function objects listening to such callbacks, along with several such function objects: an event recorder, an event counter, and a kernel timer that uses event stream synchronization markers on device backends. We will illustrate these utilities through their use in benchmarking, profiling, and unit testing of a Kokkos-based finite-element code.
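
For context on the explicit SIMD types mentioned in the third talk, here is a minimal, hedged sketch of the style of code involved; the alias and tag names (native_simd, element_aligned_tag) follow the std::experimental::simd conventions that Kokkos::Experimental::simd mirrors, and are assumptions rather than a guaranteed API.

```cpp
// Hedged sketch: process W lanes at a time with explicit SIMD over
// contiguous, cache-friendly host data (type/tag names assumed, see lead-in).
#include <Kokkos_Core.hpp>
#include <Kokkos_SIMD.hpp>

int main(int argc, char* argv[]) {
  Kokkos::initialize(argc, argv);
  {
    using simd_t = Kokkos::Experimental::native_simd<double>;
    using tag_t  = Kokkos::Experimental::element_aligned_tag;
    constexpr int W = static_cast<int>(simd_t::size());
    const int n = 1024; // assumed divisible by W for brevity
    Kokkos::View<double*, Kokkos::HostSpace> a("a", n), b("b", n);
    for (int i = 0; i < n; i += W) {
      simd_t va, vb;
      va.copy_from(&a(i), tag_t{}); // load W contiguous elements
      vb.copy_from(&b(i), tag_t{});
      va = va * vb + vb;            // vectorized arithmetic on all lanes
      va.copy_to(&a(i), tag_t{});   // store the result back
    }
  }
  Kokkos::finalize();
}
```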
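
And for the fourth talk, the following is a small, hedged sketch of registering an in-process callback through Kokkos' profiling hooks, in the spirit of the event counter described above; the Experimental callback-setter name is assumed from the Kokkos Tools interface.

```cpp
// Hedged sketch: count and print parallel_for launches from inside the
// application by registering a callback on Kokkos' profiling hooks.
#include <Kokkos_Core.hpp>
#include <atomic>
#include <cstdint>
#include <cstdio>

std::atomic<int> launch_count{0};

int main(int argc, char* argv[]) {
  Kokkos::initialize(argc, argv);
  {
    // Setter name assumed from the Kokkos Tools in-process interface.
    Kokkos::Tools::Experimental::set_begin_parallel_for_callback(
        [](const char* name, uint32_t /*device_id*/, uint64_t* /*kernel_id*/) {
          ++launch_count; // fires every time a parallel_for begins
          std::printf("launch: %s\n", name);
        });
    Kokkos::parallel_for("axpy-like", 16, KOKKOS_LAMBDA(int) {});
    Kokkos::fence();
  }
  Kokkos::finalize();
  std::printf("kernels launched: %d\n", launch_count.load());
}
```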
Speakers
Brice Videau
Computer Scientist, Argonne National Laboratory
Brice is a computer scientist, co-leading the performance engineering team at the Argonne Leadership Computing Facility. Brice's research topics include heterogeneous programming models, system software, auto-tuning, code generation, and code transformation.
Maarten Arnst
Associate Professor, University of Liege
Associate Professor at University of Liege.
Nils Schild
PhD Student, Max Planck Institute for Plasma Physics
After studying physics and working on solvers for sparse eigenvalue problems in quantum mechanics at the University of Bayreuth, he moved to the Max Planck Institute for Plasma Physics in Garching (Germany). During his Ph.D., he started implementing the software BSL6D, a solver for... Read More →
Rohit Kakodkar
Research Software Engineer II, Princeton University
Rohit is a Research Software Engineer in Princeton University's Research Computing department. He is focused on rewriting SPECFEM, a spectral element solver designed to simulate wave propagation through heterogeneous media. SPECFEM is extensively used within the computational seismology... Read More →
Romin Tomasetti
PhD Student, University of Liège
PhD student at University of Liège.
Salon A-C

11:05am CDT

Challenges Mixing Spack-Optimized Hardware Accelerator Libraries on Personal Scientific Computers - Pariksheet Nanda, University of Pittsburgh
Thursday May 8, 2025 11:05am - 11:25am CDT
Personal computing devices sold today increasingly include AI hardware accelerators such as neural processing units and graphics cards with compute capability. However, scientific libraries packaged for laptop and desktop computers focus first on broad instruction set compatibility. Yet hardware-optimized libraries and behaviors can be applied at runtime, as is widely done via Intel MPI environment variables. This session discusses the unique use case of the R package system for vendor-neutral hardware acceleration using vendor-agnostic SYCL/Kokkos. The goal is to allow scientific package developers to quickly and easily write vendor-independent accelerator code, with deep control and tuning capabilities, that uses hardware acceleration as well on laptop/desktop machines as on HPC clusters. Although R is specifically discussed, ideas from this session translate to Python and other high-level language packages used in scientific computing. Additionally, this session raises technical challenges in directly using Kokkos, as well as Apptainer for continuous integration, that would greatly benefit from early-stage feedback from audience members at this conference.
Speakers
Pariksheet Nanda
Postdoctoral Fellow, University of Pittsburgh
Pariksheet first learned about Spack from his university research HPC supervisor, who returned from Supercomputing and told him about the "cool new project we need to start using", and has been hooked ever since. When not working on research manuscripts, he enjoys reading and writing... Read More →
Salon E-G

11:25am CDT

An Opinionated-Default Approach to Enhance Spack Developer Experience - Kin fai Tse, The Hong Kong University of Science and Technology
Thursday May 8, 2025 11:25am - 11:45am CDT
Despite Spack's strengths as a feature-rich HPC package manager that generates fast executables for HPC apps, its adoption remains limited, partly due to a steep learning curve and its perception as primarily a sysadmin tool.

We propose a set of opinionated defaults that help new users quickly adopt best practices with guaranteed reproducibility and architecture compatibility. The approach draws from conventions used in popular userspace Python package managers such as pip and conda, which have proven effective.

Unlike Python, Spack is a source-distribution system, so compilation errors are a common challenge. We experimented with smoke-testing compatibility across compilers, libraries, and x86_64 architectures. The results are encoded as conflict rules in the defaults, a practice that helps avoid many common build failures.

We successfully deployed this approach on x86_64 platforms with substantially different purposes (DL vs. HPC), demonstrating its transferability and showing that current Spack features are sufficient for the implementation. Additional DX enhancements will be discussed. The defaults are available as an open-source repository.
Speakers
Kin fai Tse
IT Manager (Research Computing), The Hong Kong University of Science and Technology
Dr. Kin Fai Tse oversees DGX cluster operations and HPC migration at HKUST. After his Physics Ph.D., he led MLOps at a voicebot startup (2021). Co-founding Flying Milktea (2022), he built a marketplace with ~2-week onboarding for new interns. He was lead coach for... Read More →
Salon E-G

11:25am CDT

Maintaining the Debian Charliecloud Package - Peter Wienemann, Independent
Thursday May 8, 2025 11:25am - 12:05pm CDT
Charliecloud has been in the Debian archive since the early development days of Charliecloud. It was initially packaged by Lucas Nussbaum at the end of 2017/beginning of 2018. The speaker joined the packaging effort in January 2018 and has contributed continuously since then. This talk will give a brief introduction to Debian and describe how its tool set was useful for improving the Debian Charliecloud package and feeding improvements back into the upstream project. The information flow from upstream authors to package maintainers has also been exemplary; this presentation will provide a few examples of this fruitful interplay.
Speakers
Peter Wienemann
Independent
Illinois River

11:45am CDT

Developing and Managing Data Acquisition Software Using Spack - Eric Flumerfelt, Fermi National Accelerator Laboratory
Thursday May 8, 2025 11:45am - 12:05pm CDT
The data acquisition systems of particle physics experiments regularly push the boundaries of high-throughput computing, demanding low-latency collection of data from thousands of devices, collation of data into time-sliced events, processing of these events to make trigger decisions, and writing of the selected data streams to disk. To accomplish these tasks, the DAQ Engineering and Operations department at Fermilab leverages multiple software libraries and builds reusable DAQ frameworks on top. These libraries must be delivered in well-defined bundles and are thoroughly tested for compatibility and functionality before being deployed to live detectors. We use several techniques to ensure that a consistent set of dependencies can be delivered and re-created as needed. We must also support active development of DAQ software components, ideally in an environment as close as possible to that of the detectors. This development often occurs across multiple packages that have to be built in concert, with features tested in a consistent and reproducible manner.
I will present our scheme for accomplishing these goals using Spack environments, bundle packages, and GitHub Actions-based CI.
Speakers
Eric Flumerfelt
Computational Physics Developer, Fermi National Accelerator Laboratory
I have been developing data acquisition systems at Fermilab since 2014. I have worked with a number of particle physics experiments, from small test-beam experiments which run for two weeks to large international collaborations.
Salon E-G

12:05pm CDT

Lunch (Provided for Attendees)
Thursday May 8, 2025 12:05pm - 1:35pm CDT
Atrium

1:35pm CDT

Key Charliecloud Innovation - Kubernetes - Angelica Loshak, Los Alamos National Laboratory
Thursday May 8, 2025 1:35pm - 1:55pm CDT
Kubernetes automates container deployment and management across environments. HPC users can benefit from Kubernetes to support the increasing demand for novel workflows, especially in AI. Kubernetes' declarative approach allows users to schedule, scale, and maintain metrics on containers while supporting multiple container runtimes. Charliecloud can enhance HPC workloads when integrated with Kubernetes. However, Kubernetes only supports container runtimes that implement the Container Runtime Interface (CRI), which Charliecloud does not. To address this, we developed a prototype CRI-compatible server for Charliecloud, allowing Kubernetes to manage pods and to create, start, and track Charliecloud containers. Despite Kubernetes expecting certain features that Charliecloud does not use, such as network namespaces, we show that the two systems can still communicate effectively. Our implementation requires 700 lines of new code, fewer than 50 lines of modification to Charliecloud, and no changes to Kubernetes. This demonstrates that Kubernetes and Charliecloud are compatible tools, advancing scientific workflows that require large compute power.

LA-UR-24-28252
Speakers
Angelica Loshak
Student, Los Alamos National Laboratory
Illinois River

1:35pm CDT

From Complexity to Efficiency: Spack's Impact on NSM Supercomputers - Samir Shaikh & Harshitha Ugave, Centre for Development of Advanced Computing (C-DAC)
Thursday May 8, 2025 1:35pm - 1:55pm CDT
The National Supercomputing Mission (NSM) advances India’s research by providing HPC infrastructure across institutions. However, managing software on diverse HPC systems is challenging due to hardware variations, dependencies, and version control.

Spack, a flexible package manager, addresses these issues by enabling seamless software deployment and dependency management across clusters. This study examines Spack’s implementation on 17 NSM HPC systems, improving software availability and consistency.

Spack simplifies this through customized installations, automated dependency handling, and reproducible builds, ensuring compatibility.

Implementation involved a centralized repository, automated builds, user training, software optimization, and continuous refinement. This improved research productivity, reduced support overhead, and standardized environments.

Key benefits include reproducibility, faster issue resolution, and better collaboration. Future plans involve expanding Spack repositories, integrating containers, automating updates, and training. This presentation covers our implementation, challenges, and best practices.
Speakers
Samir Shaikh
Scientist, Centre for Development of Advanced Computing (C-DAC)
Samir Shaikh is an HPC specialist at C-DAC, Pune, optimizing large-scale workloads, parallel computing, and system architecture. As a Scientist C, he enhances HPC performance for AI/ML, scientific computing, and NSM supercomputers. An IIT Guwahati M.Tech graduate, he has contributed... Read More →
Salon E-G

1:35pm CDT

Algorithms
Thursday May 8, 2025 1:35pm - 3:15pm CDT
1. Gyselalib++: A Portable, Kokkos-Based Library for Exascale Gyrokinetic Simulations - Etienne Malaboeuf, CINES (10 minutes)
The development of fusion energy in magnetic confinement devices relies heavily on simulations of plasma behavior. Gyselalib++ is a new open-source C++ library under active development by a European distributed and multidisciplinary team of physicists, mathematicians, and computer scientists at EPFL, CEA/IRFM, Maison de la Simulation, IPP Garching, and CINES. Gyselalib++ is itself built on top of PDI, DDC and Kokkos and provides mathematical tools for gyrokinetic semi-Lagrangian codes for tokamak plasma simulations. This presentation will introduce the library, its design and the rationale behind its development, and will highlight its key features. It will showcase how the choice of Kokkos made it possible to achieve high performance on modern hardware with performance portability over a wide range of hardware, and will explain the need to introduce DDC to improve development safety. We will discuss feedback from this experience, analyze our successes and the limitations of the approach, especially when it comes to performance, performance portability, and programmability of the code by a highly diverse team in terms of background.

2. Expression Templates with Kokkos for Lattice QCD - Travis Whyte, Jülich Supercomputing Centre (10 minutes)
Lattice quantum chromodynamics (QCD) is a first-principles approach to studying the interaction of quarks and gluons. The calculation of observables in lattice QCD requires many different operations between multidimensional arrays of various ranks. In this talk, I will describe an implementation of expression templates using Kokkos that allows lattice QCD practitioners to implement linear algebra operations simply, while avoiding temporaries, for views of arbitrary rank (see the first sketch after this talk list). This abstraction has the potential to promote high productivity in the development process. The performance of various benchmarks on different architectures will also be discussed.

3. Bridging Parallel Communication and On-Node Computation with Kokkos - Evan Suggs, Tennessee Technological University (20 minutes)
Although MPI and Kokkos have long been used together, there have been no well-defined methods for integrating them effectively; the only approach has been to pass the underlying Kokkos View buffers to MPI functions (see the second sketch after this talk list).
This causes several major pain points: handling non-contiguous Views, asynchronous operations in both models, and the interaction of MPI with Kokkos profiling. Kokkos Comm is an experimental MPI interface for the Kokkos C++ Performance Portability Programming ecosystem that aims to address these concerns and improve the productivity of Kokkos users.
Currently, Kokkos Comm integrates point-to-point and collective operations, handling of non-contiguous Views, and Kokkos Tools profiling. Kokkos Comm also aims to be a springboard for new and improved features that go beyond MPI and Kokkos, allowing Kokkos to work with MPI, stream-triggered MPIs, and other non-MPI communication libraries (e.g., NCCL and RCCL). This presentation will cover the Kokkos Comm API, conversion of existing code, best practices, how Kokkos Comm can help address common issues in Kokkos/MPI, and upcoming additions to Kokkos Comm, such as persistent communication and device-initiated communication.

4. Integration of PETSc, Kokkos Core, and Kernels for Performance Portability in the Age of Accelerators - Junchao Zhang, Argonne National Laboratory (20 minutes)
PETSc, the Portable, Extensible Toolkit for Scientific Computation, provides an extensive suite of scalable parallel solvers for linear and nonlinear equations, ordinary differential equation (ODE) integrators, and optimization algorithms. Widely adopted in both industry and academia, PETSc historically achieved performance portability through the C programming language and the Message Passing Interface (MPI) programming model. It used single-threaded MPI processes for both shared and distributed memory systems. This strategy had served us very well in the microprocessor age. However, the recent proliferation of accelerator-based architectures, particularly graphics processing units (GPUs), has posed new challenges to this performance portability. To address these challenges, we have integrated PETSc with the Kokkos ecosystem, specifically Kokkos-Core and Kokkos-Kernels. In this presentation, we describe our integration approach, highlight our experiences—both effective strategies and encountered challenges—and outline future developments aimed at further enhancing performance portability across evolving computational architectures.

5. Parallel Sweep Algorithms for Cartesian and Honeycomb Grids - Ansar Calloo, CEA (20 minutes)
The linear Boltzmann transport equation (BTE) is the governing equation for expressing the behaviour of neutral particles in a system such as a nuclear reactor. The BTE can be solved for the flux of particles using deterministic methods, whereby the equation is discretised in the phase space of its fundamental variables. This discrete equation is then usually solved using source iteration. In this talk, we will present how the sweep algorithm, which is based on a wavefront pattern, has been optimised for shared-memory CPU systems, along with some preliminary results on GPU. The goal is to show how to adapt the sweep algorithm to be efficient on new supercomputer architectures. We will briefly introduce DONUT (Discrete Ordinates NeUtron Transport), a modern C++ miniapp for solving the BTE based on discrete ordinates and discontinuous Galerkin discretisations for Cartesian and honeycomb grids.
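
To illustrate the expression-template idea in the second talk, here is a minimal, hedged sketch in plain C++: operator+ builds a lightweight node instead of a temporary, and the sum is evaluated lazily, element by element, on assignment. This is illustrative only, not the speaker's implementation, which targets Kokkos Views of arbitrary rank.

```cpp
// Hedged sketch of expression templates: no intermediate arrays are
// materialized; the whole right-hand side collapses into one fused loop.
#include <cstddef>
#include <cstdio>
#include <vector>

template <class L, class R>
struct Sum {
  const L& l;
  const R& r;
  double operator[](std::size_t i) const { return l[i] + r[i]; }
  std::size_t size() const { return l.size(); }
};

struct Field {
  std::vector<double> data;
  explicit Field(std::size_t n, double v = 0.0) : data(n, v) {}
  double operator[](std::size_t i) const { return data[i]; }
  std::size_t size() const { return data.size(); }
  // Assignment triggers the single evaluation loop over the expression tree.
  template <class E>
  Field& operator=(const E& expr) {
    for (std::size_t i = 0; i < size(); ++i) data[i] = expr[i];
    return *this;
  }
};

template <class L, class R>
Sum<L, R> operator+(const L& l, const R& r) { return {l, r}; }

int main() {
  Field a(4, 1.0), b(4, 2.0), c(4, 3.0), out(4);
  out = a + b + c; // one fused loop; (a + b) is never materialized
  std::printf("%g\n", out[0]); // prints 6
}
```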
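
And for the third talk, the following is a hedged sketch of today's status quo it describes: handing a contiguous Kokkos View's buffer directly to MPI. Non-contiguous layouts and asynchrony are exactly where this breaks down, which is what motivates Kokkos Comm.

```cpp
// Hedged sketch of the MPI+Kokkos status quo: pass the View's raw buffer to
// MPI. Run with 2 ranks; device Views additionally require GPU-aware MPI.
#include <Kokkos_Core.hpp>
#include <mpi.h>

int main(int argc, char* argv[]) {
  MPI_Init(&argc, &argv);
  Kokkos::initialize(argc, argv);
  {
    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    Kokkos::View<double*> v("v", 100);
    Kokkos::deep_copy(v, static_cast<double>(rank));
    Kokkos::fence(); // make sure the data is ready before MPI touches it
    if (rank == 0) {
      // Works only because this View is contiguous; a strided subview wouldn't be.
      MPI_Send(v.data(), static_cast<int>(v.size()), MPI_DOUBLE, 1, 0,
               MPI_COMM_WORLD);
    } else if (rank == 1) {
      MPI_Recv(v.data(), static_cast<int>(v.size()), MPI_DOUBLE, 0, 0,
               MPI_COMM_WORLD, MPI_STATUS_IGNORE);
    }
  }
  Kokkos::finalize();
  MPI_Finalize();
}
```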
Speakers
avatar for Ansar Calloo

Ansar Calloo

Research engineer, CEA
Ansar obtained his PhD in deterministic neutron transport at CEA. For the past fifteen years, he has been working on improving simulations for reactor physics applications first at EDF R&D, then CEA. His research interests involve nuclear reactor model, numerical methods to solve... Read More →
avatar for Etienne Malaboeuf

Etienne Malaboeuf

HPC Engineer, CINES/CEA
I focus on improving the performance of projects related to real-time and high-performance computing, while providing various forms of support to researchers using French supercomputers. I have worked on numerical simulation software in an HPC context, on supercomputers and on game... Read More →
Evan Suggs

Staff Researcher, Tennessee Technological University
Evan Drake Suggs is a Research Scientist at Tennessee Technological University in Cookeville, Tennessee. In 2023, Suggs graduated with a Master's degree in Data Science from the University of Tennessee at Chattanooga and presented his thesis work on MPI+Kokkos using the ExaMPI implementation... Read More →
Junchao Zhang

Principal Specialist, Research Software Engineering, Argonne National Laboratory
Junchao Zhang is a software developer at Argonne. He currently works on the Portable, Extensible Toolkit for Scientific Computation (PETSc) project. Before joining PETSc, he was an MPICH developer at Argonne and developed the MPI Fortran 2008 binding and MPI tool interface of MPI-3.0... Read More →
Travis Whyte

Postdoc, Jülich Supercomputing Centre
I graduated from Baylor University with a Ph.D. in Physics, focusing on algorithmic improvements for lattice QCD simulations. Since then, I have continued to work in the field, focusing on improving iterative solvers, scattering simulations and HPC software development.
Thursday May 8, 2025 1:35pm - 3:15pm CDT
Salon A-C

1:55pm CDT

BEE: Orchestrating Workflows with Containerized Applications Leveraging Charliecloud - Krishna Chilleri, Los Alamos National Laboratory
Thursday May 8, 2025 1:55pm - 2:15pm CDT
Build and Execution Environment (BEE) is a workflow orchestration system designed to build containerized HPC applications and orchestrate workflows across HPC and cloud systems. BEE integrates with existing tools and infrastructure in the scientific computing ecosystem, making it an ideal choice for large-scale scientific simulations. These tools and standards allow for efficient provisioning, scheduling, and monitoring of individual tasks, while providing flexibility, portability, and reproducibility of workflows. This presentation will highlight how BEE leverages Charliecloud, one of its container runtimes, to facilitate unprivileged builds, pull images from registries, and run containerized applications.

LA-UR-25-22166
Speakers
KC

Krishna Chilleri

Student, Los Alamos National Laboratory
Thursday May 8, 2025 1:55pm - 2:15pm CDT
Illinois River

1:55pm CDT

Deploying a Large HPC Software Stack - Challenges and Experiences - Jose Gracia, HLRS, University Stuttgart
Thursday May 8, 2025 1:55pm - 2:15pm CDT
We aim to use Spack to deploy a large software stack at a German national HPC center. In this talk, we will give some background on the size of the software stack, its deployment frequency, and constraints arising from the operational environment. Next, we will briefly outline some of the challenges and obstacles we encountered, such as configuration issues, interaction with the Cray Programming Environment, and unexpected outcomes of the concretizer. We end the talk with the current status and next steps.
Speakers
Jose Gracia

Senior Researcher, HLRS, University Stuttgart
Together with his group, José Gracia does research into topics related to scalable programming models such as new approaches to MPI or task-based programming models and their interoperability at scale. He also works on performance analysis tools, characterization of application performance... Read More →
Thursday May 8, 2025 1:55pm - 2:15pm CDT
Salon E-G

2:15pm CDT

Building and Maintaining OSS on Fugaku: RIKEN’s Experience with Spack - Yuchi Otsuka, RIKEN R-CCS
Thursday May 8, 2025 2:15pm - 2:35pm CDT
Fugaku, Japan’s flagship supercomputer, serves a diverse range of scientific disciplines, requiring extensive open-source software (OSS) support. However, managing OSS on Fugaku presents unique challenges due to its A64FX-based Arm architecture and Fujitsu’s proprietary compilers and libraries. Our team has been leveraging Spack to efficiently manage and maintain OSS. In this talk, we will share our experience using Spack on Fugaku, highlighting how it has enabled a robust and up-to-date OSS environment. We will discuss the practical benefits of Spack, including streamlined software deployment and simplified package management, and reflect on lessons learned from maintaining software in a large-scale HPC system. By sharing our insights, we aim to contribute to the broader Spack community and reinforce its role as a key tool for HPC software management.
Speakers
Yuchi Otsuka

Technical Scientist, RIKEN R-CCS
I have a long-standing background in computational condensed-matter physics research and have been involved in managing and maintaining OSS on Fugaku since 2022. My role is to ensure a robust and up-to-date OSS environment on Fugaku to support a wide range of scientific applicati... Read More →
Thursday May 8, 2025 2:15pm - 2:35pm CDT
Salon E-G

2:15pm CDT

Charliecloud Office Hours - Reid Priedhorsky, Los Alamos National Laboratory
Thursday May 8, 2025 2:15pm - 3:15pm CDT
Members of the Charliecloud team will be available for office hours to listen to feedback/suggestions, answer questions, and/or help debug issues. 

LA-UR-25-22140
Speakers
RP

Reid Priedhorsky

Scientist, Los Alamos National Laboratory
I am a staff scientist at Los Alamos National Laboratory. Prior to Los Alamos, I was a research staff member at IBM Research. I hold a Ph.D. in computer science from the University of Minnesota and a B.A., also in computer science, from Macalester College. My work focuses on large-scale... Read More →
Thursday May 8, 2025 2:15pm - 3:15pm CDT
Illinois River

2:35pm CDT

Using Spack to Build and Maintain a Facility-Specific Programming Environment - Nicholas Sly, Lawrence Livermore National Laboratory
Thursday May 8, 2025 2:35pm - 2:55pm CDT
This talk recounts the trials and tribulations of using Spack to construct, build, and maintain a facility-specific programming environment at LLNL, and describes our work with the Spack developers to ensure that Spack can do what it claims in a large production environment.
Speakers
NS

Nick Sly

Scientist, Lawrence Livermore National Laboratory
Thursday May 8, 2025 2:35pm - 2:55pm CDT
Salon E-G

2:55pm CDT

Aurora PE: Rethinking Software Integration in the Exascale Era - Sean Koyama, Argonne National Laboratory
Thursday May 8, 2025 2:55pm - 3:15pm CDT
The exascale Aurora supercomputer at the Argonne Leadership Computing Facility posed numerous challenges during its development due to its novel scale. One such challenge was creating a scalable and maintainable scientific software environment: typical software deployment methods failed to scale and were difficult to maintain over time, necessitating a new way of thinking about software integration. In this talk we present our work on the Aurora Programming Environment (PE), a bespoke scientific programming environment that optimizes for scale and leverages Spack for its strengths in reproducibility, automation, and multiplicative build combinations. We discuss details of the containerized build process and the read-only image deployment strategy, as well as existing pain points and workarounds. We also examine the future possibilities that our approach opens up, including tightly integrated CI/CD flows and portable containerized access to the PE. We believe this approach is generalizable and may benefit facilities where traditional software integration methods fall short.
Speakers
Sean Koyama

Systems Integration Admin, Argonne National Laboratory
Sean Koyama is a Systems Integration Administrator at Argonne National Laboratory's Leadership Computing Facility. Sean integrates scientific software stacks into the user environments on ALCF machines, including Aurora, the ALCF's exascale supercomputer. Their work includes developing... Read More →
Thursday May 8, 2025 2:55pm - 3:15pm CDT
Salon E-G

3:15pm CDT

Coffee Break
Thursday May 8, 2025 3:15pm - 3:40pm CDT
Thursday May 8, 2025 3:15pm - 3:40pm CDT

3:40pm CDT

Driving Continuous Integration and Developer Workflows with Spack - Richard Berger, Los Alamos National Laboratory
Thursday May 8, 2025 3:40pm - 4:00pm CDT
Spack makes it easy to install dependencies for our software on multiple HPC platforms. However, there is little guidance on how to structure Spack environments for larger projects, share common Spack installations with code teams, and use them effectively for continuous integration and development.

This presentation will share lessons learned from deploying chained Spack installations for multiple code teams at LANL on various HPC platforms, both on site and on other Tri-Lab systems; how to structure such deployments for reusability and upgradability; and how to make them deployable even on air-gapped systems. It will also show how we use Spack's build facilities to drive CMake-based projects on GitLab for continuous integration, without replicating build-configuration logic in GitLab files, while giving developers an easy-to-follow workflow for recreating CI runs in various configurations.
Speakers
Richard Berger

Scientist, Los Alamos National Laboratory
Richard is a research software engineer in the Applied Computer Science Group (CCS-7) at Los Alamos National Laboratory (LANL) with a background in Mechatronics, high-performance computing, and software engineering. He is currently contributing to the core development of LAMMPS, FleCSI... Read More →
Thursday May 8, 2025 3:40pm - 4:00pm CDT
Salon E-G

3:40pm CDT

Panel Discussion to be Announced
Thursday May 8, 2025 3:40pm - 5:00pm CDT
Thursday May 8, 2025 3:40pm - 5:00pm CDT
Salon A-C

4:00pm CDT

Implementing a Security Conscious Build Configuration Relay with a Shared Build Cache - Chris White, Lawrence Livermore National Laboratory
Thursday May 8, 2025 4:00pm - 4:20pm CDT
In large-scale software development efforts, effective communication between projects is essential to ensure consistency, reproducibility, and efficiency. This presentation explores strategies to improve coordination among software teams by leveraging Continuous Integration (CI) for relaying crucial build configurations while maintaining security for proprietary project sources. We will demonstrate best practices for sharing build configurations with upstream projects without exposing proprietary code.

A key focus will be optimizing the use of Spack, particularly reducing the number of Spack package repositories used across multiple teams, which simplifies maintenance, hardens builds, and avoids duplication. Additionally, we will highlight the benefits of integrating Spack CI to generate build caches, which reduces rebuild times and enhances software portability. By adopting these approaches, teams can achieve better collaboration, streamlined workflows, and improved software sustainability.
Speakers
Chris White

WSC DevOps Coordinator, Lawrence Livermore National Laboratory
Chris White is the WSC DevOps Coordinator at Lawrence Livermore National Laboratory. He advises multi-disciplinary teams on software best practices with a focus on unifying complex DevOps workflows across multiple teams. Chris specializes in improving collaboration while ensuring... Read More →
Thursday May 8, 2025 4:00pm - 4:20pm CDT
Salon E-G

4:20pm CDT

Spack-Based WEAVE Environment at LLNL - Lina Muryanto, Lawrence Livermore National Security, LLC
Thursday May 8, 2025 4:20pm - 4:50pm CDT
The WEAVE team at LLNL has created a Spack-based virtual environment, accessible to the Livermore Computing (LC) community, that provides a rich set of open-source tools for building workflows for HPC applications, along with commonly used Python packages and several widely used ML and AI packages.
The goal is to provide a stable, well-tested environment that users can activate and use directly across Livermore Computing's vast array of machines, operating systems, and networks.
We also provide the capability for users to create a local environment based on the WEAVE environment.

Using Spack allows us to install the same set of software across different platforms in LC, and to use the same Spack environment file to recreate the exact same ecosystem across networks within the Lab.
We leverage GitLab CI as our DevOps platform to automate Spack installs, create test environments, run tests, and deploy the environment.
We also leverage the LLNL Nexus Repository to sync our build files across networks within the Lab.

The WEAVE team has also implemented the WEAVE Badging Program, through which the community can submit a request to have a tool integrated into the WEAVE environment.
Speakers
Lina Muryanto

Software Engineer, Lawrence Livermore National Security, LLC
Lina joined LLNL in 2018 as a DevOps engineer for the ESGF Project. In 2021, she joined the SD program, and in 2022 she joined the WEAVE team. She has implemented CI/CD from scratch. Lina is passionate about achieving high software quality and reliability through software test development... Read More →
Thursday May 8, 2025 4:20pm - 4:50pm CDT
Salon E-G

4:40pm CDT

DevOps for Monolithic Repositories Using Spack - Phil Sakievich, Sandia National Laboratories
Thursday May 8, 2025 4:40pm - 5:00pm CDT
In the realm of large software projects, the choice between a monolithic repository and several distributed repositories presents significant trade-offs that can impact development efficiency, collaboration, and maintainability. Monolithic repositories, while offering centralized management and streamlined dependency handling, can become unwieldy as project size increases. Conversely, distributed repositories provide modularity and flexibility but may lead to challenges in integration and version control. This presentation will delve into ongoing research at Sandia National Laboratories exploring how to harness the strengths of both repository models through Spack, a package manager designed for scientific computing. We will outline the methodology employed in this exploration, highlighting the performance trade-offs identified thus far, including build times, dependency resolution, and ease of collaboration. Attendees will gain insights into the implications of repository structure on software development practices and the potential for hybrid approaches to optimize project outcomes.
Speakers
Phil Sakievich

Senior Computer Scientist R&D, Sandia National Laboratories
Phil comes from a high-performance computing and fluid mechanics background. He became involved with Spack during the Exascale Computing Project and is the author of the Spack-Manager project. Phil is an active member of the Spack technical steering committee and currently leads several... Read More →
Thursday May 8, 2025 4:40pm - 5:00pm CDT
Salon E-G
 