Loading…
May 5-8, 2025
Chicago, IL
View More Details & Registration

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for the event to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to find out more information.

This schedule is automatically displayed in Central Time (UTC/GMT -6 hours). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date."

IMPORTANT NOTE: Timing of sessions and room locations are subject to change.

Venue: Chicago River Ballroom clear filter
arrow_back View All Dates
Wednesday, May 7
 

1:35pm CDT

Adopting Kokkos
Wednesday May 7, 2025 1:35pm - 3:15pm CDT
1. A Brief Overview of LANL's use of Kokkos - Daniel Holladay, Los Alamos National Laboratory (20 minutes)
Since the commissioning of the first petascale machine, Roadrunner, in 2009 at the Los Alamos
National Laboratory (LANL), the ability for physics codes at LANL to take advantage of accelerators
has provided utility and productivity improvements for code users. The ability to take advantage of an
accelerator, and more specifically general purpose graphics processing units (GPGPUs), will quickly
move from a productivity enhancement to absolutely necessary as more than 90% of the compute
capability of the El Capitan supercomputer at Lawrence Livermore National Laboratory (LLNL) will
only be accessed through effective use of its GPGPUs, a task which has traditionally been
accomplished with vendor specific software extensions such as CUDA or HIP. Many projects with
code bases ranging from large and established FORTRAN codes to new c++ based projects have
made the decision to use Kokkos as the tool that will enable effective use of LLNL's El Capitan
compute resources as well as future machines which could likely benefit from Kokkos's capabilities.
In this talk I will give an overview of several physics code projects at LANL and their usage of Kokkos.

2. Enhancing Fortran Code for Operational Weather Forecasting with Kokkos: Results and Lessons Learned - Timothy Sliwinski, Cooperative Institute for Research in the Atmosphere (CIRA) (20 minutes)
At NOAA, much of the code for numerical weather prediction (NWP) and operational weather forecasting is built upon Fortran, into which decades of scientific research knowledge and expertise has been invested. Therefore, moving away from Fortran and potentially breaking what has been a highly reliable system for many years is a significant challenge.
To demonstrate new methods to modernize NOAA’s NWP models, Kokkos was selected due to its ability to work across multiple GPUs and CPUs with a single source code and the presence of the Fortran Language Compatibility Layer (FLCL), easing development of the interface between Fortran and C++ Kokkos kernels. As a first step, the YSU Planetary Boundary Layer (PBL) scheme was chosen as the target and a prototype with Kokkos was developed, tested, and performance benchmarked. In this presentation, we report the performance of this new Kokkos-enhanced Fortran code on CPU and an Nvidia GPU, the challenges of the C/Fortran interface, potential future prospects for the use of Kokkos at NOAA, and overall lessons learned from this project for anyone else interested in using Kokkos with existing Fortran source codes.

3. Using Umpire's Memory Management Capabilities with Kokkos - Kristi Belcher, LLNL (20 minutes)
Umpire is an open-source data and memory management library created at Lawrence Livermore National Laboratory (LLNL). Although Umpire is part of the RAJA Portability Suite, it was made to be modular and can therefore be used with Kokkos and other performance portability abstractions. Umpire provides memory pools which avoid expensive calls to the underlying device-specific API making allocations, large or small, performant in HPC environments. Umpire provides numerous types of memory resources and allocators (i.e. Device, Host, Unified Memory, IPC Shared Memory, etc.). In this talk, I will discuss key Umpire features and capabilities and showcase a Kokkos example with Umpire.

4. Early Experiences Using Kokkos for Multi-Resolution Analysis - Joseph Schuchart, Stony Brook University (20 minutes)
MADNESS is a framework for multi-resolution analysis with application in quantum chemistry. In this talk, we will present some early experiences in using Kokkos in a port of MADNESS to the TTG data-flow programming model, which includes both a restructuring of the existing program flow and a port to accelerators.
Speakers
avatar for Daniel Holladay

Daniel Holladay

Computational Physicist, Los Alamos National Laboratory
Daniel Holladay is the deputy project leader for computer science for the project that maintains the FLAG Lagrangian multi-physics code at the Los Alamos National Laboratory (LANL). He received a Ph.D. in Nuclear Engineering from Texas A&M University in 2018 while working as a LANL... Read More →
avatar for Joseph Schuchart

Joseph Schuchart

Senior Research Scientist, Stony Brook University
Joseph Schuchart is a Senior Research Scientist at the Institute for Advanced Computational Science at Stony Brook University. He has been working on distributed data flow programming models and communication models, currently working at the intersection with computational chemistry... Read More →
avatar for Kristi Belcher

Kristi Belcher

Software Developer, LLNL
Kristi is a Software Developer at Lawrence Livermore National Laboratory working primarily on Umpire, an open source library that supports parallel data and memory management on HPC platforms, and MARBL, a large multi-physics simulation code. Kristi also works on the RADIUSS project... Read More →
avatar for Timothy Sliwinski

Timothy Sliwinski

HPC Software Developer, Cooperative Institute for Research in the Atmosphere (CIRA)
Dr. Timothy Sliwinski is an atmospheric scientist with the Cooperative Institute for Research in the Atmosphere at Colorado State University. Working directly with NOAA Global System Laboratory federal scientists in the Scientific Computing Branch, Dr. Sliwinski has worked on multiple... Read More →
Wednesday May 7, 2025 1:35pm - 3:15pm CDT
Chicago River Ballroom

3:40pm CDT

Lightning Talks
Wednesday May 7, 2025 3:40pm - 5:00pm CDT
1. Experience Porting a Scientific Code from YAKL to Kokkos - James Foucar, Sandia National Labs (10 minutes)
The DoE climate code E3SM recently ported a medium sized scientific code, RRTMGP (computes radiative fluxes in planetary atmospheres), from a kernel launcher called YAKL to Kokkos. We'd like to share tips and pain points from this effort, particularly the struggle to get to performance parity with YAKL. We found that a 1:1 port (YAKL API is very similar to Kokkos) was not nearly sufficient to achieve good performance. The main issues were how to allocate temporary views and dealing with MDRangePolicy.

2. Benchmarking Lattice QCD Staggered Fermion Kernel Written in Kokkos - Simon Schlepphorst, Forschungszentrum Juelich GmbH (10 minutes)
Lattice quantum chromodynamics (QCD) is a numerical approach to studying the interactions of quarks and gluons, where the fundamental eqautions governing their interactions are discretized to a four dimension spacetime lattice. One of the most costly computations is the inversion of the lattice Dirac operator, a large sparse matrix. Calculating this inversion with iterative solvers leads to many applications of that operator. This study builds on previous work where we implemented the staggered fermion Dirac operator as a benchmark in Kokkos. We investigate the effects of the tiling size in combination with the use of a 4D MDRangePolicy and 7D Views.

3. Leveraging Liaisons in Your Network for Software Sustainability - Elaine M. Raybourn, Sandia National Laboratories (10 minutes)
Open source software project sustainability is a sociotechnical endeavor that often extends beyond the efforts of individual projects. HPSF and the Linux Foundation offer rich resources of expertise across communities in industry, academia, and agencies. Leveraging this collective knowledge and experience is vital to enhance project practices, especially in early identification of challenges and potential issues. This lightning talk explores the value of leveraging liaisons — key individuals who are actively participating in cross-team networks, to accelerate project sustainability. Liaisons can bridge gaps, share tacit knowledge and incentivize collaborative efforts across communities, go assist in breaking down silos. The value of leveraging liaisons was identified during the DOE Exascale Computing Project to foster strategic project alignment and outreach. Whether a small team, or a larger network of teams of teams, identifying liaisons early on can foster trust and transparency both within and across teams.

4. Vertex-CFD: A Multi-Physics Solver for Fusion Applications - Marc Olivier Delchini & Daniel Arndt, Oak Ridge National Laboratory (10 minutes)
In this talk we will introduce Vertex-CFD, a multiphysics solver that is being developed in response to needs by Oak Ridge National Laboratory (ORNL) to have accurate simulation software for use in modeling of a fusion blanket problem. Vertex-CFD is built upon Trilinos and Kokkos libraries for compatibility with CPU and GPU platforms. It is designed to generate high-fidelity solutions of multiphysics problems in complex geometries by leveraging state-of-the art computing methods and technologies. We will describe how we leverage Kokkos and Trilinos to solve the governing equations by employing a finite element method and high-order implicit temporal integrators.

5. Toucan: Revolutionizing Microstructure Prediction - Benjamin Stump, ORNL (10 minutes)
Going to describe my code, what it is doing (physically), what I need it to do computationally, how I achieved it using Kokkos and optimized it algorithmically.

6. Performance-Portable Spectral Ewald Summation with PyKokkos - Gabriel K Kosmacher, Oden Institute, The University of Texas at Austin (10 minutes)
We present a performance portable implementation of the Spectral Ewald method, employing shared memory and streaming parallelism to rapidly evaluate periodic two-body potentials in Stokes flow. The method splits dense particle evaluation into near-field and far-field components, where the near-field is singular and the far-field decays rapidly in Fourier space. Far-field interactions resemble a Nonuniform Fast Fourier Transform: source potentials are interpolated onto a uniform grid (p2g), an ndFFT is applied, Fourier potentials are scaled, an ndIFFT is applied, and the potentials are interpolated back (g2p). The p2g, g2p, and near-field (p2p) interactions use Kokkos hierarchical parallelism with scratch-pad memory and thread-vector range reductions.

7. Empowering NSM Supercomputers with Kokkos for Scalable HPC - Harsha Ugave & Samir Shaikh, Centre for Developement of Advanced Computing (C-DAC) (10 minutes)
Kokkos is transforming how high-performance applications run on National Supercomputing Mission (NSM) systems. With NSM deploying a mix of CPUs, GPUs, and other accelerators, ensuring software runs efficiently across all these platforms can be challenging. Kokkos simplifies this by providing a single, flexible programming model that adapts to different hardware without requiring major code changes. It supports multiple backends like CUDA, HIP, SYCL, and OpenMP, making it easier for developers to write performance-portable applications. For NSM’s large-scale supercomputers, Kokkos ensures better performance and scalability, allowing applications to make full use of processors, GPUs, and memory hierarchies. It also optimizes energy efficiency by improving memory access and reducing unnecessary data movement, helping to make supercomputing more sustainable. Since Kokkos is open-source and backed by an active community, it keeps up with emerging technologies, ensuring seamless adoption of next-generation NSM systems and preparing them for the future of exascale computing.

8. Real-Time Performance Characterization of the ADIOS2 Library When Kokkos Is Enabled - Ana Gainaru, Oak Ridge National Laboratory (10 minutes)
Modern performance analysis tools are increasingly capable of capturing a high volume of metrics at ever-finer granularity. This abundance of information presents an opportunity to move beyond post-mortem analysis and leverage data streaming for real-time performance monitoring and decision-making. By streaming performance data, applications can provide immediate feedback, enabling dynamic adjustments and optimizations during execution. Furthermore, this streamed data can be directed to individual scientist workstations, facilitating on-the-fly health checks and user-driven interventions to steer the application's behavior. We will demonstrate the practical application of these concepts within the ADIOS2 library, showcasing how data streaming enables detailed monitoring and analysis of an HPC application during large-scale runs.

9. Cabana: Particles, Structured Grids, and Extensions to Unstructured with Kokkos - Sam Reeve, ORNL (10 minutes)
We discuss updates to Cabana, a Kokkos+MPI library for building particle applications. Cabana was created through the U.S. Department of Energy Exascale Computing Project to enable particle simulation across methods on current and future exascale supercomputers. Cabana includes particle and structured grid parallelism, data structures, algorithms, communication, and interfaces to additional libraries, all extending and working alongside Kokkos. We focus in particular on recent efforts to integrate Cabana particles within Trilinos unstructured grids for broader support of scientific applications. We will highlight further recent Cabana development, performance and portability, and application-level demonstrations.
Speakers
avatar for Elaine M. Raybourn

Elaine M. Raybourn

Principal Member of the Technical Staff, Sandia National Laboratories
Elaine M. Raybourn is a social scientist at Sandia National Laboratories. She has worked in the UK (British Telecom), Germany (Fraunhofer FIT), and France (INRIA) as a Fellow of the European Research Consortium in Informatics and Mathematics (ERCIM). She supports the DOE Office of... Read More →
avatar for Daniel Arndt

Daniel Arndt

Large Scale Computational Scientist, Oak Ridge National Laboratory
Daniel Arndt is a computational scientist at Oak Ridge National Laboratory. He is also a mathematician by training specializing on finite element simulations. His research focuses on supporting new backends in Kokkos.
avatar for Samir Shaikh

Samir Shaikh

Scientist, Centre for Developement of Advanced Computing (C-DAC)
Samir Shaikh is an HPC specialist at C-DAC, Pune, optimizing large-scale workloads, parallel computing, and system architecture. As a Scientist C, he enhances HPC performance for AI/ML, scientific computing, and NSM supercomputers. An IIT Guwahati M.Tech graduate, he has contributed... Read More →
avatar for Ana Gainaru

Ana Gainaru

Computer Scientist, Oak Ridge National Laboratory
Ana Gainaru is a computer scientist in the CSM division at Oak Ridge National Laboratory, working on performance optimization for large scale scientific applications and on profiling, managing, and analyzing large-scale data. She received her PhD from the University of Illinois at... Read More →
avatar for Benjamin Stump

Benjamin Stump

Technical Staff, ORNL
Benjamin Stump works at the Oak Ridge National Laboratory's Manufacturing Demonstration Facility on Additive Manufacturing problems.
avatar for Gabriel K Kosmacher

Gabriel K Kosmacher

Graduate Student, Oden Institute, The University of Texas at Austin
Gabriel is a PhD student at the Oden Institute for Computational Engineering & Sciences, where he is advised by George Biros. His research interests lie at the intersection of numerical analysis and scientific computing and is particularly interested in fast numerical methods for... Read More →
avatar for Harsha Ugave

Harsha Ugave

HPC Engineer, Centre for Development of Advanced Computing (C-DAC), India
Harsha Ugave is an HPC Engineer at C-DAC Pune, specializing in performance portability, parallel computing, and system optimization. She plays a key role in deploying and tuning HPC applications under the National Supercomputing Mission (NSM). Her work ensures efficient execution... Read More →
avatar for James Foucar

James Foucar

Software Engineer, Sandia National Labs
I've been a software developer for Sandia for nearly 20 years. For the last 10 yeas, I've been doing software-focussed tasks for E3SM (DoE climate model).
avatar for Marc Olivier Delchini

Marc Olivier Delchini

CFD developer and analyst, Oak Ridge National Laboratory
CFD analyst and developer at Oak Ridge National Laboratory for 10 years. Obtained his PhD in nuclear engineering from Texas A&M University.
avatar for Sam Reeve

Sam Reeve

Staff Scientist, ORNL
Sam Reeve is a staff scientist at ORNL, working at the intersection of materials and computational science. Current focuses include performance portability and software development for physics applications and simulation of mesoscale material phenomena. He leads the development of... Read More →
avatar for Simon Schlepphorst

Simon Schlepphorst

Research Software Engineer, Forschungszentrum Juelich GmbH
After graduating with a Master's degree in physics from the University of Bonn, Simon became a Research Software Engineer at the Juelich Supercomputing Centre developing Lattice QCD codes for current and upcoming accelerators.
Wednesday May 7, 2025 3:40pm - 5:00pm CDT
Chicago River Ballroom
 
  • Filter By Date
  • Filter By Venue
  • Filter By Type
  • Timezone

Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.
Filtered by Date -