Host: Patrick Diehl
Co-Host: Philipp Edelmann
The Lightweight Communication Interface (LCI) is an experimental communication library aiming for better asynchronous multithreaded communication support, both in terms of performance and programmability. It is also a research tool that helps us understand how to design communication libraries to better fit the needs of dynamic programming systems and applications. It features a simple, incrementally refinable interface that unifies all common point-to-point communication primitives, and an atomic-based runtime for maximum threading efficiency. It has been integrated into established asynchronous many-task systems such as HPX and has shown significant performance improvements on microbenchmarks and real-world applications. This talk will present an overview of its interface and software design and showcase its performance.
Bio: Jiakun Yan is a fifth-year Ph.D. student at UIUC, advised by Prof. Marc Snir. His research involves exploring better communication library designs for highly dynamic/irregular programming systems and applications. He is the main contributor to the Lightweight Communication Interface (LCI) Project and the HPX LCI parcelport.
In the realm of High-Performance Computing (HPC), achieving peak system performance while maintaining code safety and concurrency has always been a challenging endeavor. Traditional HPC frameworks often struggle with memory safety issues, race conditions, and the complexities of parallel programming. In this talk, I introduce Lamellar, a new HPC runtime that leverages the modern programming language Rust to address these enduring challenges. Rust, known for its powerful type system and memory safety guarantees without a garbage collector, is rapidly gaining traction within the systems programming community. However, its potential in the HPC domain has yet to be fully explored. The Lamellar runtime harnesses Rust's strengths, providing a robust, scalable, and safe environment for developing and executing high-performance applications. This talk is designed for computational domain scientists and computer scientists who are well acquainted with HPC concepts but may be new to Rust; it will cover both the core concepts of the language and the design and use of the Lamellar runtime.
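For an audience new to Rust, the following minimal standard-library sketch (deliberately generic, not Lamellar's API) gives a flavor of the guarantee referred to above: shared mutable state must be wrapped in explicit synchronization types such as Arc and Mutex, so an unsynchronized data race is rejected at compile time rather than debugged at run time.

```rust
use std::sync::{Arc, Mutex};
use std::thread;

fn main() {
    // A plain Vec<u64> cannot be mutated from several threads at once:
    // the compiler rejects such code. Arc gives shared ownership and
    // Mutex enforces exclusive access, making the synchronization explicit.
    let histogram = Arc::new(Mutex::new(vec![0u64; 8]));

    let handles: Vec<_> = (0..4)
        .map(|t| {
            let histogram = Arc::clone(&histogram);
            thread::spawn(move || {
                for i in 0..1000 {
                    let bin = ((t * 1000 + i) % 8) as usize;
                    histogram.lock().unwrap()[bin] += 1;
                }
            })
        })
        .collect();

    for handle in handles {
        handle.join().unwrap();
    }
    println!("final counts: {:?}", histogram.lock().unwrap());
}
```

The talk discusses how Lamellar carries this kind of compile-time discipline into distributed, asynchronous execution across nodes.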
Bio: Dr. Ryan Friese is a senior computer scientist at Pacific Northwest National Laboratory in the Future Computing Technologies group. His research interests span hardware/software co-design of runtime and system software for novel architectures, HPC (high-performance computing) network simulation and modeling, the analysis and optimization of data movement in large-scale distributed workflows, and performance modeling of irregular applications. His recent work has focused on enabling memory-safe programming on HPC systems by leading the development of the Lamellar Runtime, an asynchronous distributed runtime written in the Rust programming language. He received his PhD in Electrical & Computer Engineering in 2015 from Colorado State University.
Julia has been supported as a first-class language on NERSC's systems for over 10 years [1]. In this talk we will discuss how Julia has been deployed at scale, the technical issues encountered and how they were eventually overcome. We will also explore the challenges faced with developing intuitive and portable distributed HPC applications that are capable of targeting modern GPU architectures, and NERSC's vision for supporting modern interdisciplinary and multi-facility workflows.
[1] https://info.juliahub.com/case-studies/celeste
Bio: Johannes Blaschke is an HPC workflow performance expert leading the NERSC Science Acceleration Program (NESAP). His research interests include urgent and interactive HPC, and programming environments and models for cross-facility workflows. Johannes supports Julia on NERSC's systems, including one of the first examples of integrating MPI.jl and Distributed.jl with HPE's Slingshot network technology. Johannes is a zealous advocate for Julia as an HPC programming language, and a contributor to and organizer of Julia tutorials and BoFs at SC, JuliaCon, and within the DOE.
We present a summary of our research and community efforts exploring the Julia language for the scientific mission of the US Department of Energy (DOE) at the intersection of high-performance computing (HPC) and high productivity. Powered by the LLVM compiler infrastructure combined with a unifying ecosystem and friendly scientific syntax, Julia attempts to lower the cost of the “two-language and multiple ecosystems” paradigm (e.g., Python plus a compiled language). Along with the Julia intro and HPC hands-on tutorials, we present our efforts on: (i) building an accessible, performance-portable CPU/GPU library, JACC.jl; (ii) the outcomes of external venues (SC BoFs, tutorials) and workshops at Oak Ridge National Laboratory (ORNL); (iii) our research (best paper at SC23 WORKS) on the unifying value of using a single front-end language on Frontier, the second-fastest supercomputer in the world; and (iv) our work (best paper at SC24 XLOOP) connecting ORNL’s experimental and computational facilities using JACC.jl. Julia thus aspires to make the future landscape of heterogeneous, AI-driven, and energy-aware computing more accessible by leveraging existing investments outside DOE in LLVM and commercial applications of the language.
Bio: William Godoy is a senior computer scientist in the Computer Science and Mathematics Division at Oak Ridge National Laboratory (ORNL). His interests are in high-performance computing, parallel programming systems, scientific software, and workflows. At ORNL, he contributed to the Exascale Computing Project in both the applications (QMCPACK) and software technologies (ADIOS2, Julia/LLVM) portfolios, as well as to projects impacting ORNL’s computing and neutron science facilities. Godoy currently works across research projects funded by the US Department of Energy Advanced Scientific Computing Research (ASCR) program. Prior to ORNL, he was a staff member at Intel Corporation and a postdoctoral fellow at NASA Langley Research Center. Godoy received PhD and MSc degrees from the University at Buffalo, The State University of New York, and a BSc from the National Engineering University (UNI) Lima, Peru, in mechanical engineering. He is a senior member of the IEEE and a member of ACM, ASME, and US-RSE, serving in several venues and technical committees.
Traditional static resource allocation in supercomputers (jobs retain a fixed set of resources) leads to inefficiencies. Resource adaptivity (jobs can change resources at runtime) significantly increases supercomputer efficiency.
This talk builds on Asynchronous Many-Task (AMT) programming, which is well suited for adaptivity thanks to its transparent resource management. An AMT runtime system dynamically assigns small, user-defined tasks to workers to achieve load balancing and adapt to resource changes.
We will discuss techniques for malleability and evolving capabilities that allow programs to dynamically change resources without interrupting computation. Automatic load detection heuristics determine when to start or terminate processes, which is particularly beneficial for unpredictable workloads. Practicality is demonstrated by adapting the GLB library. A generic communication interface allows interaction between programs and resource managers. Evaluations with a prototype resource manager show significant improvements in batch makespan, node utilization, and job turnaround time for both malleable and evolving jobs.
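To make the pull-based task model concrete, here is a deliberately simplified, single-node Rust sketch (generic standard-library code, not GLB or any specific AMT runtime): because workers pull small tasks from a shared queue, an extra worker can join mid-run and immediately share the load, with no repartitioning of work.

```rust
use std::sync::{mpsc, Arc, Mutex};
use std::thread;

// A task is just a boxed closure; real AMT runtimes attach data and locality hints.
type Task = Box<dyn FnOnce() + Send + 'static>;

// A worker repeatedly pulls tasks from the shared queue. Because work is
// pulled rather than statically partitioned, workers can join or retire
// at runtime without any redistribution of work.
fn spawn_worker(id: usize, queue: Arc<Mutex<mpsc::Receiver<Task>>>) -> thread::JoinHandle<()> {
    thread::spawn(move || loop {
        // Hold the lock while waiting for the next task from the shared queue.
        let next = queue.lock().unwrap().recv();
        match next {
            Ok(task) => {
                println!("worker {id} runs a task");
                task();
            }
            Err(_) => break, // queue closed: this worker retires
        }
    })
}

fn main() {
    let (tx, rx) = mpsc::channel::<Task>();
    let rx = Arc::new(Mutex::new(rx));

    // Start with two workers; a resource manager could request more or fewer later.
    let mut workers: Vec<_> = (0..2).map(|id| spawn_worker(id, Arc::clone(&rx))).collect();

    // Submit many small, user-defined tasks.
    for i in 0..8u64 {
        tx.send(Box::new(move || {
            let _ = i * i; // placeholder for real computation
        }))
        .unwrap();
    }

    // "Evolving" upwards: a third worker joins mid-run and immediately shares the load.
    workers.push(spawn_worker(2, Arc::clone(&rx)));

    drop(tx); // no further tasks: workers drain the queue and exit
    for worker in workers {
        worker.join().unwrap();
    }
}
```

Real AMT systems layer distributed scheduling, inter-node load balancing, and the resource-manager interface discussed above on top of this basic pattern.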
Bio: Jonas is a dedicated computer scientist specializing in high-performance computing. He received his Bachelor’s and Master’s degrees from the University of Kassel, Germany, where he also earned his Ph.D. in 2022. He is currently serving as a substitute chair of the Software Engineering Group at the same university and is writing his habilitation.
Jonas' research interests include load balancing, fault tolerance, and resource adaptivity for Asynchronous Many-Task (AMT) systems. Recently, he has focused on resource adaptivity in general to optimize the efficient use of supercomputing resources. His work covers a broad spectrum, including the development of advanced job scheduling algorithms, the improvement of application programming using AMT systems, and the interaction between resource managers and jobs.
Rust is a modern language that provides type- and memory-safety without a garbage collector, using a concept called lifetimes. Many see Rust as a successor to languages like C and C++, and there are many interested individuals in the computational science community, yet few major projects have made the switch. I'll introduce the language and its ecosystem, including the state of scientific computing libraries. We'll discuss what soundness means for libraries and examine rsmpi, which safely exposes MPI and allows catching many common bugs at compile-time. We'll also discuss type-system approaches to collective semantics, and conclude with an outlook on Rust for scientific computing.
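To give a taste of the compile-time checking mentioned above, the following minimal point-to-point example uses the rsmpi (`mpi`) crate; it is a sketch assuming a recent rsmpi release, not an excerpt from the talk.

```rust
// Run with a standard MPI launcher, e.g.: mpiexec -n 2 ./example
use mpi::traits::*;

fn main() {
    let universe = mpi::initialize().expect("MPI initialization failed");
    let world = universe.world();
    assert!(world.size() >= 2, "this example needs at least two ranks");

    match world.rank() {
        0 => {
            // Only types with an MPI-equivalent datatype (the `Equivalence`
            // trait) can be sent, and buffer lengths and datatypes are derived
            // from the Rust types, eliminating the manual pointer/count/MPI_DOUBLE
            // bookkeeping of the C API.
            let payload = vec![1.0f64, 2.0, 3.0];
            world.process_at_rank(1).send(&payload[..]);
        }
        1 => {
            let (msg, status) = world.process_at_rank(0).receive_vec::<f64>();
            println!("rank 1 received {:?} from rank {}", msg, status.source_rank());
        }
        _ => {} // extra ranks sit this one out
    }
}
```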
Bio: Jed leads the Physical Prediction, Inference, and Design group at CU Boulder. He is a maintainer of PETSc, libCEED, and rsmpi (Rust bindings to MPI), and is active in many open source communities. He works on high-performance numerical software infrastructure for computational science and engineering, as well as applications such as structural mechanics and materials science, non-Newtonian and turbulent flows, and plasmas. He is co-director of the PSAAP-3 Multidisciplinary Simulation Center for Micromorphic Multiphysics Porous and Particulate Materials Simulations Within Exascale Computing Workflows.
The thermonuclear supernova modeling pipeline has been refined for over four decades and has achieved substantial success in modeling various supernova subtypes. Nonetheless, continuous innovation is essential for maintaining supernova modeling at the forefront of computational astrophysics. In this work, we examine a novel scenario: so-called thermonuclear electron-capture supernovae. Originally proposed by Jones et al. (2016), this scenario consists of a collapsing super-AGB (sAGB) star that only narrowly escapes collapse to a neutron star by runaway thermonuclear burning. Here, we explore the specific circumstances under which such a thermonuclear explosion can occur and under which conditions the collapse can be averted by nuclear burning. Subsequently, we leverage this scenario to motivate a long-overdue update to the thermonuclear supernova modeling pipeline, both by increasing the complexity of the included physics and by updating the underlying codebase for the latest exascale computing clusters. In particular, we advocate the integration of radiation hydrodynamics and the transition towards a performance-portable programming model.
The number of RISC-V commercial products increased substantially this past year. This presentation is an orientation to the range of RISC-V hardware, HPC software support, the community, and the current state of HPC-relevant ISA extensions. Acquiring RISC-V hardware is no longer a question of when: it is possible now.
Bio: Chris is a senior principal research engineer at Tactical Computing Labs. His work experience includes compilers, runtime systems, systems-level software, numerical libraries, applied math problems, and hardware simulation. He has a master's degree in Computer Science from Georgia Tech and an undergraduate degree in Computer Science from Clemson.
Dynamic and adaptive mesh refinement is pivotal in high-resolution, multi-physics simulations, which must resolve the physics precisely in localized areas across expansive domains. The extreme heterogeneity and large node counts of today's supercomputers present a significant challenge for such dynamically adaptive codes, making both scalability and performance portability essential. Our research focuses on addressing this by integrating the asynchronous many-task runtime system HPX with the performance-portability framework Kokkos and SIMD types. To demonstrate and benchmark our solutions at scale, we incorporated them into Octo-Tiger, an adaptive, massively parallel application for the simulation of binary star systems and their outcomes. Thanks to this, Octo-Tiger now supports a diverse set of processors, accelerators, and network backends, and can scale on various supercomputers, such as Perlmutter, Frontier, and Fugaku. In this talk, we outline our various integrations between HPX and Kokkos. Furthermore, we show the challenges we encountered when using these frameworks together in Octo-Tiger and how we addressed them, ultimately achieving scalability on a selection of current supercomputers.
Bio: Gregor Daiß is a PhD student at the University of Stuttgart, specializing in high-performance computing. His main interests include task-based runtime systems, distributed computing, and performance portability, as well as refactoring large-scale simulations and porting them to accelerators. His current work mostly involves Kokkos (for performance portability) and HPX (a task-based runtime system) for these purposes.
The Journal of Open Source Software (JOSS) is an open-access, no-fee scholarly journal that publishes quality open-source research software based on open peer review. JOSS was founded in 2016 with the dual objectives of giving traditional academic publication credit for software work and improving the quality of research software. Since its founding, JOSS has published over 2500 software papers—and counting!—with over 80 active editors spread across seven topic-area tracks. To handle this volume of submissions and publishing with a fully volunteer team, JOSS relies on GitHub and a system of open tools for reviewing and publishing submissions, driven by chatbot commands. Authors submit short Markdown papers, which are compiled to PDF via Pandoc, along with links to their software's repository. JOSS’s editorial bot performs automated health checks on submissions, and reviews take place in GitHub issues, with authors, editors, and reviewers issuing bot commands via comments. This talk will describe the publication experience of JOSS and its machinery, and how it can be adapted by other communities.
Bio: Kyle E. Niemeyer is an Associate Professor at Oregon State University in the School of Mechanical, Industrial, and Manufacturing Engineering. He also serves as the Associate School Head for Undergraduate Programs. He leads the Niemeyer Research Group, which uses computational modeling to study various phenomena involving fluid flows, including combustion and chemical kinetics, and related topics like numerical methods and parallel computing. He is also a strong advocate of open access, open source software, and open science in general, and has contributed in the area of standardizing research software citation. Kyle has received multiple prestigious fellowships throughout his career, including the AAAS Science & Technology Policy Fellowship in 2022, the Better Scientific Software (BSSw) Fellowship in 2019, the NSF Graduate Research Fellowship in 2010, and the National Defense Science and Engineering Graduate Fellowship in 2009. Kyle received his Ph.D. in Mechanical Engineering from Case Western Reserve University in 2013. He received BS and MS degrees in Aerospace Engineering from Case Western Reserve University in 2009 and 2010, respectively.
The Message Passing Interface standard has long been the lingua franca of HPC. Its design has enabled the development of many distributed parallel applications. After 30 years, the field of high-performance computing has seen several programming paradigms come and go. However, MPI has yet to address the challenges of accelerator-based computing, the advent of modern languages such as Rust, Python, and C++, and fully asynchronous programming models. This talk will provide insights into current efforts on modernizing MPI, from accelerator integration to improved datatype handling for modern languages.
Bio:Joseph Schuchart is a Senior Research Scientist at the Institute for Advanced Computational Sciences at Stony Brook University. His research revolves around distributed asynchronous and task-based programming models, communication libraries, and design aspects of integrating different models. Joseph received his M.Sc. in Computer Science from Dresden University of Technology in 2012 and his PhD from the University of Stuttgart in 2020. He is an active member of the MPI Forum and a contributor to the Open MPI project.