Name: Deploying AI Chatbot Assistants with Charliecloud - Jemma Stachelek, Los Alamos National Laboratory
Start: 2025-05-08T09:20:00-0500
End: 2025-05-08T10:00:00-0500

May 5-8, 2025
Chicago, IL

Thursday May 8, 2025 9:20am - 10:00am CDT

Illinois River

Additional Authors: Tolulope Olatunbosun, Phil Romero & Mike Mason, Los Alamos National Laboratory

Retrieval Augmented Generation (RAG) systems improve the response relevance of Large Language Models (LLMs) by restricting the model's context to a document corpus. RAG systems have seen broad deployment as document summarization engines and AI chatbots. However, deploying these systems often assumes a privileged, “cloudy” environment with multi-container orchestration (e.g., Docker Compose) and unfettered internet access to pull resources (software, data, and models) on the fly. As an alternative, we leveraged Charliecloud’s NVIDIA GPU support to deploy a RAG chatbot in an unprivileged HPC environment where resources are pre-staged. We demonstrate the deployment of AI chatbots using Charliecloud across a variety of hardware and software versions.
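As a rough illustration of the retrieval step described in the abstract, the sketch below picks the most relevant document from a small in-memory corpus and builds a prompt restricted to it. This is not the authors' implementation: the corpus contents, the file names, the bag-of-words cosine scoring, and the function names `retrieve` and `build_prompt` are all assumptions made for illustration. A real deployment would more likely use an embedding model and a vector store, both pre-staged on the cluster.

```python
import math
import re
from collections import Counter

# Hypothetical pre-staged corpus. In an offline HPC setting these documents
# would already sit on local storage instead of being fetched at run time.
CORPUS = {
    "charliecloud.txt": "Charliecloud runs containers without privileged operations on HPC systems.",
    "gpu.txt": "GPU nodes need the NVIDIA driver libraries made available inside the container.",
    "staging.txt": "Models and data are copied to the cluster filesystem before the job starts.",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(question, corpus, k=1):
    """Return the names of the k documents most similar to the question."""
    query = Counter(tokenize(question))
    ranked = sorted(
        corpus,
        key=lambda name: cosine(query, Counter(tokenize(corpus[name]))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question, corpus, k=1):
    """Build an LLM prompt whose context is limited to the retrieved documents."""
    context = "\n".join(corpus[name] for name in retrieve(question, corpus, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"
```

Calling `build_prompt("Which NVIDIA driver libraries does the container need?", CORPUS)` would produce a prompt containing only the text of the hypothetical `gpu.txt`, which is the sense in which a RAG system restricts the model's context to a document corpus.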

LA-UR-25-21968

Speakers

Jemma Stachelek

HPC AI/ML Software Engineer, Los Alamos National Laboratory

Charliecloud

HPSF Conference 2025
