Speaker: Sebastian Wind, NHR@FAU
Slides: https://hpc.fau.de/files/2025/03/2025-03-11_HPCCafe_LLMfuerDummies.pdf
Abstract:
Large Language Models (LLMs) are revolutionizing the way we interact with artificial intelligence, and the open-source community plays a pivotal role in driving their accessibility and innovation. This talk explains the inner workings of LLMs, covering their foundational mechanisms and architectures. We also examine how these models can be trained efficiently on high-performance computing (HPC) systems, using state-of-the-art scaling strategies and principles derived from scaling laws. Attendees will gain insight into the challenges and opportunities of developing and deploying LLMs in diverse computational environments.
Material from past events is available at: https://hpc.fau.de/teaching/hpc-cafe/