IAS Physics Group Meeting
The Harmonic Oscillator of Large Language Models
Abstract: Recently, I have been trying to understand how large language models (LLMs) work, what their basic building blocks are and how they are trained. In this talk I want to share some of my understanding with you by discussing the so-called ‘Transformer’ architecture. This architecture is the driving force behind most of the current AI revolution, and therefore quintessential for understanding the LLMs around today.
Date & Time
November 08, 2023 | 11:00am – 12:15pm