Informal Seminar on Machine Learning for Theoretical Physicists
Large Language Models
The Institute for Advanced Study requires that all adult visitors, collaborators, conference and on-campus seminar attendees and outside vendors coming to the Institute are required to have completed a COVID-19 vaccination and booster in order to enter the IAS campus. Individuals must be prepared to present proof of vaccination if asked and are expected to follow the Institute's Covid-19 Procedures. Masks are optional while indoors. Additional information can be found at:
https://www.ias.edu/covid-19-procedures
Abstract: This will be a discussion about large language models such as OpenAI’s GPT series, oriented towards physicists.
After a brief survey of the state of the art, we describe transformer models in detail, and discuss current ideas on how they work and how models trained to predict the next word in a text are able to perform other tasks displaying intelligence.