Informal Seminar on Machine Learning for Theoretical Physicists

Large Language Models

The Institute for Advanced Study requires that all adult visitors, collaborators, conference and on-campus seminar attendees, and outside vendors coming to the Institute have completed a COVID-19 vaccination and booster in order to enter the IAS campus. Individuals must be prepared to present proof of vaccination if asked and are expected to follow the Institute's COVID-19 Procedures. Masks are optional while indoors. Additional information can be found at:
https://www.ias.edu/covid-19-procedures

Abstract: This will be a discussion about large language models such as OpenAI’s GPT series, oriented towards physicists.

After a brief survey of the state of the art, we describe transformer models in detail and discuss current ideas on how they work and on how models trained to predict the next word in a text are able to perform other tasks that display intelligence.

Date & Time

April 20, 2023 | 1:45pm – 3:00pm

Location

Bloomberg Hall Lecture Hall

Affiliation

CMSA, Harvard University
