Deep Learning from the Perspective of a Physicist

Speaker
Daniel Roberts
Date
Mon October 23rd 2023, 2:00 - 3:00pm
Affiliation
Sequoia Capital and Massachusetts Institute of Technology
Event Sponsor
Stanford Institute for Theoretical Physics
Location
Varian 355

In one of many worlds, we will either (i) discuss a way to minimally extend the traditional workhorse of machine learning, the linear model, by considering the quadratic model, an interacting model that exhibits feature learning, (ii) present a solvable model of neural scaling laws that we can understand using tools from random matrix theory, or (iii) give pedagogical introduction to "language models" and the transformer architecture that powers exciting AI systems such as ChatGPT. (We probably live in the world described by the third item.)