I’m a research scientist and cofounder of OpenAI. I lead the reinforcement learning (RL) team, where we’re working on using RL algorithms (trial-and-error learning) to improve language models like GPT. Previously, I received my PhD in Computer Science from UC Berkeley, where I had the good fortune of being advised by Pieter Abbeel.
Prior to my recent work in RL, I spent some time working on robotics, enabling robots to tie knots and plan movement using trajectory optimization. Before that, I did a brief stint in neuroscience at Berkeley before switching to machine learning, and before that, I studied physics at Caltech.