I am a Research Scientist at Anthropic and PhD student at MIT CSAIL (on leave) advised by Jacob Andreas and Julie Shah. I’ve spent summers at the Boston Dynamics AI Institute, MIT-IBM Watson AI Lab, Facebook AI Research (FAIR), and before grad school, two years as an AI Resident at Microsoft Research. I did my undergrad at Yale, where I got my start in research with Brian Scassellati and read a lot of dead philosophers.
I’m interested in building agents that learn representations from rich human knowledge, whether directly (e.g. from users) or through priors (e.g. from LMs). Currently, I’m thinking a lot about how to utilize pretrained models in conjunction with human feedback to interactively learn aligned preferences/rewards.
A history buff at heart, I care deeply about working with non-academic communities to create safe, ethical, and equitable AI. I currently serve as a Special Government Employee for the Defense Innovation Unit (DIU). In a previous life, I worked at the White House Office of Science and Technology Policy (OSTP), National Institute of Standards and Technology (NIST), and Schmidt Futures. I also serve on the advisory board of the Yale Jackson School of Global Affairs, where I co-teach a course on AI for policymakers.
I love being outdoors, even in the brutal Boston winters. A current goal is to run a sub-3:00 marathon (this is how I’m doing). Reach out to chat about research, policy, or running! Preferred subject line: Your cat is dope.
email | cv | google scholar | twitter | linkedin
Ph.D. Computer Science, 2023 -
Massachusetts Institute of Technology
M.S. Computer Science, 2023
Massachusetts Institute of Technology
B.S. Cognitive Science, 2018
Yale University
B.A. Global Affairs, 2018
Yale University
[Sep 2024] Our paper Adaptive Language-Guided Abstraction from Contrastive Explanations was accepted to CoRL 2024.
[Aug 2024] I am taking leave from MIT to lead national security evaluations on the Frontier Red Team at Anthropic.
[Aug 2024] Attending RLC! I’ll be presenting Pragmatic Feature Preferences in the RL Beyond Rewards Workshop.
[Jul 2024] Attending ICML! I’ll be presenting Pragmatic Feature Preferences in the main conference. I’ll also be attending the Alignment Workshop beforehand.
[Jul 2024] Attending RSS! I’m honored to be part of the 2024 RSS Pioneers cohort, as well as help organize the Social Intelligence in Humans and Robots Workshop and the Task Specification Workshop.
[May 2024] I started at Anthropic! I’ll be working to help make big models safer.