Journal

Getting Aligned on Representational Alignment

Preprint

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

TMLR, 2023

Make Greenhouse-Gas Accounting Reliable — Build Interoperable Systems

Nature, 2022