Journal

Getting Aligned on Representational Alignment

Preprint

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

TMLR, 2023 (Finalist, Oustanding Certification)

Make Greenhouse-Gas Accounting Reliable — Build Interoperable Systems

Nature, 2022