Publications

Pragmatic Feature Preferences: Learning Reward-Relevant Preferences from Human Input
Learning with Language-Guided State Abstractions
Preference-Conditioned Language-Guided Abstractions
Aligning Robot Representations with Humans
Getting Aligned on Representational Alignment
Human-Guided Complexity-Controlled Abstractions
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Diagnosis, Feedback, Adaptation: A Human-in-the-Loop Framework for Test-Time Policy Adaptation
Strengthening Subcommunities: Towards Sustainable Growth in AI Research
Make Greenhouse-Gas Accounting Reliable — Build Interoperable Systems
Investigations of Performance and Bias in Human-AI Teamwork in Hiring
On the Nature of Bias Percolation: Assessing Multiaxial Collaboration in Human-AI Systems
Human-Machine Collaboration for Fast Land Cover Mapping
What You See Is What You Get? The Impact of Representation Criteria on Human Bias in Hiring
An Integrated Machine Learning Approach To Studying Terrorism
Conceptual Feasibility Study of the Hyperloop Vehicle for Next-Generation Transport
Early Detection of Boko Haram Attacks in Nigeria