Preprints
Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
Improving Human-AI Coordination through Online Adversarial Training and Generative Models
2025
ReaLJam: Real-Time, Synchronous Human-AI Music Jamming with Reinforcement Learning-Tuned Transformers
Extended Abstracts of The ACM Conference on Human Factors in Computing Systems (CHI) 2025
Achieving Human Level Competitive Robot Table Tennis
IEEE International Conference on Robotics and Automation (ICRA) 2025 (Best Paper Finalist)
Cross-environment Cooperation Enables Zero-shot Multi-agent Coordination
International Conference on Machine Learning (ICML) 2025 (Oral Paper - Top 1%) and CogSci 2025
InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma
International Conference on Learning Representations (ICLR) 2025
2024
Infer Human’s Intentions Before Following Natural Language Instructions
AAAI Conference on Artificial Intelligence (AAAI) 2025
Multi Agent Reinforcement Learning for Sequential Satellite Assignment Problems
AAAI Conference on Artificial Intelligence (AAAI) 2025 (Oral Paper - Top 5%)
Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning
Neural Information Processing Systems (NeurIPS) 2024 (Spotlight - Top 2%)
Learning to Cooperate with Humans Using Generative Agents
Neural Information Processing Systems (NeurIPS) 2024
Impossibility theorems for feature attribution
Proceedings of the National Academy of Sciences (PNAS) 2024
Adaptive Accompaniment with ReaLchords
International Conference on Machine Learning (ICML) 2024