I’m a 5th year Artificial Intelligence PhD student at Berkeley advised by Anca Dragan and Stuart Russell within BAIR and CHAI. I’m thankful to be supported by the NSF Fellowship.

I’m interested in humans’ preference, value, and belief changes, and how they may be affected by interactions with AI systems. I’ve studied this both in generality (with the language of DR-MDPs), and more specifically in the context of recommender systems, investigating how the choice of algorithm might affect us users. I’m probably best known for my work on human-AI collaboration, and developing the Overcooked-AI benchmark.

Outside of research, I enjoy inline skating 🛹, watching movies 🎥, and finding new music 🎵. Before immigrating to the US, I grew up in the amazingly chaotic city of Livorno 🇮🇹 – visit if you get the chance!

Publications

Marcus Williams*, Micah Carroll*, Adhyyan Narang, Constantin Weisser, Brendan Murphy, Anca Dragan
ICLR 2025
Tan Zhi-Xuan, Micah Carroll, Matija Franklin, Hal Ashton
Philosophical Studies 2024
Micah Carroll, Davis Foote, Anand Siththaranjan, Stuart Russell, Anca Dragan
ICML 2024
Micah Carroll*, Alan Chan*, Henry Ashton, David Krueger
EAAMO 2023
Stephen Casper, Xander Davies, ..., Micah Carroll, ..., Erdem Bıyık, Anca Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell
TMLR 2023
Smitha Milli, Micah Carroll, Sashrika Pandey, Yike Wang, Anca Dragan
PNAS Nexus 2025
Alan Chan, Rebecca Salganik, Alva Markelius, Chris Pang, Nitarshan Rajkumar, Dmitrii Krasheninnikov, Lauro Langosco, Zhonghao He, Yawen Duan, Micah Carroll, Michelle Lin, Alex Mayhew, Katherine Collins, Maryam Molamohammadi, John Burden, Wanru Zhao, Shalaleh Rismani, Konstantinos Voudouris, Umang Bhatt, Adrian Weller, David Krueger, Tegan Maharaj
FAccT 2023
Niklas Lauffer, Ameesh Shah, Micah Carroll, Michael Dennis, Stuart Russell
ICML 2023
Micah Carroll, Orr Paradise, Jessy Lin, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin
NeurIPS 2022 (Oral)
Micah Carroll, Dylan Hadfield-Menell, Stuart Russell, Anca Dragan
ICML 2022 (Spotlight)
Mesut Yang, Micah Carroll, Anca Dragan
Human-in-the-loop Learning (HILL) Workshop, NeurIPS 2022
David Zhang, Micah Carroll, Andreea Bobu, Anca Dragan
Human-in-the-loop Learning (HILL) Workshop, NeurIPS 2022
Paul Knott, Micah Carroll, Sam Devlin, Kamil Ciosek, Katja Hofmann, Anca Dragan, Rohin Shah
AAMAS 2021
Micah Carroll, Rohin Shah, Mark Ho, Tom Griffiths, Sanjit Seshia, Pieter Abbeel, Anca Dragan
NeurIPS 2019

Selected Talks

On Targeted Manipulation and Deception when Optimizing LLMs for User Feedback

FAR.ai Alignment Workshop, Santa Cruz 2024

AI Alignment with Changing and Influenceable Reward Functions

PIBBS Speaker Series 2024

Research Mentorship

  • George Ingebretsen (Berkeley EECS undergrad)
  • Karim Abdel Sadek (incoming PhD student)
  • Marcus Williams (now OpenAI)
  • Constantin Weisser (now Haize Labs)
  • Lukas Fluri (incoming PhD Student at ETH Zurich)
  • Calvin Bo Zhang (now Scale AI)
  • Tianyi (Alex) Qiu (now Anthropic Fellow)
  • Davis Foote
  • Ethan Mendez (now PhD Student at Georgia Tech)
  • Austin Jang (incoming PhD Student)
  • Francis Geng (now PhD Student at UCSD)
  • Sebastian Zhao (Berkeley EECS undergrad)
  • Yike Wang (now PhD Student at University of Washington)
  • David Zhang (now SWE at Observe, Inc.)
  • Sashrika Pandey (now SWE at Figma)
  • Anrui Gu (now ML Engineer at Smith & Associates)
  • Mesut Yang (now SWE at Ironclad)
  • Nathan Miller (now SWE at Microsoft)
  • Paul Knott (now CSET)

Learning

Teaching