Current Research Interest
AI Safety and Interpretability. My research seeks principled approaches to control and align large language models with human values.
I'm always excited to collaborate with researchers at CMU and beyond. If you're working on related problems or have ideas you'd like to explore together, please feel free to reach out!