Current Research Interest
AI Safety and Interpretability. My research seeks principled approaches to improve the robustness, controllability, and alignment of large language models with human values.
I'm always excited to collaborate with researchers at CMU and beyond. If you're working on related problems or have ideas you'd like to explore together, please feel free to reach out!