Published several papers such as 'Humanity's Last Exam', 'Improving Alignment and Robustness with Circuit Breakers', 'HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal', etc. Also contributed to large open-source projects like BLOOM.
Research Experience
Working at Center for AI Safety, involved in multiple research projects including Humanity's Last Exam, Improving Alignment and Robustness with Circuit Breakers, HarmBench, etc.
Education
Received a B.S. in Computer Science from Case Western Reserve University in 2023. Worked with Trieu H. Trinh and Minh-Thang Luong (DeepMind) during undergraduate studies.
Background
Currently a Research Engineer at Center for AI Safety, working with Dan Hendrycks. Interested in AI Safety.
Miscellany
Ranked 1st in North America for Amumu (3rd globally) and 3rd in North America for Gragas in League of Legends, Season 2023.