Publications: Full list available on his Google Scholar.
Teaching: Co-taught an introductory workshop on building transformers at the Kempner Institute and gave guest lectures for CS 2281R on hardware, distributed data parallel (DDP) training, checkpointing, compute primitives, and parallelization.
Research Experience
Currently a research scientist at the Kempner Institute at Harvard, working on machine learning and artificial intelligence. Previously a research scientist at Amazon (formerly Alexa AI), working on production speech and language models. Also previously a Simons Collaboration postdoctoral fellow at McGill University, working on high-energy theoretical physics.
Education
PhD: Johns Hopkins University, 2018, supervised by Jared Kaplan. Undergraduate: UC Berkeley, majored in Physics and worked on dark matter detection.
Background
Research Interests: Building and scaling up foundation models, including quantization, post-training, and systems scaling. Professional Fields: Machine learning, artificial intelligence, theoretical high-energy physics.
Miscellany
Personal Interests: Squash, long walks, biking, weightlifting, photography.