Research interests broadly span architectures, data, and learning algorithms for language models. In particular, explored how language models can adapt to new tasks and domains with minimal data.
Miscellany
Enjoys optimizing with first and second moments, exchanging mathematical puzzles, organizing HackMIT, wearing free Jane Street t-shirts, playing tennis/badminton/climbing, and eating spicy food.