Built or contributed to large-scale, broad-coverage resources such as the Indic NLP Library, the IndicTrans/Sata-Anuvaadak translation systems, IndicLLMSuite, Airavata LLM, IIT Bombay Parallel Corpus, Samanantar Corpus, Indic NLP/NLG Suite, and the Aksharantar/BrahmiNet transliteration corpora; 3 papers accepted to ACL 2025.
Research Experience
Principal Applied Researcher in the Microsoft Machine Translation team; co-founder and co-lead of AI4Bharat; served as adjunct faculty in the Department of Computer Science, IIT Madras.
Education
Ph.D. in 2018 from the Department of Computer Science and Engineering, IIT Bombay, under the guidance of Prof. Pushpak Bhattacharyya at the Center for Indian Language Technology. His doctoral research focused on various facets of machine translation and transliteration between related languages.
Background
NLP Researcher working on Reasoning Models, Machine Translation, Multilingual Learning and Indian Language NLP. A Principal Applied Researcher in the Microsoft Machine Translation team and a founding member and co-lead of AI4Bharat.
Miscellany
Interested in building tools and resources for Indian language NLP; involved in multiple talks and workshops, including a tutorial on Building Multilingual NLP datasets at scale at the IASNLP Summer School and a talk on reasoning models at OdiaGen (IIT Bhubaneswar); member of the Academic Council at IIIT-Hyderabad.