Published a paper on pre-tokenization/tokenization at COLING 2025; Successfully organized a workshop on Challenges in Processing South Asian Languages (CHiPSAL-2025); Published an article on GenAI and Creative literature (Tamil) in Gnanam Monthly magazine; Secured a Google grant to develop a Sri Lankan Tamil Corpus (Speech and Text); Secured a DAAD-SDG grant to develop resources for South Asian languages together with PhD supervisors, evaluators, and their team from the University of Konstanz, the University of Moratuwa, and the University of Engineering and Technology, Pakistan.
Research Experience
Herz Fellow (2023-2024) at the University of Konstanz, Germany; Founding Member of Yarl IT Hub; IVLP Participant, U.S. Department of State; Organized a workshop on Challenges in Processing South Asian Languages (CHiPSAL-2025); Delivered a five-day course at the 34th European Summer School in Logic, Language and Information (ESSLLI); Delivered a keynote address at the International Conference on Tamil Computing and Information Technology at the University of Texas at Dallas, USA.
Education
PhD (2022); MSc in Computer Science (2010); BSc (Hons) in Computer Science (2006); Tamil Junior Pundit (Bala Pundit) (2016).
Background
A Computational Linguist focusing on understanding and modeling natural languages, with a special focus on Tamil and other low-resource languages, using computational methods. Currently, a Senior Lecturer in Computer Science at the University of Jaffna, Sri Lanka, and also a visiting researcher and a member of the Computational Linguistics Group in the Department of Linguistics at the University of Konstanz, Germany.
Miscellany
Editors-in Chief (2025 -) - Vingnanam Journal of Science; President (2024-2025) of the Jaffna Science Association; Recruiting Research assistants; Looking for PhD/Masters students to work on corpus building and low-resource language resource development.