Samuel Cahyawijaya
Scholar

Samuel Cahyawijaya

Google Scholar ID: w5w_WZEAAAAJ
Cohere
Low-Resource NLPUnderrepresented LanguagesMultilingualCosslingualZero/Few-shot learning
Citations & Impact
All-time
Citations
8,010
 
H-index
31
 
i10-index
52
 
Publications
20
 
Co-authors
32
list available
Resume (English only)
Academic Achievements
  • Resource Award at IJCNLP-AACL 2023 for NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
  • Area Chair’s Award at IJCNLP-AACL 2023 for A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity
  • Best Paper Award at SEALP 2023 for IndoToD: A Multi-Domain Indonesian Benchmark For End-to-End Task-Oriented Dialogue Systems
  • Outstanding Paper Award at EACL 2023 for NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages
  • Best Student Paper Award at DialDoc 2022 for Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters
  • Honorable Mention Award at NLP4ConvAI for XPersona: Evaluating Multilingual Personalized Chatbot
  • Hong Kong PhD Fellowship from Research Grants Council of Hong Kong (September 2021)
  • Merit Award of e-Inclusion category at INAICTA 2014 (August 2014)
  • Semi-Finalist of the World Citizenship Category of Imagine Cup 2014 (August 2014)
  • 1st Place Winner of Data Mining Competition at Gemastik 6 (October 2013)
  • 1st Place Winner of Gemastik 6 Debugging Competition at Gemastik 6 (October 2013)
  • 3rd Place Winner of Samsung App Challenge 2013 (September 2013)
  • 1st Place Winner of the Innovation Category of Imagine Cup 2013 (March 2013)
Background
  • Currently working as a 3rd year PhD student at the Centre for Artificial Intelligence Research (CAiRE) at HKUST, focusing on multilingualism for low-resource languages, especially in Southeast Asian languages. Also supervising undergraduate and master students interested in NLP research.
Miscellany
  • Interests include researching multicultural and multilingualism across various language families and modalities.