Scholar
Sebastin Santy
Google Scholar ID: HsyMg08AAAAJ
University of Washington
Machine Learning
Natural Language Processing
Human-Computer Interaction
Citations & Impact
All-time
Citations
1,499
H-index
11
i10-index
11
Publications
20
Co-authors
35
Publications
3 items
Economics of Sourcing Human Data
2025
Cited
0
Multilingual Diversity Improves Vision-Language Representations
Neural Information Processing Systems · 2024
Cited
10
Computer Vision Datasets and Models Exhibit Cultural and Linguistic Diversity in Perception
2023
Cited
3
Resume (English only)
Academic Achievements
“When Incentives Backfire, Data Stops Being Human” accepted at ICML 2025 (Position Paper)
“Multilingual Diversity Improves Vision-Language Representations” accepted at NeurIPS 2024 (Spotlight)
“Semantic and Expressive Variations in Image Captions Across Languages” accepted at CVPR 2025
“Characterizing Design Biases of Datasets and Models” accepted at ACL 2023 with Outstanding Paper Award
“State and Fate of Linguistic Diversity and Inclusion in the NLP World” accepted at ACL 2020, covered by multiple media outlets
“Language Translation as a Socio-Technical System” published at COMPASS 2021
“Learnings from Technological Interventions in a Low Resource Language” published at LREC 2020
“Unsung Challenges of Building and Deploying Language Technologies for LRL Communities” published at ICON 2019
“BLIP: Facilitating the Exploration of Undesirable Consequences of Digital Technologies” accepted at CHI 2024
“INMT: Interactive Neural Machine Translation” demo paper at EMNLP 2019
“CoSSAT: Code-Switched Speech Annotation Tool” published at AnnoNLP @ EMNLP 2019
“Use of Formal Ethical Reviews in NLP Literature: Historical Trends and Current Practices” published in Findings of ACL 2021
“BERTologiCoMix: How does Code-Mixing interact with Multilingual BERT?” published at AdaptNLP@EACL 2021
“Towards Task Understanding in Visual Settings” accepted as a student abstract at AAAI 2019
Co-authors
35 total
Monojit Choudhury
Professor of Natural Language Processing, MBZUAI
Kalika Bali
Researcher, Microsoft Research Labs India
Pratik Joshi
Google DeepMind
Katharina Reinecke
University of Washington
Ranjay Krishna
University of Washington, Allen Institute for AI
Jena D. Hwang
Allen Institute for AI
Jenny T. Liang
Ph.D. student, Carnegie Mellon University