Scholar
Chawin Sitawarin
Google Scholar ID: AxUAEQ4AAAAJ
Google DeepMind
machine learning
artificial intelligence
security
privacy
adversarial examples
Citations & Impact
All-time
Citations
2,263
H-index
19
i10-index
23
Publications
20
Co-authors
66
list available
Publications
8 items
Soft Instruction De-escalation Defense
2025
Cited
0
Extracting alignment data in open models
2025
Cited
0
The Attacker Moves Second: Stronger Adaptive Attacks Bypass Defenses Against LLM Jailbreaks and Prompt Injections
2025
Cited
0
Does More Inference-Time Compute Really Help Robustness?
2025
Cited
0
Defending Against Prompt Injection With a Few DefensiveTokens
2025
Cited
0
How much do language models memorize?
2025
Cited
0
Lessons from Defending Gemini Against Indirect Prompt Injections
2025
Cited
0
JailbreaksOverTime: Detecting Jailbreak Attacks Under Distribution Shift
2025
Cited
0
Co-authors
66 total
David Wagner
Professor of Computer Science, UC Berkeley
Prateek Mittal
Professor, Princeton University
Arjun Bhagoji
Assistant Professor, IIT Bombay
Co-author 4
Julien Piet
UC Berkeley
Co-author 6
Sizhe Chen
UC Berkeley, Meta FAIR
Vikash Sehwag
Google DeepMind; Princeton University