Scholar
Felix Hofstätter
Google Scholar ID: zRIuwQ8AAAAJ
Unknown affiliation
Artificial Intelligence
Machine Learning
AI Safety
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
77
H-index
4
i10-index
2
Publications
12
Co-authors
0
Contact
No contact links provided.
Publications
4 items
Stress Testing Deliberative Alignment for Anti-Scheming Training
2025
Cited
0
Probing Evaluation Awareness of Language Models
2025
Cited
0
The Elicitation Game: Evaluating Capability Elicitation Techniques
2025
Cited
0
AI Sandbagging: Language Models can Strategically Underperform on Evaluations
arXiv.org · 2024
Cited
11
Resume (English only)
Co-authors
0 total
Co-authors: 0 (list not available)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up