Mantas Mazeika

Google Scholar ID: fGeEmLQAAAAJ
Center for AI Safety
ML Safety, AI Safety, Machine Ethics, ML Reliability
Citations & Impact
All-time
Citations: 17,612
H-index: 26
i10-index: 28
Publications: 20
Co-authors: 5 (list available)
Publications
14 items
Aggressive Compression Enables LLM Weight Theft
arXiv.org · 2026 · Cited: 0
Best Practices for Biorisk Evaluations on Open-Weight Bio-Foundation Models
2025 · Cited: 0
Remote Labor Index: Measuring AI Automation of Remote Work
2025 · Cited: 0
A Definition of AGI
2025 · Cited: 0
MoReBench: Evaluating Procedural and Pluralistic Moral Reasoning in Language Models, More than Outcomes
2025 · Cited: 0
TextQuests: How Good are LLMs at Text-Based Video Games?
2025 · Cited: 0
The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems
2025 · Cited: 0
Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs
2025 · Cited: 0
Co-authors
5 total
Dan Hendrycks
Director of the Center for AI Safety (advisor for xAI and Scale)
Dawn Song
Professor of Computer Science, UC Berkeley
Andy Zou
PhD Student, Carnegie Mellon University
Bo Li
University of Illinois at Urbana–Champaign
David Forsyth
Professor of Computer Science, University of Illinois at Urbana–Champaign
