Scholar

Mantas Mazeika

Google Scholar ID: fGeEmLQAAAAJ

Center for AI Safety

ML SafetyAI SafetyMachine EthicsML Reliability

Google Scholar↗

Citations & Impact

All-time

Citations

17,612

H-index

26

i10-index

28

Publications

20

Co-authors

5

list available

Contact

No contact links provided.

Publications

14 items

Aggressive Compression Enables LLM Weight Theft

arXiv.org · 2026

Cited

0

Best Practices for Biorisk Evaluations on Open-Weight Bio-Foundation Models

2025

Cited

0

Remote Labor Index: Measuring AI Automation of Remote Work

2025

Cited

0

A Definition of AGI

2025

Cited

0

MoReBench: Evaluating Procedural and Pluralistic Moral Reasoning in Language Models, More than Outcomes

2025

Cited

0

TextQuests: How Good are LLMs at Text-Based Video Games?

2025

Cited

0

The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems

2025

Cited

0

Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs

2025

Cited

0

Resume (English only)

Co-authors

5 total

Director of the Center for AI Safety (advisor for xAI and Scale)

Professor of Computer Science, UC Berkeley

PhD Student, Carnegie Mellon University

University of Illinois at Urbana–Champaign

Professor of Computer Science, University of Illinois, Urbana Champaign