Scholar

Florian Mai

Google Scholar ID: MfETM20AAAAJ

Junior Research Group Leader, Uni Bonn

AI alignmentLLM reasoningLLMs

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

450

H-index

i10-index

Publications

Co-authors

list available

Contact

Emailfmai@uni-bonn.de GitHubOpen ↗

Publications

9 items

Reasoning Primitives in Hybrid and Non-Hybrid LLMs

2026

Cited

Raising Bars, Not Parameters: LilMoo Compact Language Model for Hindi

2026

Cited

Understanding Artificial Theory of Mind: Perturbed Tasks and Reasoning in Large Language Models

2026

Cited

IKnow: Instruction-Knowledge-Aware Continual Pretraining for Effective Domain Adaptation

2025

Cited

AI Alignment Strategies from a Risk Perspective: Independent Safety Mechanisms or Shared Failures?

2025

Cited

Survey-to-Behavior: Downstream Alignment of Human Values in LLMs via Survey Questions

2025

Cited

In-Training Defenses against Emergent Misalignment in Language Models

2025

Cited

Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models

2025

Cited

Resume (English only)

Academic Achievements

Publications: 1. 'Superalignment with Dynamic Human Values' accepted at ICLR 2025 Workshop on Bidirectional Human-AI Alignment; 2. JQL paper accepted at EMNLP 2025; 3. Survey-to-Behavior preprint; 4. Preprint on in-training defenses against emergent misalignment; 5. JQL preprint.
Academic Activities: 1. Organized the International Conference on Large-Scale AI Risks; 2. Participated in a panel discussion on trustworthy AI at the Deutsches Museum Bonn; 3. Offered a seminar course on the ethics of Artificial General Intelligence.

Research Experience

Position: Junior Research Group Leader at the mAI alignment lab, University of Bonn. Research Projects: 1. Scalable Oversight by Learning to Decompose Tasks; 2. Emergent Misalignment; 3. Value Alignment.

Background

Research Interests: Scalable Oversight, Value Alignment, Emergent Misalignment, Reasoning Models, LLM Training. Professional Field: AI Alignment and Safety Issues. Introduction: Leads the mAI alignment lab at the University of Bonn, focusing on ensuring that current and future advanced AI systems act reliably in accordance with human values.

Miscellany