
Michael Dennis

Google Scholar ID: WXXu26AAAAAJ

Google DeepMind

Open-Endedness, Unsupervised Environment Design, AI Safety


Citations & Impact

Citations (all-time): 2,073

Contact

Email: michael_dennis@cs.berkeley.edu

Publications

8 items

- Preventing Learning Stagnation in PPO by Scaling to 1 Million Parallel Environments (2026)
- Beyond Fixed Tasks: Unsupervised Environment Design for Task-Level Pairs (2025)
- Robust and Diverse Multi-Agent Learning via Rational Policy Gradient (2025)
- Generating Creative Chess Puzzles (2025)
- Evaluating In Silico Creativity: An Expert Review of AI Chess Compositions (2025)
- Mitigating Goal Misgeneralization with Minimax Regret (2025)
- Multi-Agent Risks from Advanced AI (2025)
- BAMDP Shaping: a Unified Framework for Intrinsic Motivation and Reward Shaping (2024)

Resume

Academic Achievements

Published several papers, including:
- PAIRED: Presented at NeurIPS 2020 (top 1% of submissions). Introduces a method for finding minimax-regret policies by training an adversary to generate levels that are hard for the protagonist but easy for the antagonist.
- Adversarial Policies: Investigated how deep reinforcement learning agents can be affected by adversarial strategies from other agents, demonstrating the existence of such policies in zero-sum games involving simulated humanoid robots.

Research Experience

Currently a Research Scientist on Google DeepMind's Open-Endedness team. Previously conducted research as a Ph.D. student at CHAI.

Education

Ph.D. student at the Center for Human-Compatible AI (CHAI), advised by Stuart Russell. Prior research focused on computer science theory and computational geometry.

Background

Interested in the intersection of problem specification and open-ended complexity, with a focus on Unsupervised Environment Design (UED): automatically building complex, challenging environments that promote efficient learning and transfer. Also deeply involved in decision theory.

Miscellany

Reachable via email, Twitter, Google Scholar, and GitHub.
