Research Engineer, Frontier Evals & Environments

OpenAI
San Francisco
2025-04-13

About the job

We seek exceptional research engineers who can push the boundaries of our frontier models. Specifically, we are looking for people who will help shape our empirical understanding of the full spectrum of AI capabilities measurement and who will own individual threads within this effort end-to-end.

Responsibilities

Create ambitious RL environments to push our models to their limits

Work on measuring frontier model capabilities, skills, and behaviors

Develop new methodologies for automatically exploring the behavior of these models

Help steer training for our largest training runs, and see the future first

Design scalable systems and processes to support continuous evaluation

Build self-improvement loops to automate model understanding

Qualifications

Minimum

Passionate and knowledgeable about AGI/ASI measurement

Strong engineering and statistical analysis skills

Able to think outside the box, with a robust “red-teaming mindset”

Experienced in ML research engineering, stochastic systems, observability and monitoring, LLM-enabled applications, and/or another technical domain applicable to AI evaluations

Able to operate effectively in a dynamic and extremely fast-paced research environment, and to scope and deliver projects end-to-end

Preferred

First-hand experience in red-teaming systems, whether computer systems or otherwise

An ability to work cross-functionally

Excellent communication skills