About the job
We seek exceptional research engineers that can push the boundaries of our frontier models. Specifically, we are looking for those that will help us shape our empirical grasp of the whole spectrum of AI capabilities measurement and will own individual threads within this endeavor end-to-end.
Responsibilities
Create ambitious RL environments to push our models to their limits
Work on measuring frontier model capabilities, skills, and behaviors
Develop new methodologies for automatically exploring the behavior of these models
Help steer training for our largest training runs, and see the future first
Design scalable systems and processes to support continuous evaluation
Build self-improvement loops to automate model understanding
Qualifications
Minimum
Passionate and knowledgeable about AGI/ASI measurement
Strong engineering and statistical analysis skills
Able to think outside the box and have a robust “red-teaming mindset”
Experienced in ML research engineering, stochastic systems, observability and monitoring, LLM-enabled applications, and/or another technical domain applicable to AI evaluations
Able to operate effectively in a dynamic and extremely fast-paced research environment as well as scope and deliver projects end-to-end
Preferred
First-hand experience in red-teaming systems—be it computer systems or otherwise
An ability to work cross-functionally
Excellent communication skills