About the job
We build frontier foundation models that power intelligent experiences at Apple. Our team works across the full training lifecycle: including pre-training foundation models, and developing mid-training approaches that bridge general capability and task-specific performance. What makes our work distinct is that we're engineering models specifically for Apple silicon and optimized for experiences that are private, personal, and deeply integrated into the OS. We're solving frontier problems in reward modeling to resist reward hacking, handling sparse and delayed rewards in agentic settings, and aligning models reliably across the spectrum from open-ended creative tasks to precise, action-taking workflows. If you're drawn to hard problems where the research and the product are inseparable, this is the team
Responsibilities
No responsibilities listed.
Qualifications
Minimum
Demonstrated expertise in deep learning with a publication record in relevant conferences (e.g., CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR, COLM, ACL, NAACL, EMNLP, KDD, ACL) or a track record in applying deep learning techniques to products
Proficient programming skills in Python and one of the deep learning toolkits such as JAX, PyTorch, or Tensorflow
Ability to work in a collaborative environment.
PhD, or equivalent practical experience, in Computer Science, or related technical field.
Preferred
Experience training or adapting vision-language models
Experience with image/video generation or editing
Post-training, mid-training large language models or multimodal models.
Reinforcement learning, on-policy distillation.