Machine Learning Research Engineer, Siri Comprehension & Planning, Siri Agent Modeling

About the job

The future of AI is on-device. On the Siri team, we're solving the defining challenge: building and shipping powerful LLMs that run with extreme efficiency across hundreds of millions of iPhones, Watches, and Macs. You will own the full ML stack, from data to deployment, bridging the gap between bleeding-edge research and product-critical models. This is where the hardest problems in applied ML are being solved.

Responsibilities

Training & Fine-Tuning: Architect and train LLMs specifically for assistant behavior. This is your opportunity to define what "reasoning" and "task completion" mean for a billion users.

Performance & Optimization: Master the art of model surgery. You'll design and implement novel context and adapter strategies to make our models faster and smarter within the tight constraints of on-device deployment.

Data as a Product: You understand that world-class models are built on world-class data. You will own the curation, augmentation, and evaluation pipelines that are the lifeblood of our systems.

System Architecture: Your insights will shape not just the model, but the entire system. You'll work with cross-functional experts in software, research, and product to build a cohesive, intelligent experience.

Qualifications

Minimum

Experience shipping LLM-based products

Excellent Python engineering skills and knowledge of deep learning frameworks (one of PyTorch, JAX, TensorFlow)

Machine learning research track record or excellent knowledge of the machine learning state of the art

Master's or PhD in Computer Science, Machine Learning Engineering, or equivalent professional experience

Preferred

Knowledge of Natural Language Processing

Experience with Small Language Models

Experience with Agentic AI