Data Scientist - Synthetic Data and ML Evaluations

About the job

The Special Projects team at Apple is developing novel user-facing conversational features that leverage the multimodal capabilities of state-of-the-art foundation models. A key component of this process is the ability to produce complex simulated scenario data, in order to train and evaluate agentic AI models. We are looking for a skilled Data Scientist to work closely with our Simulation and Machine Learning Evaluations teams to generate large synthetic datasets, analyze the gap between simulated and real data, and evaluate and fine-tune agentic AI model performance at various tasks. A successful candidate is experienced in managing large, multi-modal datasets, in translating subjective product requirements into objective criteria, and has strong statistical analysis skills.

Responsibilities

Work closely with ML Engineers to understand model evaluation needs

Work closely with the Simulation team to design and generate large, multi-modal datasets for model evaluation

Analyze the gap between simulated and real data

Collaborate with the Simulation team to prioritize and address simulation performance gaps

Qualifications

Minimum

BA or Master’s degree in Computer Science, Data Science, or related field

2+ years of hands-on experience working with large data sets

Proficiency in Python

Excellent communication skills

Preferred

PhD in Computer Science, Data Science, Statistics, or other STEM field

Hands-on industry experience with product focused statistical analysis

Experience working with large-scale multimodal data and data-annotation pipelines

Experience working with simulations to produce large datasets

Experience with experimental design, A/B testing and Failure Analysis

A track record of publications or technical presentations in Data Science