About the job
We’re seeking a Member of Technical Staff – Applied Scientist to help design and build advanced Copilot features such as Deep Research and Web artifact generation. This role demands deep expertise in large language models (LLMs) and a strong architectural mindset to shape complex, user-facing systems. You’ll contribute to the evolution of Copilot by developing scalable methods for evaluating feature performance, designing data collection pipelines for prompt engineering and fine-tuning, and training content classifiers that support intelligent, context-aware interactions. The ideal candidate brings hands-on experience in building applications powered by LLMs, along with a solid foundation in data science and machine learning. You’re a proactive collaborator who communicates clearly, thrives in fast-paced environments, and takes ownership of delivering world-class consumer experiences. If you enjoy pushing the boundaries of AI-driven products and working across disciplines to create intuitive, high-impact solutions, we’d love to hear from you.
Responsibilities
Architect and implement advanced Copilot features such as Deep Research and Web artifact generation
Lead evaluation efforts of models deployed within Copilot, ensuring performance aligns with product goals
Design scalable systems that leverage large language models (LLMs) to deliver intelligent, user-facing experiences
Develop evaluation frameworks and metrics to assess feature performance and user impact
Conduct thorough reviews of data analysis and techniques to identify gaps and areas for re-examination
Build data collection pipelines to support prompt engineering and fine-tuning of LLMs
Track advances in industry and academia, identify relevant state-of-the-art research, and adapt algorithms to drive innovation
Train and optimize content classifiers to enable context-aware interactions within Copilot
Independently write efficient, readable, and extensible code and model pipelines
Contribute to defining the model quality roadmap for Copilot, balancing technical rigor with business and product priorities
Collaborate cross-functionally with product, engineering, and research teams to ship high-quality consumer features
Commit to a customer-oriented focus by validating customer perspectives, understanding broader context, and serving as a trusted advisor
Communicate technical concepts clearly and contribute to team-wide knowledge sharing
Take initiative in a fast-paced environment, driving innovation and continuous improvement in AI-powered products
Qualifications
Minimum
Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics predictive analytics, research)OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research)OR equivalent experience.
Experience prompting, evaluating, and working with large language models.
Experience writing production-quality Python code.
Preferred
Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 6+ years related experience (e.g., statistics, predictive analytics, research)OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics, predictive analytics, research)OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)OR equivalent experience.
Demonstrated interest in Responsible AI.