About the job
Copilot Tuning is a new product that aims to fine-tune large language models (LLMs) on tenant data, enabling task-specific agents and solutions. We are a small, nimble team that is advancing the state of the art of models in M365 Copilot. Come join our team and help transform the LLM experience in the enterprise. We are seeking Senior Applied Scientists and Principal Applied Scientists (Multiple Positions) with strong research skills and the desire to pursue the cutting edge in model development that pushes technological boundaries. We are looking for candidates with interest and experience in language model training, data pipelines, and shipping high-quality models, willing to delve deep with customer data and generalize the learnings for the broader product.
Responsibilities
Write and execute training pipelines for large language models post-training.
Design experiments to show the effectiveness of LLM-based solutions.
Design and implement inference solutions that incorporate post-trained models following product specifications and work with broader team to ship these solutions to customers.
Document experiments and communicate results across the team.
Mentor early in career team members.
Qualifications
Minimum
Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics predictive analytics, research)
OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)
OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research)
OR equivalent experience.
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:
Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Preferred
Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 6+ years related experience (e.g., statistics, predictive analytics, research)
OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics, predictive analytics, research)
OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)
OR equivalent experience.
2+ years of experience training/fine tuning AI/ML models, preferably LLMs/SLMs (small learning model).
2+ years of experience with Python and/or ML frameworks such as PyTorch.
4+ years experience creating publications (e.g., patents, libraries, peer-reviewed academic papers).
2+ years experience presenting at conferences or other events in the outside research/industry community as an invited speaker.
2+ years of experience building Generative AI pipelines, e.g. with RAG (Retrieval augmented generation).