Principal Machine Learning Engineer

Microsoft
San Francisco Bay area, USA / New York City metropolitan area, USA2026-04-13onsite

About the job

As a Principal Machine Learning Engineer, you will work on the Data Labeling and classification on large scale multi modal Copilot data part of the Microsoft AI (MAI) organization. We’re looking for a hands-on ML engineer to prototype and productionize complex classification flows on real production logs, operate prompted classifiers at scale (ad hoc and scheduled), and build secure, compliant data-labeling pipelines. We’re looking for someone with experience in data pipelines, data science, and machine learning, as well as a strong communicator and great teammate. The right candidate takes the initiative and enjoys building world-class consumer experiences and products in a fast-paced environment.

Responsibilities

Build evaluation loops (precision/recall, calibration, drift, human-in-the-loop) and publish dashboards/SLOs.

Generalize machine learning (ML) solutions into repeatable frameworks.

Operationalize prompted classifiers at scale (batch & streaming), including orchestration, autoscaling, monitoring, and cost guardrails.

Conduct thorough review of data analysis and techniques used to summarize the process review and highlight areas that have been missed or need re-examining.

Collaborate cross-functionally with DS, Security, and Platform to define schemas, access patterns, and governance.

Independently write efficient, readable, extensible code and model pipelines.

Commit to a customer-oriented focus by acknowledging customer needs and perspectives, validating customer perspectives, focusing on broader customer context, and serving as a trusted advisor.

Qualifications

Minimum

Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.

Preferred

7+ years' experience writing production-quality Python or Java or Scala code.

5+ years' experience in distributed systems design and implementation of large scale data processing systems

3+ years' experience building ML data pipelines using atleast one of AML, Promptflow, Langchain or LangGraph

Demonstrated interest in Responsible AI.

Experience prompting, evaluating, and working with large language models.