Principal Product Manager, AI Model Security

Microsoft
San Francisco Bay area, USA / New York City metropolitan area, USA2026-04-21onsite

About the job

We are hiring a Product Manager to own AI model security — the discipline of making our frontier models resilient against adversarial attack and purpose-built for security practitioners. This role has a dual mandate: (1) harden our models against the full spectrum of LLM security threats and (2) partner closely with Microsoft Security product teams to ensure our models deliver best-in-class capabilities for real-world security workflows.

Responsibilities

Own the model security roadmap: Define and prioritize the security hardening strategy for our frontier models across the full OWASP LLM threat surface — prompt injection (direct and indirect), data exfiltration, jailbreak resistance, system prompt leakage, training data extraction, and adversarial manipulation of agentic workflows.

Drive zero-day and exploit defense: Work with researchers to evaluate and mitigate the risk of models being used to generate zero-day exploits, malware, or novel attack vectors. Define thresholds, build evaluation datasets, and own the decision framework for what the model should and should not be capable of in the security domain.

Build and scale red-teaming frameworks: Design, run, and iterate adversarial testing programs — both automated and human-driven — to continuously probe model vulnerabilities. Establish metrics (e.g., jailbreak success rate, injection bypass rate, exfiltration resistance) and drive measurable improvement over time.

Partner with Microsoft Security product teams: Work closely with Azure Security and Security Copilot teams to translate their product requirements into model training priorities. Ensure our models are purpose-built for threat detection, incident triage, vulnerability assessment, log analysis, and compliance reasoning.

Define security-specific model evaluations: Build benchmark suites and evaluation frameworks that measure real-world security usefulness — not just academic performance. Drive training data strategy to improve domain-specific model quality for security practitioners.

Qualifications

Minimum

Bachelor's Degree AND 5+ years experience in product management, security engineering, or software development OR equivalent experience

Demonstrated hands-on experience with AI/ML systems — you have personally built, evaluated, or shipped ML-powered products or security tools

Deep familiarity with LLM security threats: prompt injection, jailbreaking, data exfiltration, adversarial attacks on generative models — through professional experience, red-teaming, or security research

Experience defining product requirements and driving decisions in partnership with researchers or ML engineers

Track record of building evaluation systems, security benchmarks, or adversarial testing frameworks — not just consuming them

Ability to operate autonomously, make decisions with incomplete information, and drive projects from ambiguity to shipped outcomes

Preferred

Technical background in computer science, security, or AI/ML — a postgraduate degree is a plus but not required

Experience in offensive security, penetration testing, or red teaming — ideally applied to AI/ML systems

Familiarity with security workflows and tooling (SIEM, SOAR, EDR, threat intelligence platforms) and how practitioners use them in production

Understanding of the model lifecycle (pre-training, fine-tuning, RLHF, deployment, monitoring) and where security interventions are most effective

Experience working with or within enterprise security organizations (e.g., Microsoft Security, CrowdStrike, Palo Alto Networks, or similar)

Published research, blog posts, or public contributions in AI security, adversarial ML, or LLM red teaming