About the job
Scale AI’s Public Sector team is growing in the Generative AI space, and we’re seeking a Strategic Projects Lead to own high-impact projects that drive revenue and experimentation. In this role, you’ll work across operations, engineering, and customer engagement to produce world-class training data and test and evaluation data for Large Language Models for our Public Sector customers.
Responsibilities
Develop, build, and maintain the infrastructure required to ensure data pipelines are efficient, scalable, and produce high-quality outputs
Take ownership of day-to-day progress on high-priority data production pipelines, ensuring projects move forward efficiently
Partner with subject matter experts in their fields to validate the quality of our data and to translate deep domain knowledge into scalable processes and measurable outcomes
Work closely with customers to understand their requirements and design data taxonomies that optimize model performance
Utilize analytics and data visualization tools to track progress, identify bottlenecks, and make data-driven decisions to optimize pipeline performance
Drive cross-org collaboration to define and advance human data strategy, influencing technical and non-technical stakeholders to ensure data quality, scalability, and long-term platform leverage
Own progressively larger components of our data delivery processes, until you ultimately serve as the full owner of our most visible and high-impact customer pipelines
Qualifications
Minimum
5+ years of experience in product development, data science, or operations
A history of successful project management and comfort in ambiguity
Ability to analyze complex operational data, build queries, and identify trends to inform decisions and optimize processes
Technical aptitude to understand how to produce data for state-of-the-art post-training techniques such as supervised fine-tuning (SFT), reinforcement learning from human feedback (RLHF), and reinforcement learning with verifiable rewards (RLVR)
Preferred
Experience working in defense tech and/or an AI company
A technical degree in fields like computer science, data science, or engineering
A deep understanding of ML operations for generative AI workflows and products
An active Top Secret security clearance