Software Engineer 4 – Data and Feature Infrastructure, AI Platform

About the job

At Netflix, our mission is to entertain the world. Together, we are writing the next episode - pushing the boundaries of storytelling, global fandom and making the unimaginable a reality. We are a dream team obsessed with the uncomfortable excitement of discovering what happens when you merge creativity, intuition and cutting-edge technology. Come be a part of what’s next. Machine Learning/Artificial Intelligence powers innovation in all areas of the business, from helping members choose the right title for them through personalization, to better understanding our audience and our content slate, to optimizing our payment processing and other revenue-focused initiatives. Building highly scalable and differentiated ML infrastructure is key to accelerating this innovation. ML models can only be as good as the data we provide them. That's why we continue to innovate on making data and feature engineering as simple, scalable, and efficient as possible. Are you interested in joining us on this mission? You will have the opportunity to build cutting-edge data and feature infrastructure that will power ML models across various domains, including personalized recommendations, payments, games, ads, and more.

Responsibilities

Design and build a near-real-time feature computation engine to generate ML features for both high-throughput training and low-latency inference applications.

Operate and manage the feature computation pipelines and feature serving infrastructure for various ML models across multiple ML domains.

Build and scale systems that accelerate training through performant data loading, transformation, and writing.

Create frameworks to streamline and expedite the availability of new data for training and serving.

Develop feature stores that enable feature discovery and sharing.

Increase the productivity of ML practitioners by making it easy to define and access features and labels for experimentation and productization.

Qualifications

Minimum

Experience in building ML or data infrastructure

Strong empathy and passion for providing a fantastic user experience to ML practitioners

Experience in building and operating 24/7 high-traffic and low-latency online applications

Experience with large-scale data processing frameworks such as Spark, Flink, and Kafka

Experience with data storage and serving technologies such as Iceberg and Cassandra

Experience in working with and optimizing Java and/or Python codebases

Experience with public clouds, especially AWS

Self-driven and highly motivated team player

Preferred

Experience in building and operating ML feature stores, such as Chronon

Experience in building embedding-based retrieval systems

Experience working with Notebooks such as Jupyter or Polynote