Senior Software Engineer, AI/ML GenAI

Google
Sunnyvale, CA, USA

About the job

The mission of our team is to provide Enterprise customers with reliable, performant, cost-effective, and easy-to-use open-source and partner GenAI models. The service enables scalable and reliable serving of these models on the Vertex AI platform with a variety of features (e.g., Dynamic Shared Quota and Provisioned Throughput). For this role, we are looking for an engineer who is passionate about AI/ML model serving infrastructure, machine learning, GenAI/LLM serving, and capacity management to work with us and build successful products.

Responsibilities

Design and implement GenAI solutions, leverage ML infrastructure, and evaluate tradeoffs between different techniques and their application domains.

Design and develop the enterprise-grade AI model serving platform to empower cloud customers with the latest open-source generative AI models and LLMs on the GCP/Vertex AI platform.

Drive cross-team collaboration and alignment to deliver features that directly impact enterprise customers' businesses.

Work with external customers to understand their requirements, analyze capacity needs, onboard workloads, and perform benchmarking and load testing.

Maintain high standards for quality through design reviews and code reviews.

Qualifications

Minimum

Bachelor’s degree or equivalent practical experience.

5 years of experience programming in Python or C++.

3 years of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging).

3 years of experience testing, maintaining, or launching software products.

1 year of experience with GenAI techniques (e.g., LLMs, Multi-Modal, Large Vision Models) or with GenAI-related concepts (language modeling, computer vision).

1 year of experience with software design and architecture.

Preferred

Master's degree or PhD in Computer Science or related technical field.

5 years of experience with data structures and algorithms.

1 year of experience in a technical leadership role.

Experience developing accessible technologies.