Senior Software Development Engineer, EC2 Trainium AI Infra

Amazon
USA, WA, Seattle2026-05-11ONSITE

About the job

The Software Development Engineer will lead the team in technical strategy, design, build, and operation of infrastructure services including provisioning and availability of AWS Trainium-based AI servers. This role requires expertise in architecting large-scale systems, building micro services, and cross-functional collaboration with several other teams such as capacity management, hardware engineering, and datacenter teams to manage AI/ML infrastructure.

Responsibilities

Design and develop innovative technologies that power the infrastructure supporting AI workloads on Ultraservers

Lead technical projects establishing EC2 as the pioneer in cloud computing for AI/ML workloads across diverse applications including LLMs, multimodal systems, and emerging model architectures.

Collaborate with various teams to influence architecture of provisioning systems and improve to operate at scale and efficiently.

Build customer relationships by investigating complex performance challenges, developing solutions, and publishing actionable best practices through multiple channels.

Qualifications

Minimum

5+ years of non-internship professional software development experience

5+ years of programming with at least one software programming language experience

5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience

Experience as a mentor, tech lead or leading an engineering team

Preferred

5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience

Bachelor's degree in computer science or equivalent