Systems Development Eng (AWS Generative AI & ML Servers), AWS Hardware Engineering Accelerators

Amazon
Seattle, WA, USA / Cupertino, CA, USA / Austin, TX, USA2026-04-01ONSITE

About the job

Do you want to build the backbone of Generative AI cloud at AWS? Do you want to build the future of the cloud for AI training and inference? Want to do industry leading work delivering continuous price performance improvements in the cloud for AI model training for multi billion variable LLMs? Come Join us in designing, delivering and operating AWS cloud offerings that enable high performance and scalability in AI/ML and HPC workloads.

Responsibilities

Be a technical leader for your team, setting the standards for engineering best practices and operational excellence. Lead projects that require the work of multiple engineers, combining new and existing hardware, software, systems, processes, and tooling appropriate to the task.

Design technology solutions and architectures that solve complex business or technical problems related to server system testability, reliability, and diagnosis. Decompose difficult problems into straightforward tasks, components, or features that can be delivered in parallel by you and others.

Own your team’s systems, proactively identifying and fixing extant risks, limitations, and deficiencies. Work to reduce complexity and enable greater agility for your team and partner teams.

Partner with your manager and other team leaders to develop your team’s technical strategy. Be a key influencer in team strategy and goals, including influencing Technical Program Managers and Product Managers.

Resolve the contributing causes of endemic problems, including architectural deficiencies and areas where your team limits the innovation of other teams. This may require influencing software and systems decisions made by other teams.

Lead reviews of architecture, design, operations, process, or post-incident analysis for your team, and actively participate in those of other teams. Your process improvements enable teams in your organization to deliver more resilient products with less effort.

Deliver solutions using the most appropriate combination of hardware, software, systems design, architecture, process, or operations. When the most appropriate solution requires expertise you lack, own the problem but partner with experts in other job families.

Effectively communicate technical designs and decisions in writing. Produce exemplary artifacts including code, designs, documentation, and runbooks that are straightforward for others to adopt, maintain, and extend.

Actively hire and develop others, coaching and mentoring throughout your organization. When eligible, provide support with technical assessments for promotions, ensuring these processes are unbiased and inclusive.

Bring perspective and provide context for current technology choices and guide future technology choices for your team. Identify strategic opportunities that solve tactical problems and determine when to invest in one over the other for greatest impact.

Qualifications

Minimum

You are knowledgeable of the full technical stack - vertically from baremetal server hardware up to the software in userland, and everything in the middle.

You have tremendous interest in cloud scale and curious how systems and software decisions impact the user.

You insist on highest-standards and are able to develop tactical solutions/tools to diagnose and fix issues.

You are an excellent systems debugger - finding interaction issues between components on server systems.

You are a leader with strong organizational, planning, and communication skills.

You are a builder!

Preferred

- Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle, including coding standards, code reviews, source control management, build processes, testing, certification, and livesite operations

- Experience taking a leading role in building complex software or computing infrastructure that has been successfully delivered to customers

- Experience using managed ML/AI solutions

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.

USA, CA, Cupertino - 173,900.00 - 235,200.00 USD annually

USA, TX, Austin - 151,200.00 - 204,600.00 USD annually

USA, WA, Seattle - 151,200.00 - 204,600.00 USD annually