About the job
We are seeking an HBM/DDRx validation expert with a heavy focus on validation of AWS next generation ML Chips, Cards and server integration. As a member of our memory team, you will have the opportunity to participate in the execution of HBM across all Trainium platforms, with the goal of improving the characteristics of HBM for our world leading Trainium AI servers. Our HBM engineers need to independently work with vendors, understand the settings, write/modify tests, debug and collect data in the fleet.
Responsibilities
Collaborate with architects, design teams, and software engineers on our next generation ML chips
Support on-going debug and operations of previous ML chips within manufacturing and the data center
Dive deep into IP integration, packaging, silicon bring up, characterization, and validation of our HBM subsystems
Independently develop the scripts you need to execute and collaborate with software engineers as your needs scale
Qualifications
Minimum
BS in Electrical Engineering, Computer Engineering, Systems Engineering, Computer Science or related field.
5+ years of experience in Silicon development with
3+ years in SOC/IO/Subsystems
Good understanding of DDR/HBM at the PHY and controller level
Good knowledge of DDR/HBM training, timing parameters and/or controller features
Support the physical design team with IP integration, silicon design, 2.5D packaging, clocking and timing constraints
Ability to create scripts (lua, bash, python, etc.) to accomplish functional day to day tasks.
Drive cross-functional triage effort on functional and performance issues
Perform system-level debug and root-cause analysis through bring-up, characterization, validation and production phase
Experience Working with 3rd party IP and memory vendors
Preferred
MS in Electrical Engineering, Computer Engineering, Systems Engineering, Computer Science or related field.
Strong Firmware development skills within embedded environments
Good leadership skills and ability to multi-task and thrive in a dynamic environment
Knowledge of HBM, DDRx and related protocols
Good communication skills and interpersonal skills