About the job
Do you want to shape the future of Generative AI at AWS? Join the team building the foundation of the world’s most advanced cloud for AI training and inference — where multi-billion-parameter models come to life at scale. Here, you’ll design, deliver, and operate next-generation infrastructure that powers breakthrough innovation in AI/ML and HPC workloads. If you’re passionate about pushing the limits of performance, efficiency, and scalability in the cloud, this is your opportunity to build the systems that define what’s next for AWS — and for the entire AI industry.
Responsibilities
interfacing with our internal and external customers to understand project requirements and facilitate system development on top of your server design; solving operational challenges to our existing fleet with the goal of improving the current customer experience as well as developing improved systems for future designs; working directly with vendors and ODM/JDM design teams to develop and manufacture your product at scale
Qualifications
Minimum
- Bachelor's degree or above in electrical engineering, computer engineering, or equivalent
- Experience with server, storage, networking, or large-scale distributed systems
- Experience in developing functional specifications, design verification plans and functional test procedures
- Experience troubleshooting issues and root cause analysis
- Experience leading ODMs and other suppliers in the product development and manufacturing processes
Preferred
- Master's degree in Electrical Engineering, Computer Engineering, or a related technical field
- Experience working with interdisciplinary teams to execute product design from concept to production
- Expertise in product development disciplines such as, thermal, mechanical, power, FW/SW, reliability, and sustaining
- Experience deploying and operating hardware and applications across large data centers.