About the job
The Software Development Engineer will design, build, and maintain cloud-based provisioning and recovery systems for AWS Trainium-based AI UltraServers. This role requires expertise in AWS services, system architecture, and cross-functional collaboration with Capacity Management, Hardware Engineering, and Datacenter Operations to manage AI/ML infrastructure.
Responsibilities
- The Software Development Engineer is responsible for building and maintaining scalable micro services.
- They are adept at system design that solves the business problem efficiently.
- Work in environments where the technology strategy is defined but the solution design is not
- Build cloud-based solutions using AWS native services for scaling infrastructure frameworks
- Create observable systems with appropriate metrics and alarming
- Collaborate with customers and stakeholders to convert business needs into technical designs
- Participate in code reviews and technical assessments
Qualifications
Minimum
- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- 1+ years of software development engineer or related occupational experience
- 1+ years of designing and developing large-scale, multi-tiered, multi-threaded, embedded or distributed software applications, tools, systems, and services using: C#, C++, Java, or Perl experience
- 1+ years of Object Oriented Design experience
- Bachelor's degree or foreign equivalent in Computer Science, Engineering, Mathematics, or a related field
- Experience programming with at least one software programming language
Preferred
- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent