About the job
You will join a dynamic team working at the cutting edge of the GenAI revolution by applying AI to AI. You will work on building agents, tools, and models to simplify and accelerate customer adoption of Neuron, the software stack supporting Amazon's Machine Learning silicon: Trainium. Partnering with external and internal customers, you will identify key obstacles and opportunities to accelerate their migration to AWS's ML silicon. You will be the technical lead for a team building AI agents and tools that simplify AWS Neuron adoption, and drive the team's vision and strategy in this space critical to AWS's Generative AI business.
Responsibilities
Research implementations that deliver the best possible experiences for customers.
Deliver on goals to reduce the time and effort it takes to port and optimize Machine Learning workloads on Neuron.
Solve challenging technical problems, often ones not solved before, at every layer of the stack.
Design, implement, test, deploy and maintain innovative software solutions to transform service performance, durability, cost, and security.
Build high-quality, highly available, always-on products.
Potentially contribute intellectual property through patents.
Assist in the career development of others, actively mentoring individuals and the community on advanced technical issues and helping managers guide the career growth of their team members.
Exert technical influence over the team, increasing their productivity and effectiveness by sharing your deep knowledge and experience.
Qualifications
Minimum
5+ years of experience leading the design or architecture (design patterns, reliability, and scaling) of new and existing systems
5+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
Experience as a mentor, tech lead, or leader of an engineering team
5+ years of non-internship professional software development experience
5+ years of programming experience with at least one programming language
Hands-on technical experience working in the Generative AI space
Excellent written and verbal communication skills with the ability to present complex technical information concisely to executives and non-technical leaders.
Experience in one or more of the following areas: ML compilers, production coding agents, GenAI model architecture, model training, neural network optimization, or, alternatively, applied math.
Preferred
Master's degree or above in computer science or equivalent
2+ years in machine learning or other computational modeling environments with an emphasis on hosting, building or optimizing models for diverse hardware platforms
Proven track record in building AI agents that automate ML workload optimization, ML compiler tuning, distributed inference and training, or ML kernel authoring and optimization
Experience working with open-source software communities in the optimization space or related areas
Domain-level knowledge of AWS services
Knowledge of the state-of-the-art technology used in the Machine Learning space and its mathematical underpinnings