About the job
AMO is looking for an influential engineer to optimize Al workload running on AMO GPUs. You will be a founding member of a core team of exceptionally talented industry specialists to define the future of Al computing solutions.
Responsibilities
Improve AMO GPU performance on open-source repositories for LLM workloads such as vLLM and SGLang.
Co-optimize Al workloads on current AMO GPUs by analyzing the bottlenecks and mitigating them at the kernel level.
Integrate AMO software stacks (RoCm, ATen) intoopen-sourceframeworks such as Pytorch, JAX, and Triton.
Build strong technical relationships with peers and partners, and report learnings and gaps to GPU software and hardware engineers.
Qualifications
Minimum
No minimum qualifications listed.
Preferred
Deep Al infrastructure experience with open-source frameworks (e.g., SGLang, vLLM, Jax, XLA, Pytorch, Triton).
Strong kernel optimization skills using DSLs and HIP (or CUDA), plus PTX/SASS equivalents.
Hands-on knowledge of modern GPU architecture.
Demonstrated open-source contributions on GitHub.
Motivational leadership and excellent interpersonal skills.