About the job
We are looking for a Senior Applied Research Scientist who is experienced in training large language models and/or large multimodal models. In this role, you will explore novel LLM/LMM architectures and large-scale training techniques to advance the state of the art. You will be part of a world-class research team working on pre-training, fine-tuning, RL, and alignment of large language and multimodal models, while keeping up to date with the latest progress and trends in LLMs/LMMs and foundation models.
Responsibilities
Train, fine-tune, and apply RL to LLMs/LMMs.
Improve on state-of-the-art LLMs/LMMs.
Accelerate the training and inference speed of LLMs/LMMs.
Research novel ML techniques and model architectures.
Influence the direction of the AMD AI platform.
Publish your work at top-tier venues.
Qualifications
Minimum
No minimum qualifications listed.
Preferred
Experience developing and debugging Python code.
Experience with ML frameworks such as PyTorch, JAX, or TensorFlow.
Experience with distributed training.
Expertise in LLM/LMM pre-training, fine-tuning, and/or RL.
Expertise in transformer architectures.
Strong publication record in top-tier conferences and journals.