Senior Research Scientist, Post-Training LLM and DLM

About the job

We are now looking for a Senior Research Scientist passionate about Large Language Model (LLM) and Diffusion Language Model (DLM) post-training and system optimization. Are you excited to shape the future of large-scale generative AI? NVIDIA is at the forefront of foundation models and generative AI systems, enabling cutting-edge research and real-world deployment at unprecedented scale. Our team is dedicated to advancing post-training algorithms, building efficient large-scale systems, and developing evaluation frameworks to ensure reliability and scalability. Join us to work with world-class researchers and engineers on building the next generation of AI.

Responsibilities

Designing and implementing post-training algorithms LLMs and DLMs.

Driving efficiency and scalability improvements across training pipelines and serving systems

Collaborating with researchers to translate cutting-edge ideas into production-ready implementations.

Exploring new paradigms for evaluation.

Demonstrating strong engineering practices, and contributing to open-source communities.

Qualifications

Minimum

PhD in Computer Science, Electrical Engineering, or related field, or equivalent research experience in LLMs, systems, or related areas.

2+ years of experiences in machine learning, systems, distributed computing, or large-scale model training.

Proficiency in Python with hands-on experience in frameworks such as PyTorch.

Solid background in computer science fundamentals: algorithms, data structures, parallel/distributed computing, and systems programming.

Proven ability to collaborate across research and engineering teams in multifaceted environments.

Preferred

Expertise in post-training LLMs with novel algorithmic/data pipelines

Experience developing andscaling large distributed systems for deep learning.

Contributions to open-source LLM systems or large-scale AI infrastructure.