Senior Research Scientist, Efficient Deep Learning

Nvidia
US, CA, Santa Clara2026-01-09onsite

About the job

NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning to join our learning and perception research team. We are passionate about research that pushes boundaries but also has impact in the real world. We are particularly excited about methods for post-training model optimization (pruning, quantization, NAS), efficient architecture design, adaptive/dynamic inference, resource-efficient training and fine-tuning, and so forth. You will work within an amazing and collaborative research team that consistently publishes at the top venues in computer vision and machine learning. Our existing expertise includes computer vision, deep learning, generative models, and so forth. Your contributions have the chance to create real impact on our products.

Responsibilities

Research, design and implement novel methods for efficient deep learning.

Publish original research. Speak at conferences and events.

Collaborate on research with internal team members, internal teams as well as external researchers. Mentor interns.

Work with product groups to transfer technology.

Qualifications

Minimum

A Ph.D. in Computer Science/Engineering, Electrical Engineering, etc., or equivalent experience in industrial research labs.

3+ years or relevant post-graduate research experience

Excellent knowledge of theory and practice of computer vision methods, as well as deep learning.

A background in pruning, quantization, NAS, efficient backbones is required.

Excellent programming skills in Python and PyTorch; C++ and parallel programming (e.g., CUDA).

Hands-on experience with large-scale model training including data preparation and model parallelization (tensor and pipeline) is required.

An outstanding research track record and strong communications skills.

Preferred

Experience with large language models and large vision-language models is a plus.