Senior Deep Learning and Computer Vision Engineer - Autonomous Vehicles

Nvidia
US, CA, Santa Clara / US, WA, Redmond / US, WA, Seattle2026-03-17onsite

About the job

We are looking for a Deep Learning and Computer Vision engineer for our Autonomous Vehicles team. The role involves applying state-of-the-art techniques to build ground truth for autonomous vehicles, a critical aspect of our next-generation products. You will have the opportunity to work with top researchers and engineers in the field of deep learning and computer vision to deliver impact to our customers around the world.

Responsibilities

Stay up to date with the latest research and innovations in deep learning, implement and experiment with new ideas.

Foster collaborative partnerships with our researchers and engineers, transforming innovative research into robust, industrial-strength machine learning models.

Taking approaches from initial evaluation and experimentation all the way to shipping.

Defining and collecting training datasets.

Building training pipelines and real-time inference run-times (PyTorch, TensorFlow, TensorRT, Python, C++)

Qualifications

Minimum

PhD with 1+ year, or MS (or equivalent experience) with 5+ years, of relevant experience in Computer Science, Computer Engineering, or a related technical field.

Proven experience building robust software.

Passionate about Artificial Intelligence for robotics and autonomous navigation

Strive to learn new things and like solving hard problems

Math knowledge

Experience in Deep Learning / Machine Learning. You have a background in Computer vision and/or Planning/Control.

Programming and debugging skills in C++ and/or Python.

Good communication and analytical skills. Ability to work with multiple teams in a dynamic environment.

Preferred

Background in applying latest AI methods to solve Computer Vision and Autonomous Vehicles problems

Experience with Unsupervised or Self-supervised Learning

Involvement with architecture optimization, pruning, curriculum & multi-task training

Experience fusing data from different sensor modalities (e.g. Images and LIDAR data) to enable information conflation, label propagation, cross training.