[2026] Senior Machine Learning Engineer, Multimodal AI, Computer Vision and Graphics

About the job

At Roblox, computer vision and graphics power the way our global community discovers, creates, and interacts on our virtual platform. This role involves building cutting-edge models to interpret and analyze every form of content—experiences, text, images, videos, and 3D avatars. Your work will directly influence core systems that drive the next generation of creation, search, recommendations, and trust & safety initiatives across our massive ecosystem.

Responsibilities

Design and implement foundation models for visual and 3D-based creation, search, and recommendations, ensuring a high level of fidelity, relevance, and ranking.

Break down complex product requirements into iterative deliverable stages, moving applied research into high-scale production systems.

Implement innovative visual and multi-modal models that power core Roblox functions (e.g., world creation, avatar systems, search, and recommendations).

Build high precision facial age estimation across demographics from ground up including various fraud detection techniques for a robust and safe user identity system.

Qualifications

Minimum

Possessing or pursuing a PhD in computer science, engineering, or a related field, with a thesis aligned to Roblox’s research areas.

Expertise in one or more areas: computer vision, multimodal learning, 3D Graphics, or large-scale representation learning.

Experience developing and training deep learning models using modern frameworks (PyTorch, TensorFlow, JAX).

A strong research track record, evidenced by multiple publications and presentations in top-tier, peer-reviewed venues.

Proficiency in one or more programming languages (e.g., Python, C++, Go, Java) and experience building and optimizing large-scale systems.

Preferred