About the job
Help craft the next generation of spatial awareness and applied perception technologies at Apple. Our team focuses on innovating, building, and productizing new ways for our devices to understand and interact with the physical world and the user. We leverage cutting-edge computer vision, foundation models, and machine learning to enable highly intuitive, seamless forms of human-computer interaction.
Responsibilities
We are looking for a highly motivated Machine Learning and Computer Vision Engineer to develop state-of-the-art perception algorithms. In this role, you will tackle complex challenges in egocentric vision, action recognition, and sensor-based perception. You will be responsible for designing highly efficient ML training pipelines, fine-tuning vision transformers, and utilizing active learning techniques to maximize model performance while minimizing data annotation costs. If you are passionate about pushing the boundaries of spatial computing and efficient ML, we want you on our team.
Qualifications
Minimum
Ph.D. (or M.S. with equivalent applied research experience) in Computer Engineering, Computer Science, Electrical Engineering, or a related field.
Deep expertise in Machine Learning and Computer Vision, with a strong focus on Vision Transformers (ViTs), CNNs, and Foundation Models.
Proven track record of developing and optimizing efficient ML training pipelines, specifically utilizing Active Learning, domain adaptation, or semi-supervised learning to reduce data annotation bottlenecks.
Hands-on experience with egocentric video understanding, action recognition, or scene classification.
Strong programming skills in Python and C/C++, with deep proficiency in modern deep learning frameworks (PyTorch, TensorFlow).
Experience working with novel or multi-modal sensor data (e.g., polarization, depth, IR) and synthetic/semi-synthetic data generation.
Ability to optimize data-loading and training infrastructure to maximize GPU efficiency.
Familiarity with 3D computer vision and spatial/robotic perception.
Preferred
Takes initiative to address new problems and takes responsibility for the results.
Excellent collaboration skills and a proven ability to publish in top-tier ML/CV conferences or journals.
Interested in creating new technologies and user experiences.
Passion for actively learning new skills, techniques, and programming languages/libraries/frameworks.
Thrives in a collaborative environment and communicates effectively.
Creative and inquisitive, seeking to solve everyday problems in innovative ways.
Laser-focused on the smallest details that are meaningful to our customers.
Desire to be exposed to all aspects of development: prototyping, planning, designing, productizing, launching, and scaling.