About the job
Help deliver one of the best foundational models in the world at Microsoft AI. At Microsoft AI, we are on a mission to train the world’s most capable AI frontier models, pushing the boundaries of scale, performance and product deployment. The Pre-Training team at Microsoft AI tackles some of the most challenging problems in deep learning at scale. As a team, we will deliver one of the best foundation models in the world, forming the foundation of many initiatives across Microsoft AI. We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field.
Responsibilities
Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations; Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack; Collaborate closely with teams on infrastructure, data, post-training, and multimodality; Embody our culture and values.
Qualifications
Minimum
Bachelor's Degree in Computer Science, Machine Learning, Mathematics, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
Preferred
Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience; Demonstrated experience in large-scale AI; Passionate about conversational AI and its deployment; Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers; Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI; Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team.