Senior Researcher - Foundations of Generative AI- Microsoft Research

About the job

Microsoft Research AI Frontiers lab is seeking applications for the position of Senior Researcher – Foundations of Generative AI to join their team in New York, NY. The mission of the AI Frontiers lab is to expand the pareto frontier of Artificial Intelligence (AI) capabilities, efficiency, and safety through innovations in foundation models and learning agent platforms. As a Senior Researcher – Foundations of Generative AI, you will play a crucial role in leading, developing, improving, and exploring new architectures, representations, and learning objectives that unlock new capabilities and/or scalability. Your work will have a significant impact on the development of cutting-edge technologies, advancing the state-of-the-art, and providing practical solutions to real-world problems.

Responsibilities

Apply research and engineering skills to develop, prototype, and evaluate cutting-edge research ideas. Work closely with other researchers and engineers to rapidly prototype and test new research ideas, driving a high-impact agenda and publishing results where appropriate. Collaborate hands-on with other researchers, engineers, and internal and external product groups to deliver high-impact solutions to real-world problems. Embody our culture and values.

Qualifications

Minimum

Doctorate (or currently pursuing) in Computer Science or relevant field OR equivalent experience.

Preferred

Doctorate in Computer Science or relevant field AND 2+ years related research experience OR equivalent experience. Research program demonstrated by public artifacts like models, tools, code in the AI space or publications at the following conferences: NeurIPS, ICML, ICLR, ACL, NAACL, CVPR, COLT, ECCV, ICCV, EMNLP. 2+ years of academic or industry experience in developing, applying, and/or implementing algorithms for machine learning/statistics, using common ML engineering programming languages and platforms such as Python, Python numerical libraries, PyTorch, TensorFlow and/or HuggingFace. Experience publishing academic papers as a lead author or essential contributor in a top AI conference or journal. Deep understanding of frontier model architectures, especially transformers and state space models. Hands-on experience building and working with Large Language Models (LLMs) or multimodal models (VLMs, VLAs), including pre-training, fine-tuning, and inference. 2+ years of industry or academic experience with building, debugging and optimizing large-scale ML training pipelines. Demonstrated software engineering excellence building and deploying prototypes, applications, or open-source (OSS) technologies. Providing a link to a GitHub profile and/or code samples on your CV/resume, is highly encouraged. Ability to work independently and ramp-up quickly on complex projects or unfamiliar code. Ability to collaborate, communicate effectively, and work as part of a multi-disciplinary team. Keen interest in real-world applications and impact.