Research Intern - LLM Performance Optimization

Microsoft
Microsoft worksite location | 2025-12-16 | Onsite

About the job

Research Internships at Microsoft provide a dynamic environment for research careers, with a network of world-class research labs led by globally recognized scientists and engineers who pursue innovation across a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

Responsibilities

Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers but also contribute to exciting strides in research and development. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research and are offered year-round, though they typically begin in the summer.

Qualifications

Minimum

Currently enrolled in a PhD program in Computer Science or a related STEM field. At least 1 year of experience with Large Language Model architecture or inference performance optimization.

Preferred

Demonstrated ability to assess and fix kernel performance bottlenecks for GPUs or other high-performance parallel computer architectures. Familiarity with optimizing compiler architecture and intermediate representations (such as LLVM IR or MLIR). Ability to think unconventionally to derive creative and innovative solutions.