Research Intern - Model Optimization and HW Acceleration

Microsoft
San Francisco Bay Area / New York City metropolitan area
2025-11-21
onsite

About the job

Research Interns at Microsoft work in a dynamic environment with world-class research labs, pursuing innovation in various scientific and technical disciplines. The Applied Sciences Group (ASG) is seeking a highly motivated and talented PhD student to join the team as a Research Intern, focusing on Large Language Models (LLMs), Vision-Language Models (VLMs), LLM/VLM-based agents, and Generative Image & Video.

Responsibilities

Conduct research in one or more of the following areas: LLMs, VLMs, and Generative Image & Video.

Design and implement experiments to test new hypotheses and validate research findings.

Develop and optimise algorithms and models for various AI applications.

Collaborate with team members to integrate research outcomes into practical solutions.

Present research findings to the team and contribute to publications and patents.

Qualifications

Minimum

Enrolled in a full-time bachelor's, master's, MBA, or PhD program in computer science, computer engineering, electrical engineering, or a related field during the academic term immediately before the internship. At least one additional quarter/semester of school remaining following the completion of the internship.

Preferred

First-author publication(s) at top-tier venues or demonstrably comparable research impact

Experience with emerging memory technologies such as CXL-based memory

Clear communication, self-motivation, curiosity, and a bias for hands-on experimentation

Interpersonal skills and experience with cross-group and cross-cultural collaboration

Familiarity with computer system architecture, including cache and memory hierarchies