Research Intern - AI Network Observability

Microsoft
San Francisco Bay area / New York City metropolitan area2025-12-05onsite

About the job

As a Research Intern in the Strategic Planning and Architecture (SPARC) group, you will contribute to the research, design, and development of tools to provide insights into multi-path network transports for large-scale Artificial Intelligence (AI) datacenter environments. Your work will focus on building high-performance tracing and analysis systems capable of capturing packet-level behavior at extremely high speeds (up to 800Gbps). These tools will enhance observability for next-generation transport protocols supporting AI workloads. The role offers opportunities to prototype solutions on real hardware and collaborate with engineers to improve reliability and strengthen the explainability of AI intra-datacenter networking.

Responsibilities

Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Engage early with their mentors to clearly formulate a plan of work for the 12 weeks of the Research Internship. Clearly and frequently document and communicate their progress, adjusting the plan as the project evolves. Show initiative and think unconventionally to derive creative and innovative solutions.

Qualifications

Minimum

Currently enrolled in a PhD program in Computer Science or a related STEM field. Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship. In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter.

Preferred

Applicants should demonstrate depth of knowledge in datacenter networking and systems research. Experience in high performance programming network data paths (e.g., using C++). Experience in RDMA and/or DPDK. Experience in RoCE, knowledge of TCP, UDP, IP, ethernet.