About the job
TikTok's Server Platform team is responsible for architecting, designing, and building the best server and storage systems to meet the requirements of high performance, low cost, and ease of operation. By joining this team, you will work with the best engineers and talent in the industry and have broad opportunities to work with the latest AI application systems and emerging technologies in computing, storage, and silicon validation. You will gain hardware architecture, development, and validation experience on the most advanced hardware infrastructure at massive scale.
Responsibilities
Develop application benchmarks, tools, and performance optimization methods for GPU/AI systems.
Identify system bottlenecks and opportunities through deep, data-driven, system-level studies, explore innovative options through SW-HW co-design, and lead them toward implementation.
Develop a total cost of ownership (TCO) model for GPU/AI systems, based on application benchmarks and performance optimization.
Work with industry consortiums and open standards committees to investigate emerging standards and technologies, and contribute our research results to the industry.
Work with our technology partners and suppliers to set up proofs of concept (POCs) or prototypes to evaluate and test new technologies or architectural designs.
Qualifications
Minimum
Must be able to commit to a 12-week full-time work period during Fall 2026.
Thesis in GPU/AI platform architecture, application performance optimization design, or software-hardware co-design.
Deep understanding of computer system architecture, especially GPU/AI SoC or platform architecture, interconnect fabrics, and memory subsystems.
Experience in GPU/AI system application performance optimization or software-hardware co-design.
Strong knowledge of and proficiency in software development in C/C++ and scripting languages such as Python.
Understanding of the implementation of GPU/AI virtualization technology, deep learning architectures, and distributed systems.
Preferred
No preferred qualifications listed.