Staff Backend Engineer, Global Online Data Store Platform

TikTok
San Jose, California

About the job

The TikTok Datastore Service team designs, builds, and governs the core online storage infrastructure that powers our global products. Our mission is to provide data services with ultimate performance, reliability, and intelligence for hundreds of millions of users and countless business scenarios worldwide. Here, you will tackle world-class challenges in globalization, multi-region active-active architecture, compliance, and cost efficiency, while leveraging AI to drive the evolution of next-generation storage architecture.

Responsibilities

- Architect, develop, and evolve TikTok's global online data store platform (including databases, caches, and data synchronization), focusing on global business scenarios to solve core challenges such as multi-region deployment, active-active architecture, data consistency, compliance, and cost optimization.

- Pioneer the innovation and implementation of AI in the storage domain, building core AI skills for online data storage. You will explore the deep integration of AI with storage systems (e.g., AI-assisted data insights, AI-assisted architecture governance, anomaly detection) and advance the application of AIOps in large-scale storage systems.

- Act as a subject-matter expert on storage, collaborating closely with business teams and architects. Provide professional technical consultation, and design and drive the implementation of global data storage solutions that meet business needs while ensuring high performance, availability, and consistency.

Qualifications

Minimum

- Bachelor's degree or higher in Computer Science or a related field, with 5+ years of experience in backend development or infrastructure engineering.

- Proficiency in at least one mainstream programming language (e.g., Go, C++, Java, Python) with strong coding skills, a commitment to clean code, and an AI-Native coding mindset.

- Deep expertise in at least one mainstream online storage technology (e.g., MySQL, Redis, HBase) with hands-on experience in large-scale online applications.

- Solid foundation in distributed systems theory, with a deep understanding of core principles like distributed consistency, high availability, and scalability. - Proven ability to design and deliver complex systems.

Preferred

- Experience with global infrastructure projects, including multi-region deployment, cross-region data synchronization, and data compliance governance.

- Practical experience with AI-Native governance, AIOps, or AI-assisted system optimization.

- Familiarity with emerging paradigms like Agentic Architecture, or experience in designing/implementing agent-based systems.

- A strong passion for technology and a curiosity-driven mindset, with excellent problem-solving, communication, and teamwork skills.

- Experience in designing and developing core modules of large-scale distributed systems (e.g., storage, messaging queues, distributed scheduling) is a significant plus.