About the job
We are building a planet-scale multi-modal database and infrastructure for executing agents (including orchestration and memory) from the ground up. You will be part of the team that is reimagining databases in the era of Large Language Models (LLMs) by deeply integrating Artificial Intelligence (AI) into all parts of the stack. You will lead and collaborate with a team of passionate engineers, driving ideas to impactful results in a fast-paced environment. You will be responsible for operating the service for some of the largest enterprise customers. You will work on operations, live site, deployment, monitoring, compliance, and alerting, and on maintaining the Service Level Agreement (SLA) for our service.
Responsibilities
This is an individual contributor role requiring hands-on coding in C++, C#, or Java.
Execute independently in the face of ambiguity.
Lead the identification of dependencies and the development of design documents for a product, application, service, or platform.
Write efficient systems code and debug distributed systems.
Hold accountability as a Designated Responsible Individual (DRI): mentor engineers across products and solutions, and work on-call to monitor the system, product, or service for degradation, downtime, or interruptions.
Qualifications
Minimum
Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C++, C#, or Java.
OR equivalent experience.
Preferred
Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C++, C#, or Java.
OR equivalent experience.
Experience in shipping products and scalable, reliable services.
Hands-on programming in your current or most recent role.
Hands-on experience with asynchronous programming and concurrency (threads, tasks, futures, async/await).
Experience with Azure Kubernetes Service (AKS), Amazon Elastic Kubernetes Service (EKS), and/or Google Kubernetes Engine (GKE).
Experience in building database engines, query engines, or indexing solutions (columnar, full-text, vector) at scale.
Experience with CUDA programming and AI systems at scale.
Experience in live site operations, Site Reliability Engineering (SRE), or production support roles.
Experience with Helm.