About the job
Design and build robust knowledge frameworks that enable AI systems to interact dynamically with structured and unstructured public health data and that support real-time insights and decision-making. Apply knowledge of base architecture design and implementation, data modeling for AI agents, memory and state management, and RAG pipeline design to facilitate multi-modal data integration, partnering closely with data engineers on data pipeline coordination. Support performance optimization of retrieval processes for public health use cases and define rigorous data quality and curation standards. Collaborate with AI teams and maintain thorough documentation of data architecture. Ensure compliance with data privacy standards, anonymization protocols, and ethical guidelines critical to public health data management. This position is located in Atlanta, GA.
Responsibilities
Design and build robust knowledge frameworks that enable AI systems to interact dynamically with structured and unstructured public health data and that support real-time insights and decision-making
Apply knowledge of base architecture design and implementation, data modeling for AI agents, memory and state management, and RAG pipeline design to facilitate multi-modal data integration
Partner closely with data engineers on data pipeline coordination
Support performance optimization of retrieval processes for public health use cases
Define rigorous data quality and curation standards
Collaborate with AI teams and maintain thorough documentation of data architecture
Ensure compliance with data privacy standards, anonymization protocols, and ethical guidelines critical to public health data management
Qualifications
Minimum
2+ years of experience in AI/ML, Generative AI, or knowledge-based systems
1+ years of experience with cloud platforms
Experience in knowledge management and semantic search
Experience with vector databases, including Azure AI Search, PostgreSQL pgvector, Pinecone, Weaviate, or FAISS
Experience in data engineering and scripting, including Python, SQL, and API-driven architectures
Knowledge of enterprise search technologies such as Elasticsearch and Solr, and dense vector search methodologies
Ability to obtain and maintain a Public Trust or Suitability/Fitness determination based on client requirements
Bachelor’s degree
Preferred
Experience developing AI Agents and multi-agent architectures
Experience with agent and orchestration frameworks, including LangChain, LlamaIndex, and Pydantic-AI
Experience with full-stack development across front-end and back-end systems
Experience designing APIs and microservices architectures, including FastAPI or Flask, and caching mechanisms, including Redis
Experience in data-centric AI/ML projects within healthcare, biomedical, or public health sectors
Experience working with modern data platforms, including Databricks, Apache Spark, and Snowflake
Experience in data governance, curation processes, and quality assurance practices within regulated health environments
Knowledge of cloud-based AI data platforms such as Azure Foundry, AWS Bedrock, or GCP Vertex AI
Possession of excellent communication skills for facilitating collaboration among technical teams, public health experts, and stakeholders