About the job
Amazon Advertising is one of Amazon's fastest growing and most profitable businesses, responsible for defining and delivering AI-powered solutions that transform how advertisers make strategic decisions. We deliver billions of ad impressions and process massive volumes of advertiser data every single day. You'll work with us to pioneer breakthrough approaches in how AI agents access and reason over real-time advertiser data at scale. We are using generative AI and agentic systems to help advertising agents provide instant, strategic advice to millions of advertisers. You will need to invent new techniques for agent orchestration, context optimization, and code generation to ensure we're delivering accurate, trustworthy insights with minimal latency and token consumption. You'll create feedback loops to ensure our solutions are constantly evaluating themselves and improving. The Ads Real-Time Data Service team is seeking an exceptional Applied Scientist to research and develop novel approaches for agent-data interaction. The Ads Real-Time Data Service team is solving one of the most critical challenges in advertising AI: instant access to advertiser context. We're building the infrastructure that provides immediate, pre-computed access to advertiser data via Model Context Protocol (MCP) servers—an emerging standard for AI agent-data interaction. We're building summarized data for context using a mix of state of the art techniques like CodeAct and RAG-based embeddings, achieving a fundamental transformation in how AI agents interact with data. This role balances applied research (60%) with productionization (40%), giving you the opportunity to both advance the state of the art and see your innovations deployed at Amazon scale.
Responsibilities
Research and develop novel algorithms for agent-data interaction patterns that minimize latency, token consumption, and error rates
Investigate multi-agent orchestration strategies for complex advertiser queries requiring data from multiple sources
Develop techniques for automatic query optimization and caching strategies based on agent behavior patterns
Invent new methods for compressing advertiser context representations while preserving semantic meaning and analytical utility
Research optimal metadata generation techniques that help large language models understand and reason over structured advertiser data
Design evaluations to measure the impact of different data representations on agent response quality and token efficiency
Develop adaptive context selection algorithms that dynamically choose relevant data based on query intent
Pioneer new RAG-based embedding approaches optimized for real-time advertiser data delivery with sub-second latency
Research and implement semantic search and retrieval techniques for advertiser datasets using vector embeddings
Design advertiser context frameworks that enable automatic schema mapping from advertiser concepts to data representations
Develop evaluation frameworks to measure performance across dimensions of latency, accuracy, and developer experience
Design and execute rigorous experiments comparing traditional API orchestration versus CodeAct patterns and RAG-based approaches across metrics like success rate, latency, token consumption, and response quality
Analyze large-scale advertiser interaction data to identify patterns, bottlenecks, and optimization opportunities
Collaborate with engineering teams to productionize research innovations and deploy them to 30+ advertising agents and skills
Establish evaluation metrics and benchmarks for agent-data interaction performance
Partner with agent builder teams to understand their data requirements and constraints
Work with platform engineers to implement and optimize MCP servers, data pipelines, and sandbox execution environments
Collaborate with product managers to translate research insights into product features and roadmap priorities
Stay current on latest advancements in agentic AI research, specifically in large language models, multi-agent systems, chain of thought reasoning, and autonomous agents
Author technical papers for top-tier conferences on agent orchestration, context optimization, RAG-based embeddings, and real-time data integration
File patents for novel techniques in agent-data interaction, token optimization, and CodeAct patterns
Present research findings at internal tech talks and external conferences
Mentor engineers and junior scientists on machine learning techniques, experimental design, and research methodologies
Qualifications
Minimum
- PhD in Computer Science, Machine Learning, Artificial Intelligence, Statistics, or a related quantitative field, or equivalent experience
- 5+ years of experience applying machine learning, deep learning, or natural language processing to solve business problems
- Experience designing, building, and deploying production ML systems at scale
- Experience with large language models (LLMs), generative AI, or agentic AI systems
- Experience with RAG (Retrieval-Augmented Generation) techniques and vector embeddings
- Experience with experimentation, A/B testing, and causal inference
- Strong programming skills in Python and experience with ML frameworks (e.g., PyTorch, TensorFlow)
- Proven track record of publishing research in top-tier conferences (e.g., NeurIPS, ICML, ACL, KDD)
Preferred
- Experience with Model Context Protocol (MCP) servers or similar agent-data interaction standards
- Experience with CodeAct patterns or other agentic execution frameworks
- Experience building real-time data services or low-latency data infrastructure
- Experience with semantic search, vector databases, or similarity search libraries
- Experience with advertiser data, digital advertising, or e-commerce domains
- Experience mentoring junior scientists or engineers
- Experience filing patents or contributing to open-source AI/ML projects