About the job
The Principle Event Data Analyst (AI & Advanced Analytics) is responsible for leading AI driven investigation and analysis of workplace incidents and data events using a systematic, analytics first approach. This role focuses on identifying impacted data at scale through automation, machine learning, and advanced analytics, while collaborating with cross functional teams including security, legal, privacy, and engineering. This role plays a key part in building and evolving an AI enabled investigative data platform, leveraging modern data lake architectures, distributed processing, and production AI/ML pipelines to accelerate insight generation and decision making.
Responsibilities
Lead AI assisted analysis of data events and incident investigations using structured and unstructured data
Design and implement automated pipelines to ingest, normalize, deduplicate, enrich, and classify large scale datasets
Apply machine learning, entity resolution, and data fingerprinting techniques to identify impacted individuals and data elements
Build and maintain production analytics solutions using Python, PySpark, and Databricks
Develop and deploy AI driven workflows (including Generative AI use cases) to improve investigative speed, accuracy, and scalability
Secure, preserve, and validate data sources to maintain analytical integrity and evidentiary standards
Partner with security, legal, and privacy teams to translate complex analytical findings into clear, actionable insights
Create dashboards, visualizations, and analytical summaries for executive and technical stakeholders
Operate effectively in time sensitive environments while maintaining analytical rigor, objectivity, and confidentiality
Contribute to the evolution of Optum's AI and analytics capabilities within the cybersecurity and data protection space
Qualifications
Minimum
Bachelor's degree or 6+ years of equivalent professional experience
5+ years of experience in advanced data analytics, data science, or AI focused analytical roles
5+ years of experience developing, testing, and deploying production grade Python code
3+ years of experience with DevOps and CI/CD tooling (GitHub Actions, Docker, Kubernetes, Terraform)
3+ years of experience with distributed data processing using PySpark and Databricks
Solid expertise in SQL, data modeling, and data integration
Experience working with data lake architectures and medallion schemas
Proven ability to translate analytical outputs into business and risk based insights
Preferred
Advanced degree in Data Science, Computer Science, or a related field
Hands on experience with machine learning, predictive analytics, and classification models
Experience delivering AI driven analytics within a cybersecurity or incident response environment
Experience deploying Generative AI solutions into production analytics workflows
Experience with entity resolution, MDM technologies, or identity matching
Demonstrated familiarity with cloud platforms such as AWS, Azure, or Google Cloud
Experience building AI models for file classification, data fingerprinting, or anomaly detection
Demonstrated familiarity with eDiscovery platforms and large scale data investigations
Proven solid understanding of modern analytics platforms and scalable data architectures