Principal AI Data Analyst - Remote or Hybrid from MN or DC

Optum / UnitedHealth Group
Eden Prairie, MN / Remote from anywhere within the U.S. / Minneapolis, MN

About the job

The Principle Event Data Analyst (AI & Advanced Analytics) is responsible for leading AI driven investigation and analysis of workplace incidents and data events using a systematic, analytics first approach. This role focuses on identifying impacted data at scale through automation, machine learning, and advanced analytics, while collaborating with cross functional teams including security, legal, privacy, and engineering. This role plays a key part in building and evolving an AI enabled investigative data platform, leveraging modern data lake architectures, distributed processing, and production AI/ML pipelines to accelerate insight generation and decision making.

Responsibilities

Lead AI assisted analysis of data events and incident investigations using structured and unstructured data

Design and implement automated pipelines to ingest, normalize, deduplicate, enrich, and classify large scale datasets

Apply machine learning, entity resolution, and data fingerprinting techniques to identify impacted individuals and data elements

Build and maintain production analytics solutions using Python, PySpark, and Databricks

Develop and deploy AI driven workflows (including Generative AI use cases) to improve investigative speed, accuracy, and scalability

Secure, preserve, and validate data sources to maintain analytical integrity and evidentiary standards

Partner with security, legal, and privacy teams to translate complex analytical findings into clear, actionable insights

Create dashboards, visualizations, and analytical summaries for executive and technical stakeholders

Operate effectively in time sensitive environments while maintaining analytical rigor, objectivity, and confidentiality

Contribute to the evolution of Optum's AI and analytics capabilities within the cybersecurity and data protection space

Qualifications

Minimum

Bachelor's degree or 6+ years of equivalent professional experience

5+ years of experience in advanced data analytics, data science, or AI focused analytical roles

5+ years of experience developing, testing, and deploying production grade Python code

3+ years of experience with DevOps and CI/CD tooling (GitHub Actions, Docker, Kubernetes, Terraform)

3+ years of experience with distributed data processing using PySpark and Databricks

Solid expertise in SQL, data modeling, and data integration

Experience working with data lake architectures and medallion schemas

Proven ability to translate analytical outputs into business and risk based insights

Preferred

Advanced degree in Data Science, Computer Science, or a related field

Hands on experience with machine learning, predictive analytics, and classification models

Experience delivering AI driven analytics within a cybersecurity or incident response environment

Experience deploying Generative AI solutions into production analytics workflows

Experience with entity resolution, MDM technologies, or identity matching

Demonstrated familiarity with cloud platforms such as AWS, Azure, or Google Cloud

Experience building AI models for file classification, data fingerprinting, or anomaly detection

Demonstrated familiarity with eDiscovery platforms and large scale data investigations

Proven solid understanding of modern analytics platforms and scalable data architectures