Senior Software Engineer - Data Governance & Machine Learning

Bloomberg
New York

About the job

Senior Software Engineer - Data Governance & Machine Learning. A core focus of this role is designing and scaling systems that power data discovery, sensitive data detection, metadata enrichment, and governance automation across the enterprise. This role is ideal for a hands-on engineer who applies ML and AI to real-world data governance problems and cares about trust, security, and explainability.

Responsibilities

Proactively drive the vision for Data Classification/Inventory/Catalog across multiple business areas, and define and execute on a plan to achieve that vision

Leverage data governance tools, configure and make them self-service for users

Design, build, and maintain ML-powered services and pipelines for automated data classification, tagging, and inventory

Develop integrations with metadata catalogs, data quality tools, and internal platforms to enrich and expose governance metadata

Apply ML and AI techniques to detect sensitive data (e.g., PII, PHI), map lineage, and support entity resolution

Collaborate with data stewards, governance leads, and platform teams to translate business requirements into scalable, automated solutions

Influence the architecture and evolution of AI-enabled governance tooling

Provide technical leadership and mentorship to junior engineers

Collaborate and build cross functional relationships with Product Owners, Data Engineers and Software Engineers to understand business and deliver on those needs.

Embrace agile framework with iterative product development and continuous improvement mindset.

Stay up to date with market trends, bring new ideas on-board and evaluate tooling for future needs.

Qualifications

Minimum

5+ years of experience building and leveraging data governance tools and platforms.

5+ years of experience using Python, JavaScript/TypeScript, Java, or Scala

Familiarity with AI driven data catalogs or metadata platforms (e.g., Collibra, Alation, Atlan, Purview)

Hands-on experience applying ML or NLP techniques in production (e.g., classification, clustering, NER)

Solid understanding of data governance concepts, including metadata, lineage, and data cataloging

Experience with cloud data platforms (e.g., AWS, GCP, Azure) and distributed systems

Preferred

Knowledge of compliance standards (e.g., GDPR, HIPAA, SOX)

Experience working in regulated industries

Exposure to knowledge graphs, semantic modeling