Senior Data Scientist (NLP and GenAI Specialist)

Morgan Stanley
Dallas, Texas, United States of America2026-02-02Full time

About the job

We're seeking someone to join our team as a Senior Data Scientist (NLP Specialist) in Non-Financial Risk to develop and deploy advanced AI models using NLP, Machine Learning, and quantitative techniques to improve surveillance and compliance monitoring. In the Legal & Compliance division, we assist the Firm in achieving its business objectives by facilitating and overseeing the Firm's management of legal, regulatory, and franchise risk. This is a Vice President level position within the NFR Data & Analytics team, which is responsible for designing and optimizing surveillance models and tools using advanced analytical techniques to help identify suspicious and/or illegal behaviors like money laundering, market manipulation, insider trading, unfair sales or trading practices and other financial crimes.

Responsibilities

Design high-performance systems leveraging cutting-edge techniques including Large Language Models (LLMs), Retrieval Augmented Generation (RAG), Agentic AI architecture, and knowledge graph analytics.

Create GenAI-based solutions to automate manual tasks and drive cost efficiency.

Conduct research to identify novel methods for enhancing analytical solutions.

Lead and develop junior data scientists to achieve key business objectives

Collaborate with Compliance, Legal, Financial Crimes, and IT stakeholders to champion the adoption of new AI/ML/NLP approaches, techniques and capabilities

Champion innovative approaches to improve detection of suspicious activity.

Qualifications

Minimum

Master's or PhD degree in Computer Science, Machine Learning, Intelligent Systems, Statistics, Mathematics, Engineering or other highly quantitative fields

10+ years of hands-on industry experience in building AI/ML/NLP solutions and applied statistical analysis to solve complex business problems

Knowledge of software design and system principles and excellent skills in either Python (preferred) or Java

Experience with AI/ML/NLP software packages such as LangChain, LangGraph, Semantic Kernel, CrewAI, OpenAI SDK, PyTorch, HuggingFace, etc.

Experience in adhering to Software Development Life Cycle (SDLC) principles include GIT related operations

Strong problem solving and time management skills

Excellent written and oral communication skills

Familiarity with financial markets, especially in Compliance, Non-Financial Risk, and Fraud analytics

Preferred

Experience with benchmark creation and evaluation including LLM-as-a-Judge based techniques

Knowledge of Vector Stores, Linux, SPARQL, and Graph Databases