Currently a Software Engineer at Databricks and an Adjunct Assistant Professor at UMass Amherst.
Research focuses on big data management, data-processing systems, and machine learning systems.
Takes a systems-driven approach by co-designing key components of modern data-intensive pipelines, including workflow engines, UDF debugging frameworks, pipelining optimizers, and ML acceleration systems for streaming data.
Integrates techniques from data management, distributed systems, program analysis, and machine learning to optimize performance, usability, and scalability.
Has contributed extensively to the Texera project (Apache Incubating), a collaborative and interactive workflow system for data science and AI/ML.