- DLaVA: Document Language and Vision Assistant for Answer Localization with Enhanced Interpretability and Trustworthiness. CVPR, 2024 (Submitted)
- DocParseNet: Advanced Semantic Segmentation and OCR Embeddings for Efficient Scanned Document Annotation. ICML, 2024
- CrashFormer: A Multimodal Architecture to Predict the Risk of Crash. Proceedings of the 1st ACM SIGSPATIAL International Workshop on Advances in Urban-AI, 2023
- Novel Physics-Based Machine-Learning Models for Indoor Air Quality Approximations. The 9th SIGKDD International Workshop on Mining and Learning from Time Series, 2023
- Predicting Airborne Pollutant Concentrations and Events in a Commercial Building Using Low-Cost Pollutant Sensors and Machine Learning: A Case Study. Building and Environment, 2022
Research Experience
- Position: Senior Machine Learning Scientist
- Company: Flairsoft
- Time: Since May 2022
- Projects: Developed risk analysis models, speech-to-text systems, and document annotation tools
Education
- Degree: Ph.D. Candidate in Computer Science
- School: The Ohio State University
- Advisor: Prof. Rajiv Ramnath
- Time: Since August 2020
- Major: Computer Science
Background
- Research Interests: Machine Learning, Large Language Models (LLMs), Vision-Language Models (VLMs), Multimodal Systems, and Time Series Analysis
- Professional Field: Computer Science
- Introduction: Pursuing a Ph.D. in Computer Science at The Ohio State University, focusing on document processing and on bridging vision and language understanding.