- 2025/02 One paper accepted to IEEE IoT-J 2025.
- 2025/01 Won the Best Paper Award at HICSS 2025.
- 2024/12 One paper accepted to EuroSys 2025.
- [EuroSys'25] Towards VM Rescheduling Optimization Through Deep Reinforcement Learning
- [IoT-J'25] A Safe and Data-efficient Model-based Reinforcement Learning System for HVAC Control
- [HICSS'25] Deepot: Parking Lot Identification Using Low-Resolution Satellite Imagery (Best Paper Award)
- [TOSN'24] Optimizing Irrigation Efficiency Using Deep Reinforcement Learning in the Field
Research Experience
- Advanced AI Research Scientist at Accenture, focusing on multi-agent systems, LLM orchestration, and scalable AI infrastructure.
- Postdoctoral Researcher at Lawrence Berkeley National Laboratory (LBNL), where I developed play-verl, a VERL-based reinforcement learning benchmark that evaluates the PPO and GRPO algorithms on Qwen models using distributed multi-GPU training. Also worked on LLM fine-tuning for freight infrastructure and on large-scale EV simulations.
Education
Ph.D. in Computer Science and Engineering, University of California, Merced (UCM).
Background
Research Interests: Large Language Models (LLMs) and Multi-Agent Systems; Reinforcement Learning and Reinforcement Learning from Human Feedback (RLHF); Distributed Training and Scalable AI Infrastructure; Machine Learning for Systems and Resource Optimization.