SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety (Under review at NeurIPS 2025)
Online Pre-Training for Offline-to-Online Reinforcement Learning (ICML 2025)
Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection (arXiv preprint, under review, 2025)
Prospector: Improving LLM Agents with Self-Asking and Trajectory Ranking (EMNLP Findings 2024)
Semantic Skill Grounding for Embodied Instruction-Following in Cross-Domain Environments (ACL Findings 2024)
Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without Degeneration (ICML 2024)
Show, Think, and Tell: Thought-Augmented Fine-Tuning of Large Language Models for Video Captioning (CVPR Workshop 2024)
SafeDICE: Offline Safe Imitation Learning with Non-Preferred Demonstrations (NeurIPS 2023)