Youngsoo Jang
Scholar

Youngsoo Jang

Google Scholar ID: 6EoBBggAAAAJ
UNIST
Reinforcement LearningLarge Language ModelDialogue System
Citations & Impact
All-time
Citations
479
 
H-index
8
 
i10-index
8
 
Publications
19
 
Co-authors
30
list available
Resume (English only)
Academic Achievements
  • SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety (Under review at NeurIPS 2025)
  • Online Pre-Training for Offline-to-Online Reinforcement Learning (ICML 2025)
  • Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection (arXiv preprint, under review, 2025)
  • Prospector: Improving LLM Agents with Self-Asking and Trajectory Ranking (EMNLP Findings 2024)
  • Semantic Skill Grounding for Embodied Instruction-Following in Cross-Domain Environments (ACL Findings 2024)
  • Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without Degeneration (ICML 2024)
  • Show, Think, and Tell: Thought-Augmented Fine-Tuning of Large Language Models for Video Captioning (CVPR Workshop 2024)
  • SafeDICE: Offline Safe Imitation Learning with Non-Preferred Demonstrations (NeurIPS 2023)