Publications
- Information-Theoretic Distillation for Reference-less Summarization (arXiv:2403.13780)
- A Roadmap to Pluralistic Alignment (ICML 2024)
- Impossible Distillation: From Low-Quality Model to High-Quality Dataset & Model for Summarization and Paraphrasing (NAACL 2024)
- JAMDEC: Unsupervised Authorship Obfuscation using Constrained Decoding over Small Language Models (NAACL 2024)
- Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement (ICLR 2024, Oral)
- The Generative AI Paradox: 'What It Can Create, It May Not Understand' (ICLR 2024)
- The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning (ICLR 2024)
- Tailoring Self-Rationalizers with Multi-Reward Distillation (ICLR 2024)
- Improving Language Models with Advantage-Based Offline Policy Gradients (ICLR 2024)
- Value Kaleidoscope: Engaging AI with Pluralistic Human Values, Rights, and Duties (AAAI 2024)
- Faith and Fate: Limits of Transformers on Compositionality (NeurIPS 2023)
Research Experience
Research projects include: Faith and Fate (exploring the fundamental limits of Transformer language models on compositional tasks), the Generative AI Paradox (proposing and testing the hypothesis that generative models can produce outputs they may not themselves understand), NeuroLogic Decoding, NeuroLogic A*esque Decoding, Quark, and Inference-Time Policy Adapters.
Education
Ph.D. candidate at the University of Washington, advised by Professor Yejin Choi; B.S. in Computer Science from the University of Washington.
Background
Research interests: understanding the boundaries of machine intelligence and bridging the capability gap between models and humans. Focused on studying the capabilities and limits of language models, as well as developing learning and inference algorithms to unlock capabilities in smaller models.
Miscellany
Personal links: Google Scholar, Twitter, GitHub, CV, Research Statement