Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
IP-Prompter: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting. ACM SIGGRAPH (Conference Paper Track) 2025.
MotionCrafter: Plug-and-play Motion Guidance for Diffusion Models. IEEE Transactions on Visualization and Computer Graphics (2025).
CreativeSynth: Cross-Art-Attention for Artistic Image Synthesis with Multimodal Diffusion. IEEE Transactions on Visualization and Computer Graphics (2025).
Multi-turn Consistent Image Editing. IEEE/CVF International Conference on Computer Vision (ICCV) 2025.
Bridging Class Imbalance and Partial Labeling via Spectral-Balanced Energy Propagation for Skeleton-based Action Recognition. IEEE/CVF International Conference on Computer Vision (ICCV) 2025.
A Comprehensive Review of Few-Shot Action Recognition. International Journal of Computer Vision (2025).
Z-Magic: Zero-shot Multiple Attributes Guided Image Creator. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2025: 18390-18400.
B4M: Breaking Low-Rank Adapter for Making Content-Style Customization. ACM Transactions on Graphics 44(2): 21:1--21:17 (2025).
DiffStyler: Controllable Dual Diffusion for Text-Driven Image Stylization. IEEE Transactions on Neural Networks and Learning Systems 36(2): 3370 - 3383 (2025).
SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding. International Conference on Learning Representations (ICLR) 2025 (Spotlight).
Dance-to-Music Generation with Encoder-based Textual Inversion. ACM SIGGRAPH Asia (Conference Paper Track) 2024: 135:1-135:11.
Dance Montage through Style Transfer and Music Generation. ACM SIGGRAPH.
Background
Research Interests: Artificial Intelligence Generated Content (AIGC), Computational Visual Media, Computational Creativity. Position: Professor at the Institute of Automation, Chinese Academy of Sciences, and Researcher at the National Laboratory of Pattern Recognition.