Paper 'VoCo-LLaMA: Towards Vision Compression with Large Language Models' accepted by CVPR 2025
Paper 'ATP-LLaVA: Adaptive Token Pruning for Large Vision Language Models' accepted by CVPR 2025
Paper 'Language-Aware Vision Transformer for Referring Segmentation' accepted by IEEE TPAMI (IF=20.8) in 2024; conference version presented at CVPR 2022