- Analogy-based Multi-Turn Jailbreak against Large Language Models, NeurIPS 2025
- Impact-driven Context Filtering For Cross-file Code Completion, COLM 2025
- Automated Red Teaming for Text-to-Image Models through Feedback-Guided Prompt Iteration with Vision-Language Models, ICCV 2025
- USD: NSFW Content Detection for Text-to-Image Models via Scene Graph, USENIX Security 2025
- TRUST-VLM: Thorough Red-teaming for Uncovering Safety Threats in Vision-Language Models, ICML 2025
- ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users, NeurIPS 2024
- EvilEdit: Backdooring Text-to-Image Diffusion Models in One Second, ACM MM 2024
- Boosting Black-box Attack to Deep Neural Networks with Conditional Diffusion Models, TIFS 2024
- Protecting Confidential Virtual Machines from Hardware Performance Counter Side Channels, DSN 2024
- BadEdit: Backdooring Large Language Models by Model Editing, ICLR 2024
- GuardHFL: Privacy Guardian for Heterogeneous Federated Learning, ICML 2023
- Multi-target Backdoor Attacks for Code Pre-trained Models, ACL 2023
- Clean-image Backdoor: Attacking Multi-label Models with Poisoned Labels
Research Experience
Currently a Research Fellow at the Digital Trust Centre, Nanyang Technological University, Singapore, working with Prof. Tianwei Zhang and Prof. Kwok-Yan Lam.
Education
- Ph.D., Nanyang Technological University, Advisor: Prof. Tianwei Zhang
- M.Eng., Tianjin University, Advisor: Prof. Jianye Hao
- B.Eng., University of Electronic Science and Technology of China
Background
Research Interests: Red-teaming and Evaluation of Foundation Models, Safety and Security of LLM-Based Autonomous Agents, Backdoor Attacks and Defenses in Deep Learning, Trustworthy AI.