Zhenhong Zhou
Google Scholar ID: 6TuPwzMAAAAJ
Nanyang Technological University
Large Language Model
AI Safety
LLM Safety
Homepage
Google Scholar
Citations & Impact (all-time)
Citations: 248
H-index: 8
i10-index: 7
Publications: 19
Co-authors: 0
Contact
Email: zhouzhenhong@bupt.edu.cn
Twitter
GitHub
Publications (31 items)
SafeSeek: Universal Attribution of Safety Circuits in Language Models (2026). Cited: 0
Resource Consumption Threats in Large Language Models (2026). Cited: 0
MCPShield: A Security Cognition Layer for Adaptive Trust Calibration in Model Context Protocol Agents (2026). Cited: 0
Omni-Safety under Cross-Modality Conflict: Vulnerabilities, Dynamics Mechanisms and Efficient Alignment (2026). Cited: 0
RECUR: Resource Exhaustion Attack via Recursive-Entropy Guided Counterfactual Utilization and Reflection (2026). Cited: 0
From Helpfulness to Toxic Proactivity: Diagnosing Behavioral Misalignment in LLM Agents (2026). Cited: 0
RSA-Bench: Benchmarking Audio Large Models in Real-World Acoustic Scenarios (2026). Cited: 0
SEE: Signal Embedding Energy for Quantifying Noise Interference in Large Audio Language Models (2026). Cited: 0
Resume (English only)
Academic Achievements
ICLR 2025 (Oral, Top 1.8%): 'On the Role of Attention Heads in Large Language Model Safety'
ICML 2025: Paper accepted
ACL 2025: Paper accepted
EMNLP 2024: Two papers accepted, one as co-first author
AAAI 2024: 'Quantifying and Analyzing Entity-Level Memorization in Large Language Models'
arXiv preprint: 'Speak Out of Turn: Safety Vulnerability of Large Language Models in Multi-turn Dialogue'
Oct 2023: First-class Postgraduate Academic Scholarship, Beijing University of Posts and Telecommunications
Oct 2022: Second-class Postgraduate Academic Scholarship, Beijing University of Posts and Telecommunications