Scholar

Junhong Shen

Google Scholar ID: M561o6QAAAAJ

Ph.D. student in Machine Learning, Carnegie Mellon University

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

512

H-index

i10-index

Publications

Co-authors

list available

Contact

Emailjunhongs@andrew.cmu.edu CVOpen ↗TwitterOpen ↗GitHubOpen ↗

Publications

8 items

ARFBench: Benchmarking Time Series Question Answering Ability for Software Incident Response

2026

Cited

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

2026

Cited

RECODE: Reasoning Through Code Generation for Visual Question Answering

2025

Cited

Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction

2025

Cited

CodePDE: An Inference Framework for LLM-driven PDE Solver Generation

2025

Cited

Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity

2025

Cited

CAT: Content-Adaptive Image Tokenization

2025

Cited

Specialized Foundation Models Struggle to Beat Supervised Baselines

arXiv.org · 2024

Cited

Resume (English only)

Academic Achievements

Paper 'Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction' accepted at NeurIPS 2025; paper 'CAT: Content-Adaptive Image Tokenization' accepted at NeurIPS 2025; paper 'Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity' presented orally at ICLR 2025 Scalable Optimization for Efficient and Adaptive Foundation Models Workshop; Tag-LLM work accepted at ICML 2024.

Research Experience

Interned at DeepMind working on a code generation agent for visual reasoning; interned at FAIR, contributing to Content-Adaptive Tokenizer (CAT) and Multi-Modal Mixture-of-Mamba projects; developed ScribeAgent web agents based on open-source LLMs.

Education

Ph.D. student in the Machine Learning Department at Carnegie Mellon University, advised by Ameet Talwalkar; B.S. in Mathematics of Computation from UCLA, where he worked with Lin Yang on sample-efficient reinforcement learning, and also worked on multi-agent RL and Theory of Mind, advised by Song-Chun Zhu and Ying Nian Wu.

Background

Research interests include enhancing LLMs' interaction with real-world applications, particularly building multi-modal models and agent systems that operate in real-world environments such as browsers, command lines, and IDEs. Also interested in enhancing LLMs' abilities to model diverse data types and applying them to long-tail, low-resource domains like science and business.

Miscellany

Seeking research scientist positions starting October 2025; supported by J.P. Morgan AI PhD Fellowship; organized the CMU Agent Workshop; participated in organizing the 2022 AutoML Decathlon.

Co-authors

14 total