Scholar

Guojun Xiong

Google Scholar ID: FIBwLnoAAAAJ

Harvard University, Department of Computer Science

Reinforcement learningRestless banditsNetworkingFinancial Agent

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

590

H-index

i10-index

Publications

Co-authors

list available

Contact

GitHubOpen ↗

Publications

21 items

Social Networks of LLM Agents

2026

Cited

Autoregressive Diffusion World Models for Off-Policy Evaluation of LLM Agents

2026

Cited

Concordia: Self-Improving Synthetic Tables for Federated LLMs

2026

Cited

Moira: Language-driven Hierarchical Reinforcement Learning for Pair Trading

2026

Cited

Health Facility Location in Ethiopia: Leveraging LLMs to Integrate Expert Knowledge into Algorithmic Planning

2026

Cited

FinCriticalED: A Visual Benchmark for Financial Fact-Level OCR Evaluation

2025

Cited

MeCaMIL: Causality-Aware Multiple Instance Learning for Fair and Interpretable Whole Slide Image Diagnosis

2025

Cited

When Agents Trade: Live Multi-Market Trading Benchmark for LLM Agents

2025

Cited

Resume (English only)

Academic Achievements

2025: Granted a USPTO patent on 'Next-generation Smart Agriculture Networks'.
2025: Paper accepted as Spotlight at NeurIPS 2025; additional papers accepted at ACL 2025, ACM SIGMETRICS 2025 Workshop, ICLR 2025 Workshop, IEEE ICC 2025, AAMAS 2025, AAAI 2025 (including Innovative Applications track), etc.
2024: Two papers accepted at NeurIPS 2024 (acceptance rate: 25.8%); one at AAAI 2025 (23.4%); one at AAMAS 2025 (24.5%); publications in IEEE Transactions on Wireless Communications, IEEE/ACM Transactions on Networking, AIoT 2024, RLC 2024 Workshop, etc.
2024: Selected as Top Reviewer (top 8%) at NeurIPS 2024.
2024: Awarded travel grant by Meta for the 1st RL Conference; received Excellence in Research Award from the Data Science Program at Stony Brook University.
Serving as PC member for multiple top-tier conferences: KDD 2025, IJCAI 2025, ICML 2025, AISTATS 2025, ICLR 2025, COLING 2025, AAAI 2024, etc.
2025: Delivered invited plenary talk at NSF AI Institute for Societal Decision Making; invited talk at SIAM Conference on Financial Mathematics & Engineering (FM25).

Background

Postdoctoral fellow in the Department of Computer Science at Harvard University, affiliated with Teamcore, hosted by Prof. Milind Tambe.
Motivated by complex, resource-constrained sequential decision-making problems under uncertainty.
Research objective is to advance reinforcement learning (RL) through innovative structured RL frameworks that leverage inherent problem structures to improve sample efficiency and accelerate learning.
Focuses on addressing key challenges in both model-based and model-free RL, especially in environments with multiple coupled Markov Decision Processes (MDPs).
Aims to develop RL algorithms with provably sub-linear regret guarantees.
Seeks real-world impact and scalability of structured RL in applications such as edge/cloud computing, cloud caching, wireless video streaming, and healthcare.
Primary research interests include: online sequential decision-making under uncertainty, stochastic optimization and control, finite-time convergence and regret analysis for RL, decentralized optimization, multi-agent RL for networked decision-intelligent systems, and broader societal impacts in public health and social good.

Co-authors

7 total