Research Experience
Studies modern foundation models through four pillars: Learning, Storing, Computing, and Universality. Pursues research through the lenses of neuroscience, statistics, information, and computation.
Research includes: connecting attention and modern Hopfield networks, developing plug-in associative memory modules for retrieval and editing, studying in-context learning via internal algorithm execution, and characterizing the universality of minimalist Transformers and prompt-based algorithm emulation.
Education
PhD candidate in Computer Science at Northwestern University, advised by Han Liu; B.S. in Physics from National Taiwan University, advised by Pisin Chen.
Background
Research interests: Theoretical foundations and principled methodologies for large foundation models (e.g., Large Language Models and Generative AI). Long-term goal is to leverage machine learning to tackle important scientific and societal challenges.
Miscellany
Engages in interdisciplinary research collaborations, including Particle Physics at Fermilab, Drug Design at AbbVie, Finance at Gamma Paradigm Capital, and NdLinear & NdLinear-LoRA at Ensemble AI. Holds two hours of office hours weekly for individual support, welcoming students from underrepresented groups to schedule a chat.