Scholar
Guijin Son
Google Scholar ID: Zf_eLDsAAAAJ
Undergraduate, Yonsei University
Natural Language Processing
Large Language Models
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
275
H-index
9
i10-index
9
Publications
20
Co-authors
5
list available
Contact
CV
Open ↗
GitHub
Open ↗
LinkedIn
Open ↗
Publications
25 items
LaRA: Layer-wise Representation Analysis for Detecting Data Contamination in RL Post-Training
2026
Cited
0
ResearchMath-14K: Scaling Research-Level Mathematics via Agents
2026
Cited
0
Self-Improving CAD Generation Agents with Finite Element Analysis as Feedback
2026
Cited
0
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs
2026
Cited
0
Pushing the Boundaries of Multiple Choice Evaluation to One Hundred Options
2026
Cited
0
KMMMU: Evaluation of Massive Multi-discipline Multimodal Understanding in Korean Language and Context
2026
Cited
0
Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math
2026
Cited
0
What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models
arXiv.org · 2026
Cited
1
Load more
Resume (English only)
Academic Achievements
2025: Papers 'KMMLU-Pro' and 'Multi-LMentry' accepted to EMNLP 2025
2025: 'Linguistic Generalizability' (Oral) and 'FinKRX' (Industry Track) accepted to ACL 2025
2025: 'Robustness of Reward Models' accepted to ICML 2025
2025: 'BiGGen Bench' and 'KMMLU' accepted to NAACL 2025
2025: 'BiGGen Bench' awarded Best Paper at NAACL 2025
2024: 'Multitask Inference' accepted to ACL 2024
2024: 'HAE-RAE Bench' accepted to LREC-COLING 2024
Multiple preprints under review on multilingual reasoning, scientific verification benchmark (SPOT), and multilingual meta-evaluation (MM-Eval)
Background
Co-Founder at OneLine AI
Lead of HAE-RAE, an open-source research group focused on Korean NLP
Research interests: AI for Science, evaluation and reasoning with language models, multimodal reasoning, and agentic systems
Current goal: building stronger reasoning models and developing metrics to demonstrate real progress
Past projects include Korean knowledge and professional benchmarks, reward model evaluation, and financial applications of LLMs
Teaches at Fast Campus and SSAFY; contributes to curriculum development with Codeit and Code States; mentors at Upstage
Co-authors
5 total
Seungone Kim
Carnegie Mellon University
Stella Biderman
EleutherAI
Niklas Muennighoff
Stanford University
Jiwoo Hong
KAIST AI
James Thorne
KAIST
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up