Scholar
Ido Hakimi
Google Scholar ID: N0EnDYsAAAAJ
Google Research
NLP
Optimization
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
229
H-index
7
i10-index
7
Publications
19
Co-authors
0
Contact
No contact links provided.
Publications
10 items
ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning
2026
Cited
0
RewardUQ: A Unified Framework for Uncertainty-Aware Reward Models
2026
Cited
0
Reinforcement Learning via Self-Distillation
2026
Cited
6
Learning on the Job: Test-Time Curricula for Targeted Reinforcement Learning
2025
Cited
0
Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
2025
Cited
0
Maximizing Prefix-Confidence at Test-Time Efficiently Improves Mathematical Reasoning
2025
Cited
0
From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement Learning
2025
Cited
0
Local Mixtures of Experts: Essentially Free Test-Time Training via Model Merging
2025
Cited
0
Load more
Resume (English only)
Co-authors
0 total
Co-authors: 0 (list not available)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up