Published several papers, including 'ManagerBench: Evaluating the Safety-Pragmatism Trade-Off in Autonomous LLMs' and 'Planted in Pretraining, Swayed by Finetuning: A Case Study on the Origins of Cognitive Biases in LLMs'.
Research Experience
Interned at Meta AI, studying attention dynamics in translation models. Co-organized the GEM workshop at ACL 2025. Active contributor to the EvalEval coalition, which aims to standardize and compare evaluation outputs across frameworks.
Education
PhD candidate at the Technion, co-advised by Yonatan Belinkov (Technion) and Gabriel Stanovsky (the Hebrew University of Jerusalem). Completed an M.Sc. at Tel Aviv University under Omer Levy, investigating how token-level spelling information is encoded in embedding matrices.
Background
Interested in evaluating and interpreting large language models (LLMs), with a particular focus on their reasoning and decision-making processes, including failure modes that reveal human-like cognitive biases. Combines behavioral and representational analyses to understand model tendencies and how pretraining and fine-tuning shape them.
Miscellany
Open to collaboration and always happy to discuss research, language models, and everything in between. Feel free to reach out via email!