π€ AI Summary
This study addresses the critical challenge of personalizing large language models to accurately emulate individual judgesβ judicial reasoning styles under low-resource conditions. The authors propose a novel synthetic-organic supervision pipeline that automatically transforms raw case law into high-quality instruction-tuning data. By integrating causal language modeling, parameter-efficient fine-tuning, and structured processing of legal texts, the approach enables judge-level personalization in a Hebrew-language setting. Experimental results demonstrate that the method significantly outperforms existing techniques across three evaluation tasks, generating judicial reasoning that is semantically, stylistically, and lexically indistinguishable from that of human judges.
π Abstract
Despite significant advances in large language models, personalizing them for individual decision-makers remains an open problem. Here, we introduce a synthetic-organic supervision pipeline that transforms raw judicial decisions into instruction-tuning data, enabling parameter-efficient fine-tuning of personalized models for individual judges in low-resource settings. We compare our approach to state-of-the-art personalization techniques across three different tasks and settings. The results show that Causal Language Modeling followed by synthetically generated instruction-tuning significantly outperforms all other baselines, providing significant improvements across lexical, stylistic, and semantic similarity. Notably, our model-generated outputs are indistinguishable from the reasoning of human judges, highlighting the viability of efficient personalization, even in low-resource settings.