Hylog: A Hybrid Approach to Logging Text Production in Non-alphabetic Scripts

πŸ“… 2026-01-25
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Traditional keyloggers struggle to capture the screen conversion processes of Input Method Editors (IMEs) for non-alphabetic scripts, creating a methodological gap in cognitive research on text generation. This study proposes a hybrid logging system that leverages modular open-source plugins to synchronously record keystroke events and rendered text within Microsoft Word and Google Chrome. For the first time, this approach enables fine-grained, dual-track synchronization of Latin character input, Chinese character candidate selection, and confirmation actions during IME-based typing. Evaluated in a Simplified Chinese translation task, the system successfully captures input details inaccessible to conventional tools, demonstrating its technical feasibility and offering a novel methodological and data foundation for investigating cross-linguistic cognitive mechanisms underlying text production.

Technology Category

Application Category

πŸ“ Abstract
Research keyloggers are essential for cognitive studies of text production, yet most fail to capture the on-screen transformations performed by Input Method Editors (IMEs) for non-alphabetic scripts. To address this methodological gap, we present Hylog, a novel hybrid logging system that combines analytical keylogging with ecological text logging for a more complete and finer-grained analysis. Our modular, open-source system uses plug-ins for standard applications (Microsoft Word, Google Chrome) to capture both keyboard output and rendered text, which a hybridizer module then synchronizes into a dual trace. To validate the system's technical feasibility and demonstrate its analytical capabilities, we conducted a proof-of-concept study where two volunteers translated a text into simplified Chinese. Hylog successfully captured keypresses and temporal intervals between Latin letters, Chinese characters, and IME confirmations -- some measurements invisible to traditional keyloggers. The resulting data enable the formulation of new, testable hypotheses about the cognitive restrictions and affordances at different linguistic layers in IME-mediated typing. Our plug-in architecture enables extension to other IME systems and fosters more inclusive multilingual text-production research.
Problem

Research questions and friction points this paper is trying to address.

keylogging
Input Method Editor
non-alphabetic scripts
text production
cognitive studies
Innovation

Methods, ideas, or system contributions that make the work stand out.

hybrid logging
input method editor (IME)
non-alphabetic scripts
keylogging
text production
πŸ”Ž Similar Papers
No similar papers found.
R
Roberto Crotti
Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milan, Italy
Giovanni Denaro
Giovanni Denaro
University of Milano-Bicocca
Software Testing - Software Analysis - Software Engineering
Z
Zhiqiang Du
Department of Interpreting and Translation, Alma Mater Studiorum University of Bologna, Bologna, Italy
R
Ricardo Munoz MartΓ­n
Department of Interpreting and Translation, Alma Mater Studiorum University of Bologna, Bologna, Italy