🤖 AI Summary
Learners with Roman-alphabet backgrounds face significant challenges in memorizing Chinese characters. Method: This paper proposes an interpretable keyword-based mnemonic generation framework. Departing from black-box large language model (LLM) approaches, it formalizes mnemonic construction as a latent-variable rule-based process grounded in character components and semantic associations, and introduces an EM-style iterative algorithm to automatically discover high-frequency, generalizable mnemonic structures and compositional rules from online learners’ generated content. The method explicitly decouples component decomposition, semantic mapping, and associative generation—enhancing transparency and pedagogical credibility, especially in cold-start scenarios. Contribution/Results: Experiments demonstrate that the generated mnemonics outperform baselines in both accuracy and interpretability, while uncovering key cognitive mechanisms underlying effective mnemonic design.
📝 Abstract
Learning Japanese vocabulary is a challenge for learners from Roman alphabet backgrounds due to script differences. Japanese combines syllabaries like hiragana with kanji, which are logographic characters of Chinese origin. Kanji are also complicated due to their complexity and volume. Keyword mnemonics are a common strategy to aid memorization, often using the compositional structure of kanji to form vivid associations. Despite recent efforts to use large language models (LLMs) to assist learners, existing methods for LLM-based keyword mnemonic generation function as a black box, offering limited interpretability. We propose a generative framework that explicitly models the mnemonic construction process as driven by a set of common rules, and learn them using a novel Expectation-Maximization-type algorithm. Trained on learner-authored mnemonics from an online platform, our method learns latent structures and compositional rules, enabling interpretable and systematic mnemonics generation. Experiments show that our method performs well in the cold-start setting for new learners while providing insight into the mechanisms behind effective mnemonic creation.