π€ AI Summary
Addressing the challenge of identifying and applying implicit community norms in academic writing, this paper proposes a corpus-driven real-time writing assistance method. The approach employs statistical modeling, context-sensitive sentence retrieval, and keyword frequency analysis to automatically uncover domain-specific section-level structural distributions and syntactic patterns; it further operationalizes corpus-level statistical distributions into an interactive, visual writing interface. This work introduces the novel paradigm of βnorm operationalization,β enabling authors to instantaneously perceive, emulate, and adapt linguistic and structural conventions during composition. A user study (N=16) demonstrates that participants significantly improved manuscript organization and rhetorical strategies, and reported substantially increased confidence in both norm adherence and principled innovation.
π Abstract
Many communities, including the scientific community, develop implicit writing norms. Understanding them is crucial for effective communication with that community. Writers gradually develop an implicit understanding of norms by reading papers and receiving feedback on their writing. However, it is difficult to both externalize this knowledge and apply it to one's own writing. We propose two new writing support concepts that reify document and sentence-level patterns in a given text corpus: (1) an ordered distribution over section titles and (2) given the user's draft and cursor location, many retrieved contextually relevant sentences. Recurring words in the latter are algorithmically highlighted to help users see any emergent norms. Study results (N=16) show that participants revised the structure and content using these concepts, gaining confidence in aligning with or breaking norms after reviewing many examples. These results demonstrate the value of reifying distributions over other authors' writing choices during the writing process.