Problem
Research questions and friction points this paper is trying to address.
Capturing temporal evolution of scientific text
Improving domain-specific NLP task performance
Analyzing scientific discourse development over time
Innovation
Methods, ideas, or system contributions that make the work stand out.
Uses whole words as tokens instead of subwords
Base model pretrained on 1.7 million arXiv papers
Progressively trained on an annual basis to capture temporal evolution
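
A minimal sketch of the two ideas listed above: whole-word tokenization and a year-by-year training order. This is an illustration only, not the paper's implementation. The regex, the `<unk>` token, the vocabulary size, and the names `whole_word_tokens`, `build_vocab`, `encode`, and `progressive_schedule` are all assumptions introduced here, and the actual model training step is left out.

```python
import re
from collections import Counter

# Assumed word pattern: lowercase alphanumeric runs, optionally joined by
# hyphens or apostrophes. The paper's actual tokenization rules may differ.
WORD_RE = re.compile(r"[a-z0-9]+(?:[-'][a-z0-9]+)*")


def whole_word_tokens(text):
    """Split text into lowercase whole words; no subword pieces are produced."""
    return WORD_RE.findall(text.lower())


def build_vocab(texts, max_size=50000, unk="<unk>"):
    """Map the most frequent whole words to integer ids; id 0 is reserved for unknowns."""
    counts = Counter(tok for t in texts for tok in whole_word_tokens(t))
    vocab = {unk: 0}
    for word, _ in counts.most_common(max_size - 1):
        vocab[word] = len(vocab)
    return vocab


def encode(text, vocab, unk="<unk>"):
    """Convert text to ids; out-of-vocabulary words map to the unknown id."""
    return [vocab.get(tok, vocab[unk]) for tok in whole_word_tokens(text)]


def progressive_schedule(base_name, years):
    """Return (init_from, year) pairs: each year's model starts from the previous one."""
    schedule = []
    prev = base_name
    for year in sorted(years):
        schedule.append((prev, year))
        prev = f"model-{year}"  # hypothetical checkpoint naming
    return schedule


if __name__ == "__main__":
    vocab = build_vocab(["dark matter halos", "dark energy surveys"])
    print(encode("dark photon", vocab))
    print(progressive_schedule("base", [2010, 2009]))
```

With whole words as tokens, a word that is missing from the vocabulary maps to a single unknown id rather than being split into pieces, which is the trade-off against subword schemes. The schedule shows only the chaining of checkpoints, with each annual model initialized from the one before it.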