Problem
Research questions and friction points this paper is trying to address.
Balancing latency and accuracy across streaming and offline automatic speech recognition (ASR)
Improving accuracy on overlapping speech with Continuous Speech Separation (CSS) and end-to-end (E2E) systems
Enhancing multi-talker transcription readability with segment-based Serialized Output Training (SOT)
Innovation
Methods, ideas, or system contributions that make the work stand out.
Continuous Speech Separation front-end for overlapping speech
Dual models for streaming and offline ASR
Segment-based SOT for more readable offline multi-talker transcription