- Patents: Media Capture Device With Power Saving And Encryption (JP 7710816, 10 Jul 2025)
Research Experience
- Granite speech: IBM open-source speech-aware large language models
- High performance deep neural network acoustic modeling
- Distributed acoustic modeling for automatic speech recognition
- Fast and accurate speech recognition for customer care
- End-to-end (E2E) learning for speech recognition and synthesis
- Unsupervised learning techniques for large unlabeled speech data
- IARPA Babel project for spoken term detection
- DARPA Transtac project for multi-lingual speech-to-speech translation
- Adjunct Professor, Department of Electrical Engineering, Columbia University
Education
I received my B.S. degree from Shanghai Jiao Tong University, Shanghai, China; my M.S. degree from Tsinghua University, Beijing, China; and my Ph.D. degree from the University of California, Los Angeles, all in electrical engineering.
Background
I am a principal research scientist in the speech department of the IBM T. J. Watson Research Center. My research interests include automatic speech recognition, multi-lingual speech-to-speech translation, digital speech processing, statistical signal processing, machine learning, and pattern recognition. Most recently, I have been working on deep learning in speech applications.