- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources4
- Resource Type
-
0003000001000000
- More
- Availability
-
40
- Author / Contributor
- Filter by Author / Creator
-
-
Bartels, Phillip L. (1)
-
Caradonna, Timothy M. (1)
-
Carlin, Aaron F. (1)
-
Chanda, Sumit K. (1)
-
Cheng-I Lai (1)
-
Clausen, Thomas Mandel (1)
-
Corbett, Kevin D. (1)
-
Cox, David (1)
-
Esko, Jeffrey D. (1)
-
Feldman, Jared (1)
-
Fu, Yonggan (1)
-
Garretson, Aaron F. (1)
-
Glass, Charles A. (1)
-
Godula, Kamil (1)
-
Golden, Gregory J. (1)
-
Gordts, Philip L.S.M. (1)
-
Hauser, Blake M. (1)
-
Hui Shi, Yang Zhang (1)
-
Jose, Joyce (1)
-
Kaizhi Qian (1)
-
- Filter by Editor
-
-
null (1)
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
& Ruiz-Arias, P.M. (0)
-
& S. Spitzer (0)
-
& Sahin. I. (0)
-
& Spitzer, S. (0)
-
& Spitzer, S.M. (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Self-supervised learning (SSL) for rich speech representations has achieved empirical success in low-resource Automatic Speech Recognition (ASR) and other speech processing tasks, which can mitigate the necessity of a large amount of transcribed speech and thus has driven a growing demand for on-device ASR and other speech processing. However, advanced speech SSL models have become increasingly large, which contradicts the limited on-device resources. This gap could be more severe in multilingual/multitask scenarios requiring simultaneously recognizing multiple languages or executing multiple speech processing tasks. Additionally, strongly overparameterized speech SSL models tend to suffer from overfitting when being finetuned on low-resource speech corpus. This work aims to enhance the practical usage of speech SSL models towards a win-win in both enhanced efficiency and alleviated overfitting via our proposed S-Router framework, which for the first time discovers that simply discarding no more than 10% of model weights via only finetuning model connections of speech SSL models can achieve better accuracy over standard weight finetuning on downstream speech processing tasks. More importantly, S-Router can serve as an all-in-one technique to enable (1) a new finetuning scheme, (2) an efficient multilingual/multitask solution, (3) a state-of-the-art pruning technique, and (4) a new tool to quantitatively analyze the learned speech representation.more » « less
-
Fu, Yonggan; Yu, Qixuan; Zhang, Yang Zhang; Wu, Shang: Quyang; Cox, David; Lin, Yingyan (, 35th Conference on Neural Information Processing Systems (NeurIPS 2021))
-
Hui Shi, Yang Zhang (, ICLR 2020)
-
Clausen, Thomas Mandel; Sandoval, Daniel R.; Spliid, Charlotte B.; Pihl, Jessica; Perrett, Hailee R.; Painter, Chelsea D.; Narayanan, Anoop; Majowicz, Sydney A.; Kwong, Elizabeth M.; McVicar, Rachael N.; et al (, Cell)null (Ed.)
An official website of the United States government

Full Text Available