Search for: All records

Creators/Authors contains: "Zhu, Yunzheng"

« Prev Next »

Total Resources

3

Resource Type
Conference Paper

2

Conference Proceeding

0

Dataset

0

Journal Article

1

Workshop Report

0

Availability
Full Text / Resource Available

3

Citation Only

0

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Towards Better Domain Adaptation for Self-Supervised Models: A Case Study of Child ASR

https://doi.org/10.1109/JSTSP.2022.3200910

Fan, Ruchao ; Zhu, Yunzheng ; Wang, Jinhan ; Alwan, Abeer ( October 2022 , IEEE Journal of Selected Topics in Signal Processing)

Full Text Available
Towards Better Meta-Initialization with Task Augmentation for Kindergarten-Aged Speech Recognition

https://doi.org/10.1109/ICASSP43922.2022.9747599

Zhu, Yunzheng ; Fan, Ruchao ; Alwan, Abeer ( May 2022 , Proceedings of the IEEE ICASSP)

Children’s automatic speech recognition (ASR) is always difficult due to, in part, the data scarcity problem, especially for kindergarten-aged kids. When data are scarce, the model might overfit to the training data, and hence good starting points for training are essential. Recently, meta-learning was proposed to learn model initialization (MI) for ASR tasks of different languages. This method leads to good performance when the model is adapted to an unseen language. How-ever, MI is vulnerable to overfitting on training tasks (learner overfitting). It is also unknown whether MI generalizes to other low-resource tasks. In this paper, we validate the effectiveness of MI in children’s ASR and attempt to alleviate the problem of learner overfitting. To achieve model-agnostic meta-learning (MAML), we regard children’s speech at each age as a different task. In terms of learner overfitting, we propose a task-level augmentation method by simulating new ages using frequency warping techniques. Detailed experiments are conducted to show the impact of task augmentation on each age for kindergarten-aged speech. As a result, our approach achieves a relative word error rate (WER) improvement of 51% over the baseline system with no augmentation or initialization.
more » « less
Full Text Available
Low Resource German ASR with Untranscribed Data Spoken by Non-Native Children; INTERSPEECH 2021 Shared Task SPAPL System

https://doi.org/10.21437/Interspeech.2021-1974

Wang, Jinhan ; Zhu, Yunzheng ; Fan, Ruchao ; Chu, Wei ; Alwan, Abeer ( August 2021 , Interspeech 2021)
null (Ed.)
Full Text Available