Mu Yang's Website
Mu Yang's Website
Home
Publications
Experience
Projects
Misc
CV
Light
Dark
Automatic
paper
Audiobox TTA-RAG: Improving Zero-Shot and Few-Shot Text-To-Audio with Retrieval-Augmented Generation
Current leading Text-To-Audio (TTA) generation models suffer from degraded performance on zero-shot and few-shot settings. It is often …
Mu Yang
,
Bowen Shi
,
Matthew Le
,
Wei-Ning Hsu
,
Andros Tjandra
PDF
Audio Samples
DiariST: Streaming Speech Translation with Speaker Diarization
End-to-end speech translation (ST) for conversation recordings involves several under-explored challenges such as speaker diarization …
Mu Yang
,
Naoyuki Kanda
,
Xiaofei Wang
,
Junkun Chen
,
Peidong Wang
,
Jian Xue
,
Jinyu Li
,
Takuya Yoshioka
PDF
What Can an Accent Identifier Learn? Probing Phonetic and Prosodic Information in a Wav2vec2-based Accent Identification Model
This study is focused on understanding and quantifying the change in phoneme and prosody information encoded in the Self-Supervised …
Mu Yang
,
Ram C. M. C. Shekar
,
Okim Kang
,
John H. L. Hansen
PDF
Learning ASR Pathways: A Sparse Multilingual ASR Model
Neural network pruning can be effectively applied to compress automatic speech recognition (ASR) models. However, in multilingual ASR, …
Mu Yang
,
Andros Tjandra
,
Chunxi Liu
,
David Zhang
,
Duc Le
,
Ozlem Kalinli
PDF
Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment
Current leading mispronunciation detection and diagnosis (MDD) systems achieve promising performance via end-to-end phoneme …
Mu Yang
,
Kevin Hirschi
,
Stephen D. Looney
,
Okim Kang
,
John H. L. Hansen
PDF
Audio Samples
Towards Lifelong Learning of Multilingual Text-To-Speech Synthesis
This work presents a lifelong learning approach to train a multilingual Text-To-Speech (TTS) system, where each language was seen as an …
Mu Yang
,
Shaojin Ding
,
Tianlong Chen
,
Tong Wang
,
Zhangyang Wang
PDF
Code
Audio Samples
Joint Hypoglycemia Prediction and Glucose Forecasting via Deep Multi-task Learning
We present a multitask learning approach to the problem of hypoglycemia (HG) prediction in diabetes. The approach is based on a …
Mu Yang
,
Darpit Dave
,
Madhav Erraguntla
,
Gerard L. Cote
,
Ricardo Gutierrez-Osuna
PDF
EventPlus: A Temporal Event Understanding Pipeline
We present EventPlus, a temporal event understanding pipeline that integrates various state-of-the-art event understanding components …
Mingyu Derek Ma
,
Jiao Sun
,
Mu Yang
,
Kung-Hsiang Huang
,
Nuan Wen
,
Shikhar Singh
,
Rujun Han
,
Nanyun Peng
PDF
Code
Demo
A CNN-based Active Learning Framework to Identify Mycobacteria in Digitized Ziehl-Neelsen Stained Human Tissues
Tuberculosis is the most common mycobacterial disease that affects humans worldwide. Rapid and reliable diagnosis of mycobacteria is …
Mu Yang
,
Karolina Nurzynska
,
Ann E. Walts
,
Arkadiusz Gertych
PDF
Biomedical Event Extraction with Hierarchical Knowledge Graphs
Biomedical event extraction is critical in understanding biomolecular interactions described in scientific corpus. One of the main …
Kung-Hsiang Huang
,
Mu Yang
,
Nanyun Peng
PDF
Code
»
Cite
×