2023
- Hao Huang, Lin Wang, Jichen Yang, Ying Hu, and Liang He.
W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision.
EURASIP Journal on Audio, Speech, and Music Processing |
arxiv
- Haoxiang Su, Hongyan Xie, Hao Huang*, Shuangyong Song, Ruiyu Fang, Xiaomeng Huang, Sijie Feng.
Scalable-DSC: A Structural Template Prompt Approach to Scalable Dialogue State Correction.
EMNLP 2023 |
arxiv
- Kai Wang, Jingjing Liu, Yizhou Peng, Hao Huang*.
Neural RAPT: Deep Learning-based Pitch Tracking with Prior Algorithmic Knowledge Instillation..
International Journal of Speech Technology |
arxiv
- Rui Li, Zhiwei Xie, Haihua Xu, Yizhou Peng, Hexin Liu, Hao Huang*, Eng Siong Chng.
Self-supervised Learning Representation based Accent Recognition with Persistent Accent Memor.
Interspeech 2023 |
arxiv
- Yachad Guo, Zhibin Qiu, Hao Huang*, Chng Eng Siong.
Improved Keyword Recognition Based on Aho-Corasick Automaton .
International Joint Conference on Neural Networks (IJCNN) |
arxiv
- Jichen Yang, Yi Zhou*, Hao Huang*.
Mel-S3R: Combining Mel-Spectrogram and Self-Supervised Speech Representation with VQ-VAE for Any-to-Any Voice Conversion .
Speech Communication |
arxiv
- Yuhang Yang, Haihua Xu, Hao Huang*, Eng Siong Chng, Sheng Li.
Speech-Text based Multi-Modal Training with Bidirectional Attention for Improved Speech Recognition .
ICASSP 2023 |
arxiv
- Zhibin Qiu, Mengfan Fu, Yinfeng Yu, LiLi Yin, Fuchun Sun, Hao Huang*.
SRTNet: Time Domain Speech Enhancement via Stochastic Refinement .
ICASSP 2023 |
arxiv
- Saierdaer Yusuyin, Hao Huang*, Junhua Liu, Cong Liu.
Investigation into Phone-Based Subword Units for Multilingual End-To-End Speech Recognition.
ICASSP 2023 |
arxiv
- Lili Yin, Di Wu, Zhibin Qiu Hao Huang*.
Mitigating Domain Dependency for Improved Speech Enhancement via SNR Loss Boosting .
ICASSP 2023 |
arxiv
- Kai Wang, Yuhang Yang, Hao Huang*, Ying Hu, Sheng Li.
SpeakerAugment: Data Augmentation for Generalizable Source Separation via Speaker Parameter Manipulation .
ICASSP 2023 |
arxiv
2022
- Hongyan Xie, Haoxiang Su, Shuangyong Song, Hao Huang*, Bo Zou, Kun Deng, Jianghua Lin, Zhihui Zhang and Xiaodong He.
Correctable-DST: Mitigating Historical Context Mismatch between Training and Inference for Improved Dialogue State Tracking. .
EMNLP 2022 (Long Paper, Oral Presentation) |
arxiv
- Guodong Ma, Pengfei Hu, Nurmemet Yolwas, Shen Huang, Hao Huang*.
PM-MMUT: Boosted Phone-Mask Data Augmentation using Multi-Modeling Unit Training for Phonetic-Reduction-Robust E2E Speech Recognition.
Interspeech 2022 |
arxiv
- Yizhou Peng, Jicheng Zhang, Haihua Xu, Hao Huang*, Eng Siong Chng.
Minimum Word Error Training for Non-Autoregressive Transformer-Based Code-Switching ASR.
ICASSP 2022 |
arxiv
- Kai Wang, Yizhou Peng, Hao Huang*, Ying Hu, Sheng Li.
Mining Hard Samples Locally And Globally for Improved Speech Separation.
ICASSP 2022 |
arxiv
2021
- Guodong Ma, Pengfei Hu, Jian Kang, Nurmemet Yolwas, Shen Huang*, Hao Huang*.
Leveraging Phone Mask Training for Phonetic-Reduction-Robust E2E Uyghur Speech Recognition.
Interspeech 2021 |
pdf
- Jicheng Zhang, Yizhou Peng, Pham Van Tung, Haihua Xu, Hao Huang *, Eng Siong Chng.
E2E-based Multi-task Learning Approach to Joint Speech and Accent Recognition.
Interspeech 2021 |
arxiv
- Kai Wang, Hao Huang*, Ying Hu, Zhihua Huang, Sheng Li.
Separation Using Orthogonal Representation in Complex and Real Time-Frequency Domain..
Interspeech 2021 |
pdf
- Xiao Kang, Hao Huang*, Ying Hu, Zhihua Huang.
Connectionist temporal classification loss for vector quantized variational autoencoder in zero-shot voice conversion.
Digital Signal Processing 2021 |
SCI |
pdf
- Hao Huang*, Kai Wang, Ying Hu, Sheng Li.
Encoder-Decoder based Pitch Tracking and Joint Model Training for Mandarin Tone Classification.
ICASSP 2021 |
pdf
- Weiqi Gao, Hao Huang*.
A gating context-aware text classification model with BERT and graph convolutional Networks..
Journal of Intelligent and Fuzzy Systems |
SCI
- Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang*, Eng Siong Chng.
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems..
ISCSLP2021 |
arxiv
2020
- Haobo Zhang, Haihua Xu, Van Tung Pham, Hao Huang*, Eng Siong Chng.
Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-switching Speech Recognition..
Interspeech2020 |
arxiv
- Zhong Ying, Ying Hu*, Hao Huang, and Wushour Silamu.
A Lightweight Model Based on Separable Convolution for Speech Emotion Recognition.
Interspeech2020 |
pdf
- 董兴磊,胡英*,黄浩,吾守尔.斯拉木.
基于稀疏卷积非负矩阵部分联合分解的单声道语音分离.
自动化学报2020 |
pdf
more
- Hao Huang*, Haihua Xu, Ying Hu, Gang Zhou.
A transfer learning approach to goodness of pronunciation based automatic mispronunciation detection. .
JASA2017 |
pdf
- Haihua Xu, Hang Su, Chongjia Ni, Xiong Xiao, Hao Huang, Eng Siong Chng and Haizhou Li.
Semi-supervised and Cross-lingual Knowledge Transfer Learnings for DNN Hybrid Acoustic Models under Low-resource Conditions. .
Interspeech2016 |
pdf
- Hao Huang*,Haihua Xu,Xianhui Wang,Wushour Silamu.
Maximum F1-Score Discriminative Training Criterion for Automatic Mispronunciation Detection.
IEEE/ACM 2015 |
pdf
- 黄浩*,徐海华,王羡慧,吾守尔.斯拉木
自动发音错误检测中基于最大化F1值准则的区分性特征补偿训练算法.
电子学报 2015 |
pdf
- Hao Huang, Wang J, Abudureyimu H.
Maximum F1-Score Discriminative Training Criterion for Automatic Mispronunciation Detection.
Interspeech2012 |
pdf
- 黄浩*、李兵虎、吾守尔·斯拉木.
区分性模型组合中基于决策树的声学上下文建模方法..
自动划学报 2012 |
pdf
- Hao Huang*, Binghu Li.
Lattice Based Discriminative Model Combination Using Automatically Induced Phonetic Contexts..
Interspeech2011 |
pdf
- Hao Huang*, Binghu Li.
Automatic context induction for tone model integration in Mandarin speech recognition.
中国邮电高校学报(英文版) |
pdf
- Xiong Y, Zhu J, Huang Hao, Haihua Xu.
Minimum tag error for discriminative training of conditional random fields. .
Information Sciences 2009 |
pdf
- Huang Hao, Zhu J.
Discriminative incorporation of explicitly trained tone models into lattice based rescoring for Mandarin speech recognition.
ICASSP 2008 |
IEEE 2008 |
pdf
- Huang H, Zhu J
Minimum phoneme error based filter bank analysis for speech recognition. .
IEEE 2006 |
pdf
- Xiong Y, Zhu J, Huang Hao, Haihua Xu.
Minimum phoneme error based filter bank analysis for speech recognition .
IEEE 2006 |
pdf