Publications

Please check Google Scholar for the full list.

Journal

Xuehao Zhou, Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li, Accented Text-to-Speech Synthesis with Limited Data, IEEE/ACM Transactions on Audio, Speech and Language Processing 2024
Mingyang Zhang, Xuehao Zhou, Zhizheng Wu, Haizhou Li, Towards Zero-Shot Multi-Speaker Multi-Accent Text-to-Speech Synthesis, IEEE Signal Processing Letter 2023
Yi Zhou, Zhizheng Wu, Xiaohai Tian, Haizhou Li, Optimization of Cross-Lingual Voice Conversion with Linguistics Losses to Reduce Foreign Accents, IEEE/ACM Transactions on Audio, Speech and Language Processing 2023
Yi Zhou, Zhizheng Wu, Mingyang Zhang, Xiaohai Tian, Haizhou Li, TTS-Guided Training for Accent Conversion Without Parallel Data, IEEE Signal Processing Letter 2023
Mauro Barni, Yi Fang, Yuhong Liu, Laura Robinson, Kazutoshi Sasahara, Subramaniam Vincent, Xinchao Wang, Zhizheng Wu, Combating Misinformation/Disinformation in Online Social Media - A Multidisciplinary View, APSIPA Transactions on Signal and Information Processing, 2022
Zhizheng Wu, Junichi Yamagishi, Tomi Kinnunen, Cemal Hanilci, Md Sahidullah, Aleksandr Sizov, Nicholas Evans, Massimiliano Todisco, Hector Delgado, "ASVspoof: the Automatic Speaker Verification Spoofing and Countermeasures Challenge", IEEE Journal of Selected Topic of Signal Processing, 2017
Xiaohai Tian, Siu-Wa Lee, Zhizheng Wu, Eng Siong Chng, Haizhou Li, "An Exemplar-based Approach to Frequency Warping for Voice Conversion", IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017
Yanmin Qian, Nanxin Chen, Heinrich Dinkel, Zhizheng Wu, "Deep Feature Engineering for Noise Robust Spoofing Detection", IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017
Zhizheng Wu, Simon King, "Improving Trajectory Modelling for DNN-based Speech Synthesis by using Stacked Bottleneck Features and Minimum Trajectory Error Training", IEEE/ACM Transactions on Audio, Speech and Language Processing, 2016
Ibon Saratxaga, Jon Sanchez, Zhizheng Wu, Inma Hernaez, Eva Navas, "Synthetic Speech Detection Using Phase Information", Speech Communication, 2016.
Zhizheng Wu, Phillip L. De Leon, Cenk Demiroglu, Ali Khodabakhsh, Simon King, Zhen-Hua Ling, Daisuke Saito, Bryan Stewart, Tomoki Toda, Mirjam Wester, Junichi Yamagishi, "Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance", IEEE/ACM Transactions on Audio, Speech and Language Processing, Vol 24, Issue 4, pp 768-783, 2016
Zhizheng Wu, Haizhou Li, "On the study of replay and voice conversion attacks to text-dependent speaker verification", Multimedia Tools and Applications, Springer, 2015.
Aleksandr Sizov, Elie Khoury, Tomi Kinnunen, Zhizheng Wu, Sebastien Marcel, "Joint Speaker Verification and Anti-Spoofing in the i-Vector Space", IEEE Transactions on Information Forensics and Security, Vol 10, Issue 4, pp. 821-832, 2015.
Zhizheng Wu, Nicholas Evans, Tomi Kinnunen, Junichi Yamagishi, Federico Alegre, Haizhou Li, "Spoofing and countermeasures for speaker verification: a survey", Speech Communication, Volume 66, Pages 130–153, 2015
Zhizheng Wu, Eng Siong Chng, Haizhou Li, "Exemplar-based voice conversion using joint nonnegative matrix factorization", Multimedia Tools and Applications, Vol 74, Issue 22, pp 9943-9958, Springer, 2015
Zhizheng Wu, Haizhou Li, "Voice conversion versus speaker verification: an overview", APSIPA Transactions on Signal and Information Processing, 3, e17 doi:10.1017/ATSIP.2014.17.
Zhizheng Wu, Tuomas Virtanen, Eng Siong Chng, Haizhou Li, "Exemplar-based sparse representation with residual compensation for voice conversion", IEEE/ACM Transactions on Audio, Speech and Language Processing, Vol 22, Issue 10, pp. 1506-1521, 2014.
Zhizheng Wu, Tomi Kinnunen, Eng Siong Chng, Haizhou Li, "Mixture of Factor Analyzers using priors from non-parallel speech for voice conversion", IEEE Signal Processing Letter, Vol 19, Issue 12, pp. 914-917, 2012.
Yao Qian, Zhizheng Wu, Boyang Gao, Frank K Soong, "Improved Prosody Generation by Maximizing Joint Likelihood of State and Longer Units", IEEE Transactions on Audio, Speech and Language Processing, Vol 19, Issue 6, pp. 1702-1710, 2011.

Conference

Yuancheng Wang, Zeqian Ju, Xu Tan, Lei He, Zhizheng Wu, Jiang Bian, Sheng Zhao, AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models, Neurips 2023
Yi Zhou, Xiaohai Tian, Zhizheng Wu, Haizhou Li, "Cross-Lingual Voice Conversion with a Cycle Consistency Loss on Linguistic Representation", INTERSPEECH 2021
Zhizheng Wu, Zhihang Xie, Simon King, "The Blizzard Challenge 2019", Blizzard Challenge 2019
Liumeng Xue, Wei Song, Guanghui Xu, Lei Xie, Zhizheng Wu, "Building a mixed-lingual neural TTS system with only monolingual data", INTERSPEECH 2019
Tim Capes, Paul Coles, Alistair Conkie, Ladan Golipour, Abie Hadjitarkhani, Qiong Hu, Nancy Huddleston, Melvyn Hunt, Jiangchuan Li, Matthias Neeracher, Kishore Prahallad, Tuomo Raitio, Ramya Rasipuram, Greg Townsend, Becci Williamson, David Winarsky, Zhizheng Wu, Hepeng Zhang, "Siri On-Device Deep Learning-Guided Unit Selection Text-to-Speech System", INTERSPEECH 2017
Zhizheng Wu, Oliver Watts, Simon King, "Merlin: An Open Source Neural Network Speech Synthesis System", the 9th ISCA Speech Synthesis Workshop (2016).
Mirjam Wester, Zhizheng Wu, Junichi Yamagishi, "Multidimensional scaling of systems in the Voice Conversion Challenge 2016", the 9th ISCA Speech Synthesis Workshop (2016).
Mei Li, Zhizheng Wu, Lei Xie, "On the impact of phoneme alignment in DNN-based speech synthesis", the 9th ISCA Speech Synthesis Workshop (2016).
Srikanth Ronanki, Gustav Eje Henter, Zhizheng Wu, Simon King, "A template-based approach for speech synthesis intonation generation using LSTMs", Interspeech 2016.
Felipe Espic, Cassia Valentini-Botinhao, Zhizheng Wu, Simon King, "Waveform generation based on signal reshaping for statistical parametric speech synthesis", Interspeech 2016.
Mirjam Wester, Zhizheng Wu, Junichi Yamagishi, "Analysis of the Voice Conversion Challenge 2016 Evaluation Results", Interspeech 2016.
Tomoki Toda, Ling-Hui Chen, Daisuke Saito, Fernando Villavicencio, Mirjam Wester, Zhizheng Wu, Junichi Yamagishi, "The Voice Conversion Challenge 2016", Interspeech 2016.
Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li, "An investigation of spoofing speech detection under additive noise and reverberant conditions", Interspeech 2016.
Manu Airaksinen, Bajibabu Bollepalli, Lauri Juvela, Zhizheng Wu, Simon King, Paavo Alku, "GlottDNN - A full-band glottal vocoder for statistical parametric speech synthesis", Interspeech 2016.
Zhizheng Wu, Simon King, "Investigating gated recurrent neural networks for speech synthesis", ICASSP 2016
Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li, "Spoofing detection from a feature representation perspective", ICASSP 2016
Thomas Merritt, Robert A.J. Clark, Zhizheng Wu, Junichi Yamagishi, Simon King, "Deep neural network-guided unit selection synthesis", ICASSP 2016
Oliver Watts, Gustav Eje Henter, Thomas Merritt, Zhizheng Wu, Simon King, "From HMMs to DNNs: where do the improvements come from?", ICASSP 2016
Gustav Eje Henter, Srikanth Ronanki, Oliver Watts, Mirjam Wester, Zhizheng Wu, Simon King, "Robust TTS duration modelling using DNNs", ICASSP 2016
Zhizheng Wu, Simon King, "Minimum trajectory error training for deep neural networks, combined with stacked bottleneck features", Interspeech 2015
Zhizheng Wu, Pawel Swietojanski, Christophe Veaux, Steve Renals, Simon King, "A study of speaker adaptation for DNN-based speech synthesis", Interspeech 2015
Zhizheng Wu, Tomi Kinnunen, Nicholas Evans, Junichi Yamagishi, Cemal Hanilci, Md Sahidullah, Aleksandr Sizov, "ASVspoof 2015: the First Automatic Speaker Verification Spoofing and Countermeasures Challenge", Interspeech 2015.
Qiong Hu, Zhizheng Wu, Korin Richmond, Junichi Yamagishi, Yannis Stylianou, Ranniery Maia, "Fusion of multiple parameterisations for DNN-based sinusoidal speech synthesis with multi-task learning", Interspeech 2015
Cassia Valentini-Botinhao, Zhizheng Wu, Simon King, "Towards minimum perceptual error training for DNN-based speech synthesis", Interspeech 2015
Oliver Watts, Zhizheng Wu, Simon King, "Sentence-level control vectors for deep neural network speech synthesis", Interspeech 2015
Xiaohai Tian, Zhizheng Wu, Siu-Wa Lee, Nguyen Quy Hy, Minghui Dong, Eng Siong Chng, "System Fusion for High-Performance Voice Conversion", Interspeech 2015
Mirjam Wester, Zhizheng Wu, Junichi Yamagishi, "Human vs Machine Spoofing Detection on Wideband and Narrowband Data", Interspeech 2015
Thomas Merritt, Junichi Yamagishi, Zhizheng Wu, Oliver Watts, Simon King, "Deep neural network context embeddings for model selection in rich-context HMM synthesis", Interspeech 2015
Oliver Watts, Srikanth Ronanki, Zhizheng Wu, Tuomo Raitio, Antti Suni, "The NST–GlottHMM entry to the Blizzard Challenge 2015", The Blizzard Challenge workshop 2015
Zhizheng Wu, Cassia Valentini-Botinhao, Oliver Watts, Simon King, "Deep neural network employing multi-task learning and stacked bottleneck features for speech synthesis", ICASSP 2015
Zhizheng Wu, Ali Khodabakhsh, Cenk Demiroglu, Junichi Yamagishi, Daisuke Saito, Tomoki Toda, Simon King, "SAS: A speaker verification spoofing database containing diverse attacks", ICASSP 2015.
Xiaohai Tian, Zhizheng Wu, Siu Wa Lee, Nguyen Quy Hy, Eng Siong Chng, Minghui Dong, "Sparse representation for frequency warping based voice conversion", ICASSP 2015
Zhizheng Wu, Eng Siong Chng, Haizhou Li, "Joint nonnegative matrix factorization for exemplar-based voice conversion", Interspeech 2014.
Siu-Wa Lee, Zhizheng Wu, Minghui Dong, Xiaohai Tian, Haizhou Li, "A Comparative Study of Spectral Transformation Techniques for Singing Voice Synthesis", Interspeech 2014.
Elie Khoury, Tomi Kinnunen, Aleksandr Sizov, Zhizheng Wu, Sebastien Marcel, "Introducing I-Vectors for Joint Anti-spoofing and Speaker Verification", Interspeech 2014.
Zhizheng Wu, Sheng Gao, Eng Siong Chng, Haizhou Li, "A study on replay attack and anti-spoofing for text-dependent speaker verification", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 2014.
Xiaohai Tian, Zhizheng Wu, Siu-Wa Lee, Eng Siong Chng, "Correlation-based frequency warping for voice conversion", International Symposium on Chinese Spoken Language Processing (ISCSLP) 2014.
Zhizheng Wu, Haizhou Li, "Voice conversion and spoofing attack on speaker verification systems", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 2013.
Xiaohai Tian, Zhizheng Wu, Eng Siong Chng, "Local partial least square regression for spectral mapping in voice conversion", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 2013.
Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Eng Siong Chng, Haizhou Li, "Exemplar-based voice conversion using non-negative spectrogram deconvolution", The 8th speech synthesis workshop (SSW8)
Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Eng Siong Chng, Haizhou Li, "Exemplar-based unit selection for voice conversion utilizing temporal information", Interspeech 2013.
Zhizheng Wu, Anthony Larcher, Kong Aik Lee, Eng Siong Chng, Tomi Kinnunen, Haizhou Li, "Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints", Interspeech 2013.
Zhizheng Wu, Eng Siong Chng, Haizhou Li, "Conditional restricted boltzmann machine for voice conversion", ChinaSIP 2013.
Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li, "Synthetic speech detection using temporal modulation feature", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2013.
Zhizheng Wu, Tomi Kinnunen, Eng Siong Chng, Haizhou Li, Eliathamby Ambikairajah, "A study on spoofing attack in state-of-the-art speaker verification: the telephone speech case", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 2012.
Zhizheng Wu, Eng Siong Chng, Haizhou Li, "Detecting Converted Speech and Natural Speech for anti-Spoofing Attack in Speaker Recognition", Interspeech 2012.
Tomi Kinnunen, Zhizheng Wu, Kong Aik Lee, Filip Sedlak, Eng Siong Chng, Haizhou Li, "Vulnerability of Speaker Verification Systems Against Voice Conversion Spoofing Attacks: the Case of Telephone Speech", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2012
Zhizheng Wu, Tomi Kinnunen, Eng Siong Chng, Haizhou Li, "Text-Independent F0 Transformation with Non-Parallel Data for Voice Conversion", Interspeech, Makuhari, Japan, 2010.
Zhizheng Wu, Eng Siong Chng, Haizhou Li, "Development of HMM-based Malay Text-to-Speech System", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 2010, Singapore, 2010.
Yao Qian, Frank Soong, Miaomiao Wang, Zhizheng Wu, "A Minimum V/U Error Approach to F0 Generation in HMM-Based TTS", Interspeech, Brighton, UK, 2009.
Yao Qian, Zhizheng Wu, Frank K Soong, "Improved Prosody Generation by Maximizing Joint Likelihood of State and Longer Units", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009.
Zhizheng Wu, Yao Qian, Frank K Soong, Bo Zhang, "Modeling and Generating Tone Contour with Phrase Intonation for Mandarin Chinese Speech", International Symposium on Chinese Spoken Language Processing (ISCSLP), Kunming, China, 2008.
Boyang Gao, Yao Qian, Zhizheng Wu, Frank K Soong, "Duration Refinement by Jointly Optimizing State and Longer Units", Interspeech, Brisbane, Australia, 2008.

Zhizheng Wu

Journal

Conference