Modifying Spectral Envelope to Synthetically Adjust Voice Quality and Articulation Parameters for Emotional Speech Synthesis
Modifying Spectral Envelope to Synthetically Adjust Voice Quality and Articulation Parameters for Emotional Speech Synthesis
Both of the prosody and spectral features are important for emotional speech synthesis. Besides prosody effects, voice quality and articulation parameters are the factors that should be considered to modify in emotional speech synthetic systems. Generally, rules and filters are designed to process these parameters respectively. This paper proves that by modifying spectral envelope, the voice quality and articulation could be adjusted as a whole. Thus, it will not need to modify each of the parameter separately depending on rules. Accordingly, it will make the synthetic system more flexible by designing an automatic spectral envelope model based on some machine learning methods. The perception test in this paper also shows that when prosody and spectral features are all modified, the best emotional synthetic speech will be obtained.
3-540-29621-2
334-341
Shao, Yanqiu
b454b95d-76c6-4c46-9fbc-565a576ef3e0
Wang, Zhuoran
ee2cbf1e-1250-46a7-8787-82920a656570
Han, Jiqing
907dce2d-9e9b-4196-a9e8-991d693fb44b
Liu, Ting
d86f6607-a0c9-4877-b4c6-fa9705fe6b6b
2005
Shao, Yanqiu
b454b95d-76c6-4c46-9fbc-565a576ef3e0
Wang, Zhuoran
ee2cbf1e-1250-46a7-8787-82920a656570
Han, Jiqing
907dce2d-9e9b-4196-a9e8-991d693fb44b
Liu, Ting
d86f6607-a0c9-4877-b4c6-fa9705fe6b6b
Shao, Yanqiu, Wang, Zhuoran, Han, Jiqing and Liu, Ting
(2005)
Modifying Spectral Envelope to Synthetically Adjust Voice Quality and Articulation Parameters for Emotional Speech Synthesis.
1st International Conference on Affective Computing & Intelligent Interaction, Beijing, China.
22 - 24 Oct 2005.
.
(doi:10.1007/11573548_43).
Record type:
Conference or Workshop Item
(Poster)
Abstract
Both of the prosody and spectral features are important for emotional speech synthesis. Besides prosody effects, voice quality and articulation parameters are the factors that should be considered to modify in emotional speech synthetic systems. Generally, rules and filters are designed to process these parameters respectively. This paper proves that by modifying spectral envelope, the voice quality and articulation could be adjusted as a whole. Thus, it will not need to modify each of the parameter separately depending on rules. Accordingly, it will make the synthetic system more flexible by designing an automatic spectral envelope model based on some machine learning methods. The perception test in this paper also shows that when prosody and spectral features are all modified, the best emotional synthetic speech will be obtained.
Restricted to Registered users only
More information
Published date: 2005
Additional Information:
Event Dates: Oct 22-24
Venue - Dates:
1st International Conference on Affective Computing & Intelligent Interaction, Beijing, China, 2005-10-22 - 2005-10-24
Organisations:
Electronics & Computer Science
Identifiers
Local EPrints ID: 261542
URI: http://eprints.soton.ac.uk/id/eprint/261542
ISBN: 3-540-29621-2
PURE UUID: 0e5062c3-71a6-4b67-a71f-7ee8f0f00732
Catalogue record
Date deposited: 13 Nov 2005
Last modified: 14 Mar 2024 06:54
Export record
Altmetrics
Contributors
Author:
Yanqiu Shao
Author:
Zhuoran Wang
Author:
Jiqing Han
Author:
Ting Liu
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics