Efficient interactive weight tuning for TTS synthesis: Reducing user fatigue by improving user consistency
15 June 2006
Alías, F., Llorà, X., Formiga, L., Sastry, K., Goldberg, D. E. (2006). IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2006). 1, 865–868. [Full paper - PDF].
Abstract:
The quality of corpus-based text-to-speech systems depends on the accuracy of the unit selection process, which in turn relies on the cost function definition. This function should capture the user's perceptual preferences when selecting synthesis units, which is a very difficult task. This paper continues our previous work on fusing human judgements with the cost function by means of interactive weight tuning. The application of active interactive genetic algorithms mitigates user fatigue by improving user consistency. As a result, the obtained weights generate more natural synthetic speech when compared to previous objective and subjective proposals.
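To make the role of the weights concrete, here is a minimal sketch of how a weighted cost function could drive unit selection. It assumes the cost is a plain weighted sum of per-feature subcosts, which the abstract does not state; the function names, the candidate units, and the three subcost values are all hypothetical and are not taken from the paper.

```python
from typing import Dict, Sequence


def weighted_cost(subcosts: Sequence[float], weights: Sequence[float]) -> float:
    """Combine per-feature subcosts into one scalar via a weighted sum (assumed form)."""
    if len(subcosts) != len(weights):
        raise ValueError("subcosts and weights must have the same length")
    return sum(w * c for w, c in zip(weights, subcosts))


def select_unit(candidates: Dict[str, Sequence[float]], weights: Sequence[float]) -> str:
    """Return the name of the candidate unit with the lowest weighted cost."""
    return min(candidates, key=lambda name: weighted_cost(candidates[name], weights))


# Hypothetical subcosts per candidate, e.g. pitch, duration, and energy mismatch.
candidates = {"unit_a": [0.2, 0.9, 0.1], "unit_b": [0.6, 0.1, 0.3]}

# Different weight vectors lead to different selected units.
pitch_heavy = select_unit(candidates, [1.0, 0.1, 0.1])
duration_heavy = select_unit(candidates, [0.1, 1.0, 0.1])
```

In this toy setting the selected unit flips when the weights change, which is why tuning them against listener preferences matters for the naturalness of the output.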
