Tied-mixture HMMs have been proposed as the acoustic model for large-vocabulary continuous speech recognition and have yielded promising results. They share base-distribution and provide more flexibility in choosing the degree of tying than state-...
The 'Kabushiki Shikyo' program broadcast on NHK Radio 2 reports on the daily closing prices and net changes of about 830 stocks listed on the Tokyo Stock Exchange. Reading out the numerical values without making mistakes within the allotted broadc...
Hiroyuki Segi   Kazuo Onoe   Shoei Sato   Akio Kobayashi   Akio Ando   
Journal of Information Technology Research 7(3) 15-31 2014年7月 [査読有り]
Tied-mixture HMMs have been proposed as the acoustic model for large-vocabulary continuous speech recognition and have yielded promising results. They share base-distribution and provide more flexibility in choosing the degree of tying than state-...
A new pause duration setting method for synthesis by compilation of recorded speech is proposed and its effect is confirmed by subjective evaluation test.
We have been conducting research on a high-quality speech synthesis system for automatic audio broadcasting. We propose stock prices voice synthesizer with numerical speech synthesis method and speech rate conversion.
It is useful to combine multi-speaker's speech database for concatenative speech synthesis system. This paper describes a perceptual study on naturalness and personality by exchanging a phoneme segment to other speaker's one in a word speech synth...