ONOE Kazuo, SEGI Hiroyuki, KOBAYAKAWA Takeshi, SATO Shoei, IMAI Toru, ANDO Akio
IEICE technical report. Speech, Jun 22, 2001
We are studying speech recognition for simultaneous subtitling of news programs. One of our current studies is recognition of reporter's speech in the afflicted area, from overseas and so on. Because these speeches are uttered in various acoustic conditions, we need technologies to handle such conditions. In this paper, a filter-bank subtraction technique is proposed for robust speech recognition under various acoustic conditions. It calculates, in each pass band of the filter-bank, the minimum value of the filter output in a certain interval of time, and then estimates a noise spectrum over all bands as a composition of those minimum values. The experiments show that the new method yielded better recognition performance than the conventional spectral subtraction techniques.