skip to primary navigation skip to content

CBSU bibliography search


To request a reprint of a CBSU publication, please click here to send us an email (reprints may not be available for all publications)

A comparison of two compression functions used for noisy vowel detection with back-propagation networks.
Authors:
Thurston, P. & Norris, D.
Reference:
In Eurospeech 91: Proceedings of the 2nd European Conference on Speech Communication and Technology, Vol. 2 (pp. 995-998). Genova, Italy, 24-26 September.
Year of publication:
1991
CBU number:
2640
Abstract:
This paper compares the merits of two alternative front-ends for a connectionist vowel recogniser; an auditory hair-cell model and a conventional FFT. The choice of front-end has a marked impact on the recognition of vowels, particularly when the networks are trained and tested using speech in the presence of noise. One notable difference between the front-ends is that whereas the FFT based front-end uses an output compression function that is cubic, the auditory model has an intrinsic compression function that is logarithmic. By rescaling the output values for the two front-ends, we demonstrate that the difference in performance is determined by the speech compression function and not by the choice of front-end per se.


genesis();