Human speech consists of a series of voiced sounds—tonal sounds or formants—and unvoiced sounds. The main distinction between voiced and unvoiced sounds is that voiced sounds are produced by an oscillation of the vocal cords, whereas unvoiced sounds are produced by blocking and restricting the air flow with lips, tongue, palate, throat, and larynx.
If speech containing voiced and unvoiced sounds is used as a vocoder’s analysis signal, but the synthesis engine doesn’t differentiate between voiced and unvoiced sounds, the result will sound rather weak. To avoid this problem, the synthesis section of the vocoder must produce different sounds for the voiced and unvoiced parts of the signal.
The EVOC 20 PolySynth includes an Unvoiced/Voiced detector for this specific purpose. This unit detects the unvoiced portions of the sound in the analysis signal and then substitutes the corresponding portions in the synthesis signal with noise, with a mixture of noise and synthesizer signal, or with the original signal. If the U/V Detector detects voiced parts, it passes this information to the Synthesis section, which uses the normal synthesis signal for these portions.