摘要: In this chapter the effectiveness of syllable-based prosodic features for speaker recognition is discussed. The term prosody represents a collection characteristics such as intonation, stress and timing, primarily expressed using variations in pitch, energy duration at various levels speech. Prosody reflects learned/acquired speaking habits person hence contributes recognition. Because are less affected by channel mismatch noise, they particularly well suited forensics, field that demands accurate identification suspects with few mitigating conditions possible. chapter, author describes method extracting directly from speech signal. Applying method, segmented into syllable-like regions vowel onset points (VOP). locations VOPs serve reference extraction representation features. demonstrated extended task NIST evaluation 2003. Combining evidence spectral proposed helps to improve overall accuracy.