Perception et cognition auditives PECA

Atelier du Réseau de sciences cognitives d'Ile-de-France

Responsables : Carolyn Drake et Daniel Pressnitzer

Site PECA

Lieu : IRCAM, 1 place Igor-Stravinsky, 75004 Paris, salle Stravinsky
Métros : Hôtel de Ville, Châtelet, Les Halles, Rambuteau


Programme 2003-2004

mardi 16 septembre DeLiang Wang (Ohio State University) Multipitch Tracking for Noisy Speech
 


Abstract :

An effective algorithm for multipitch tracking is important for a variety of acoustic processing tasks. However, the performance of existing algorithms is not satisfactory. In this talk, we present a robust algorithm for multipitch tracking of noisy speech. Our approach integrates an improved channel and peak selection method, a new method for extracting periodicity across different channels, and
a hidden Markov model (HMM) for modeling continuous pitch tracks. The resulting algorithm can reliably detect single and double pitch tracks in a noisy environment. We suggest a pitch error measure for the multipitch situation. The proposed algorithm is evaluated on a database of speech utterances mixed with various types of interference. Quantitative comparisons show that the algorithm significantly outperforms existing ones.

 

Contacts :

Carolyn Drake, tel 01 55 20 59 30 ou 06 83 82 68 24, Laboratoire de psychologie expérimentale, Institut de psychologie, Centre universitaire de Boulogne, 71, avenue Edouard Vaillant, 92774 Boulogne-Billancourt Cedex
Daniel Pressnitzer

 

Haut de la page haut