Use this identifier to cite or link to this record: https://hdl.handle.net/10316/102726
Title: Broad phonetic class definition driven by phone confusions
Author: Lopes, Carla
Perdigão, Fernando
Keywords: Confusion Matrix; Conditional Random Field; Frame Error Rate; Discriminative Training; Context Window
Date: 2012
Project: FCT - PhD Grant (SFRH/BD/27966/2006)
Journal, periodical, book or event title: EURASIP Journal on Advances in Signal Processing
Volume: 2012
Issue: 1
Abstract: Intermediate representations between the speech signal and phones may be used to improve discrimination among phones that are often confused. These representations are usually found according to broad phonetic classes, which are defined by a phonetician. This article proposes an alternative data-driven method to generate these classes. Phone confusion information from the analysis of the output of a phone recognition system is used to find clusters at high risk of mutual confusion. A metric is defined to compute the distance between phones. The results, using TIMIT data, show that the proposed confusion-driven phone clustering method is an attractive alternative to the approaches based on human knowledge. A hierarchical classification structure to improve phone recognition is also proposed using a discriminative weight training method. Experiments show improvements in phone recognition on the TIMIT database compared to a baseline system.
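
The abstract does not state the distance metric or the clustering procedure, so the following is only a minimal sketch of the general idea of confusion-driven clustering, not the authors' method. It assumes a symmetrized confusion probability as the similarity between two phones and uses average-linkage agglomerative merging; the function names (`confusion_distance`, `cluster_phones`) and the toy counts are invented for illustration and are not TIMIT results.

```python
from itertools import combinations


def confusion_distance(counts):
    """Build a symmetric distance matrix from confusion counts.

    counts[i][j] = number of times reference phone i was recognized as j.
    ASSUMPTION: distance = 1 - mean of the two conditional confusion
    probabilities. The metric defined in the article may differ.
    """
    n = len(counts)
    probs = []
    for row in counts:
        total = sum(row)
        probs.append([c / total if total else 0.0 for c in row])
    dist = [[0.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        similarity = 0.5 * (probs[i][j] + probs[j][i])
        dist[i][j] = dist[j][i] = 1.0 - similarity
    return dist


def cluster_phones(phones, counts, n_classes):
    """Merge the closest clusters (average linkage) until n_classes remain."""
    dist = confusion_distance(counts)
    clusters = [[i] for i in range(len(phones))]
    while len(clusters) > n_classes:
        def average_link(pair):
            a, b = pair
            total = sum(dist[i][j] for i in clusters[a] for j in clusters[b])
            return total / (len(clusters[a]) * len(clusters[b]))

        # Pick the pair of clusters whose members are most often confused.
        a, b = min(combinations(range(len(clusters)), 2), key=average_link)
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]  # b > a, so index a is unaffected
    return [sorted(phones[i] for i in cluster) for cluster in clusters]


# Toy confusion counts, invented for illustration only (not from the article).
phones = ["p", "b", "s", "z"]
counts = [
    [80, 15, 3, 2],
    [12, 82, 2, 4],
    [1, 2, 85, 12],
    [2, 3, 20, 75],
]
broad_classes = cluster_phones(phones, counts, n_classes=2)
print(broad_classes)
```

With these toy counts the stops and the fricatives end up in separate classes, which mirrors the idea of grouping phones that are at high risk of mutual confusion.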
URI: https://hdl.handle.net/10316/102726
ISSN: 1687-6180
DOI: 10.1186/1687-6180-2012-158
Rights: openAccess
Appears in collections: I&D IT - Artigos em Revistas Internacionais
FCTUC Eng.Electrotécnica - Artigos em Revistas Internacionais

This record is protected by a Creative Commons License.