Dissertation/Thesis Abstract

Text-independent, automatic speaker recognition system evaluation with males speaking both Arabic and English
by Alamri, Safi S., M.A., University of Colorado at Denver, 2015, 61; 1605087
Abstract (Summary)

Automatic speaker recognition is an important key to speaker identification in media forensics and with the increase of cultures mixing, there’s an increase in bilingual speakers all around the world. The purpose of this thesis is to compare text-independent samples of one person using two different languages, Arabic and English, against a single language reference population. The hope is that a design can be started that may be useful in further developing software that can complete accurate text-independent ASR for bilingual speakers speaking either language against a single language reference population. This thesis took an Arabic model sample and compared it against samples that were both Arabic and English using and an Arabic reference population, all collected from videos downloaded from the Internet. All of the samples were text-independent and enhanced to optimal performance. The data was run through a biometric software called BATVOX 4.1, which utilizes the MFCCs and GMM methods of speaker recognition and identification. The result of testing through BATVOX 4.1 was likelihood ratios for each sample that were evaluated for similarities and differences, trends, and problems that had occurred.

Indexing (document details)
Advisor: Grigoras, Catalin
Commitee: Gaston, Leslie, Grigoras, Catalin, Smith, Jeff M.
School: University of Colorado at Denver
Department: Recording Arts
School Location: United States -- Colorado
Source: MAI 55/02M(E), Masters Abstracts International
Subjects: Multimedia Communications
Publication Number: 1605087
ISBN: 978-1-339-30494-6
Copyright © 2021 ProQuest LLC. All rights reserved. Terms and Conditions Privacy Policy Cookie Policy