Dissertation/Thesis Abstract

Automatic detection and classification of prosodic events
by Rosenberg, Andrew, Ph.D., Columbia University, 2009, 381; 3393597
Abstract (Summary)

Prosody, or intonation, is a critically important component of spoken communication. The automatic extraction of prosodic information is necessary for machines to process speech with human levels of proficiency. In this thesis we describe work on the automatic detection and classification of prosodic events—specifically, pitch accents and prosodic phrase boundaries. We present novel techniques, feature representations and state of the art performance in each of these tasks. We also present three proof-of-concept applications—speech summarization, story segmentation and non-native speech assessment—showing that access to hypothesized prosodic event information can be used to improve the performance of downstream spoken language processing tasks. We believe the contributions of this thesis advance the understanding of prosodic events and the use of prosody in spoken language processing towards the goal of human-like processing of speech by machines.

Indexing (document details)
Advisor: Herschberg, Julia
School: Columbia University
School Location: United States -- New York
Source: DAI-B 71/02, Dissertation Abstracts International
Subjects: Computer science
Keywords: Automatic detection, Natural language processing, Prosodic events, Prosody
Publication Number: 3393597
ISBN: 978-1-109-60472-6
Copyright © 2021 ProQuest LLC. All rights reserved. Terms and Conditions Privacy Policy Cookie Policy