Learn More
We describe an experimental text-to-speech system that uses information about syntactic constituency, adjacency to a verb, and constituent length to determine prosodic phrasing for synthetic speech. A central goal of our work has been to characterize "discourse neutral" phrasing, i.e. sentence-level phrasing patterns that are independent of discourse(More)
Our goal is to use natural language processing to identify deceptive and non-deceptive passages in transcribed narratives. We begin by motivating an analysis of language-based deception that relies on specific linguistic indicators to discover deceptive statements. The indicator tags are assigned to a document using a mix of automated and manual methods.(More)
Experimental laboratory results, often performed with college student subjects, have proposed several linguistic phenomena as indicative of speaker deception. We have identified a subset of these phenomena that can be formalized as a linguistic model. The model incorporates three classes of language-based deception cues: (1) linguistic devices used to avoid(More)
Preface This guide contains instructions for segmenting discourse that are meant to be self-contained. Although based on a particular theory of discourse structure, they do not make reference in any way to that theory. As a result, they can be used by \naive" subjects, i.e., segmenters who have not studied discourse theory or discourse processing methods.(More)
In automatic speech recognition (ASR) enabled applications for medical dictations, corpora of literal transcriptions of speech are critical for training both speaker independent and speaker adapted acoustic models. Obtaining these transcriptions is both costly and time consuming. Non-literal transcriptions, on the other hand, are easy to obtain because they(More)
While various aspects of syntactic structure have been shown to bear on the determination of phrase-level prosody, the text-to-speech field has lacked a robust working system to test the possible relations between syntax and prosody. We describe an implemented system which uses the deterministic parser Fidditch to create the input for a set of prosody(More)
This paper focuses on some characteristics of disfluencies in human-human (HHI) and human-computer (HCI) interaction corpora to outline similarities and differences. The main variables studied are disfluency rates and prosodic features. Structured, table-like input increases the disfluency rate in HCI and decreases it in HHI. Direct exposure (visibility) to(More)