Cross-Linguistic Study of the Production of Turn-Taking Cues in American English and Argentine Spanish

@inproceedings{Brusco2017CrossLinguisticSO,
  title={Cross-Linguistic Study of the Production of Turn-Taking Cues in American English and Argentine Spanish},
  author={Pablo Brusco and Juan Manuel P{\'e}rez and Agust{\'i}n Gravano},
  booktitle={INTERSPEECH},
  year={2017}
}
We present the results of a series of machine learning experiments aimed at exploring the differences and similarities in the production of turn-taking cues in American English and Argentine Spanish. An analysis of prosodic features automatically extracted from 21 dyadic conversations (12 En, 9 Sp) revealed that, when signaling Holds, speakers of both languages tend to use roughly the same combination of cues, characterized by a sustained final intonation, a shorter duration of turn-final… 

Figures and Tables from this paper

Neural correlates of spoken language processing
In conversation, humans normally exchange turns in swift coordination, with few silences and overlaps, due partly to the existence of turn-yielding cues: acoustic, prosodic, lexical or syntactic
The interplay of pragmatics and politeness in intonation
booklet The effect of dialectal tonal variation on lexical access and sentence comprehension Yiya Chen, Leiden University There has been a wealth of linguistic data explaining dialectal variation in
A Conversation-Analytic Annotation of Turn-Taking Behavior in Japanese Multi-Party Conversation and its Preliminary Analysis
TLDR
The results show that the above four-way distinction is essential to account for the distributions of the syntactic and prosodic features, suggesting the insufficiency of previous turn-taking models that do not consider the distinction between i and ii or between ii and iii.
Turn-Taking Predictions across Languages and Genres Using an LSTM Recurrent Neural Network
TLDR
An improved recurrent network model is presented that outperforms [1] and does so without requiring lexical annotation, providing good results in turn-taking prediction for English, Spanish, Japanese, Mandarin and French.
Prediction of Turn-taking Using Multitask Learning with Prediction of Backchannels and Fillers
TLDR
Evaluations with two kinds of dialogue corpora of human-robot interaction demonstrate that the proposed multitask learning scheme outperforms the conventional single-task learning.
Turn-Taking Prediction Based on Detection of Transition Relevance Place
TLDR
This study proposes taking into account the concept of the transition relevance place (TRP) for turn-taking prediction, and conducts annotation of TRP on a human-robot dialogue corpus, ensuring the objectivity of this annotation among annotators.
A protocol for collecting speech data with varying degrees of trust
TLDR
A preliminary experiment using a text-only version of the protocol in Argentine Spanish shows that the protocol effectively succeeds in influencing subjects into the desired mental state of either trusting or distrusting the agent’s skills.
Voice Quality as a Turn-Taking Cue
This work revisits the idea that voice quality dynamics (VQ) contributes to conveying pragmatic distinctions, with two case studies to further test this idea. First, we explore VQ as a turn-taking ...

References

SHOWING 1-10 OF 18 REFERENCES
The additive effect of turn-taking cues in human and synthetic voice
An Analysis of Turn-Taking and Backchannels Based on Prosodic and Syntactic Features in Japanese Map Task Dialogs
TLDR
It is shown that in both turn-taking and backchannels, some instances of syntactic features make extremely strong contributions, and in general, syntax has a stronger contribution than any individual prosodic feature, although the whole prosody contributes as strongly as, or even more strongly than, syntax.
Who Do You Think Will Speak Next? Perception of Turn-Taking Cues in Slovak and Argentine Spanish
TLDR
Results of a perception study indicate that 1) in addition to textual cues, prosodic cues also play a clear role in perceiving how the dialogue will unfold; and 2) there exists a non-empty intersection of temporal and intonational prosodic turn-taking cues in the two languages, despite their belonging to separate families.
Turn-taking cues in task-oriented dialogue
Affirmative Cue Words in Task-Oriented Dialogue
TLDR
It is found that contextual information and final intonation figure as the most salient cues to automatic disambiguation of affirmative cue words, a family of cue words that speakers use frequently in conversation.
Universals and cultural variation in turn-taking in conversation
TLDR
The empirical evidence suggests robust human universals in this domain, where local variations are quantitative only, pointing to a single shared infrastructure for language use with likely ethological foundations.
From reaction to prediction: experiments with computational models of turn-taking
TLDR
This paper describes an experiment that shows human performance at prediction of turn-taking decisions and shows that a model automatically induced from data can reach a similar level of performance and describes a series of experiments on spontaneous dialogue data where they combine pause thresholds with syntactic and prosodic information to make turn- taking decisions.
Keeping the Floor in Multiparty Conversations: Intonation, Syntax, and Pause
This statistical investigation of intonation, syntax, and pause in natural conversation assesses how these variables are combined and manipulated to achieve turn-taking goals. In general, the
Is the speaker done yet? faster and more accurate end-of-utterance detection using prosody
TLDR
This work develops a new approach to EOU detection that uses prosodic features, modeled by decision trees and combined with an event N-gram language model to obtain a score that measures the likelihood that any nonspeech region is an EOU.
...
...