Cross-Linguistic Study of the Production of Turn-Taking Cues in American English and Argentine Spanish
@inproceedings{Brusco2017CrossLinguisticSO, title={Cross-Linguistic Study of the Production of Turn-Taking Cues in American English and Argentine Spanish}, author={Pablo Brusco and Juan Manuel P{\'e}rez and Agust{\'i}n Gravano}, booktitle={INTERSPEECH}, year={2017} }
We present the results of a series of machine learning experiments aimed at exploring the differences and similarities in the production of turn-taking cues in American English and Argentine Spanish. An analysis of prosodic features automatically extracted from 21 dyadic conversations (12 En, 9 Sp) revealed that, when signaling Holds, speakers of both languages tend to use roughly the same combination of cues, characterized by a sustained final intonation, a shorter duration of turn-final…
8 Citations
Neural correlates of spoken language processing
- Psychology
- 2017
In conversation, humans normally exchange turns in swift coordination, with few silences and overlaps, due partly to the existence of turn-yielding cues: acoustic, prosodic, lexical or syntactic…
The interplay of pragmatics and politeness in intonation
- Linguistics
- 2018
booklet The effect of dialectal tonal variation on lexical access and sentence comprehension Yiya Chen, Leiden University There has been a wealth of linguistic data explaining dialectal variation in…
A Conversation-Analytic Annotation of Turn-Taking Behavior in Japanese Multi-Party Conversation and its Preliminary Analysis
- LinguisticsLREC
- 2020
The results show that the above four-way distinction is essential to account for the distributions of the syntactic and prosodic features, suggesting the insufficiency of previous turn-taking models that do not consider the distinction between i and ii or between ii and iii.
Turn-Taking Predictions across Languages and Genres Using an LSTM Recurrent Neural Network
- Computer Science2018 IEEE Spoken Language Technology Workshop (SLT)
- 2018
An improved recurrent network model is presented that outperforms [1] and does so without requiring lexical annotation, providing good results in turn-taking prediction for English, Spanish, Japanese, Mandarin and French.
Prediction of Turn-taking Using Multitask Learning with Prediction of Backchannels and Fillers
- Computer ScienceINTERSPEECH
- 2018
Evaluations with two kinds of dialogue corpora of human-robot interaction demonstrate that the proposed multitask learning scheme outperforms the conventional single-task learning.
Turn-Taking Prediction Based on Detection of Transition Relevance Place
- Computer ScienceINTERSPEECH
- 2019
This study proposes taking into account the concept of the transition relevance place (TRP) for turn-taking prediction, and conducts annotation of TRP on a human-robot dialogue corpus, ensuring the objectivity of this annotation among annotators.
A protocol for collecting speech data with varying degrees of trust
- Computer ScienceWorkshop on Speech, Music and Mind (SMM 2019)
- 2019
A preliminary experiment using a text-only version of the protocol in Argentine Spanish shows that the protocol effectively succeeds in influencing subjects into the desired mental state of either trusting or distrusting the agent’s skills.
Voice Quality as a Turn-Taking Cue
- SociologyINTERSPEECH
- 2019
This work revisits the idea that voice quality dynamics (VQ) contributes to conveying pragmatic distinctions, with two case studies to further test this idea. First, we explore VQ as a turn-taking ...
References
SHOWING 1-10 OF 18 REFERENCES
An Analysis of Turn-Taking and Backchannels Based on Prosodic and Syntactic Features in Japanese Map Task Dialogs
- Computer ScienceLanguage and speech
- 1998
It is shown that in both turn-taking and backchannels, some instances of syntactic features make extremely strong contributions, and in general, syntax has a stronger contribution than any individual prosodic feature, although the whole prosody contributes as strongly as, or even more strongly than, syntax.
Who Do You Think Will Speak Next? Perception of Turn-Taking Cues in Slovak and Argentine Spanish
- PsychologyINTERSPEECH
- 2016
Results of a perception study indicate that 1) in addition to textual cues, prosodic cues also play a clear role in perceiving how the dialogue will unfold; and 2) there exists a non-empty intersection of temporal and intonational prosodic turn-taking cues in the two languages, despite their belonging to separate families.
Affirmative Cue Words in Task-Oriented Dialogue
- PsychologyCL
- 2011
It is found that contextual information and final intonation figure as the most salient cues to automatic disambiguation of affirmative cue words, a family of cue words that speakers use frequently in conversation.
Universals and cultural variation in turn-taking in conversation
- SociologyProceedings of the National Academy of Sciences
- 2009
The empirical evidence suggests robust human universals in this domain, where local variations are quantitative only, pointing to a single shared infrastructure for language use with likely ethological foundations.
From reaction to prediction: experiments with computational models of turn-taking
- Computer ScienceINTERSPEECH
- 2006
This paper describes an experiment that shows human performance at prediction of turn-taking decisions and shows that a model automatically induced from data can reach a similar level of performance and describes a series of experiments on spontaneous dialogue data where they combine pause thresholds with syntactic and prosodic information to make turn- taking decisions.
Keeping the Floor in Multiparty Conversations: Intonation, Syntax, and Pause
- Linguistics
- 2003
This statistical investigation of intonation, syntax, and pause in natural conversation assesses how these variables are combined and manipulated to achieve turn-taking goals. In general, the…
Is the speaker done yet? faster and more accurate end-of-utterance detection using prosody
- Computer ScienceINTERSPEECH
- 2002
This work develops a new approach to EOU detection that uses prosodic features, modeled by decision trees and combined with an event N-gram language model to obtain a score that measures the likelihood that any nonspeech region is an EOU.