MIDAS: A Dialog Act Annotation Scheme for Open Domain HumanMachine Spoken Conversations

Dian Yu, Zhou Yu
Dialog act prediction in open-domain conversations is an essential language comprehension task for both dialog system building and discourse analysis. Previous dialog act schemes, such as SWBD-DAMSL, are designed mainly for discourse analysis in human-human conversations. In this paper, we present a dialog act annotation scheme, MIDAS (Machine Interaction Dialog Act Scheme), targeted at open-domain human-machine conversations. MIDAS is designed to assist machines to improve their ability to… 

FlowEval: A Consensus-Based Dialogue Evaluation Framework Using Segment Act Flows

This work proposes the segment act, an extension of the dialog act from utterance level to segment level, crowdsources a large-scale dataset for it, and develops FlowEval, the first consensus-based dialogue evaluation framework, which provides a reference-free approach to dialog evaluation by using pseudo-references.

Filling Conversation Ellipsis for Better Social Dialog Understanding

This work proposes a method which considers both the original utterance that has ellipsis and the automatically completed utterance in dialog act and semantic role labeling tasks, and combines the prediction results from these two utterances using a selection model that is guided by expert knowledge.

A Transfer Learning Approach for Dialogue Act Classification of GitHub Issue Comments

This paper presents a transfer learning approach for performing dialogue act classification on issue comments, and compares the performance of several word and sentence level encoding models including Global Vectors for Word Representations, Universal Sentence Encoder, and Bidirectional Encoder Representations from Transformers.

Simulated Chats for Task-oriented Dialog: Learning to Generate Conversations from Instructions

This paper presents a data creation strategy that uses the pre-trained language model GPT2 (Radford et al. 2018) to simulate the interaction between crowd-sourced workers by creating a user bot and an agent bot, achieving significant improvements both in the low-resource setting and in overall task performance.

HERALD: An Annotation Efficient Method to Detect User Disengagement in Social Conversations

This work proposes HERALD, an efficient annotation framework that reframes the training data annotation process as a denoising problem; it improves annotation efficiency significantly and achieves 86% user disengagement detection accuracy in two dialog corpora.

A Multi-Dimensional, Cross-Domain and Hierarchy-Aware Neural Architecture for ISO-Standard Dialogue Act Tagging

This work proposes a neural architecture to increase classification accuracy, especially on low-frequency fine-grained tags; it takes advantage of the hierarchical structure of the ISO taxonomy and utilises syntactic information in the form of Part-Of-Speech and dependency tags, in addition to contextual information from previous turns.

Modeling Performance in Open-Domain Dialogue with PARADISE

A PARADISE model is developed for predicting the performance of Athena, a dialogue system that has participated in thousands of conversations with real users, while competing as a finalist in the Alexa Prize.

What Would a Teacher Do? Predicting Future Talk Moves

This paper introduces a new task, called future talk move prediction (FTMP), which consists of predicting the next talk move – an utterance strategy from APT – given a conversation history with its corresponding talk moves and introduces a neural network model for this task, which outperforms multiple baselines by a large margin.

Neural Generation Meets Real People: Building a Social, Informative Open-Domain Dialogue Agent

Aiming to be both informative and conversational, the Chirpy Cardinal bot chats with users in an authentic, emotionally intelligent way by integrating controlled neural generation with scaffolded, hand-written dialogue, producing an engaging and socially fluent experience.

Corpus with Speech Function Annotation: Challenges, Advantages, and Limitations

Creating a corpus labeled with dialog acts is one of the most difficult tasks in corpus linguistics. Dialog acts can reflect various aspects of the utterances in the dialogues, but most often

ISO-Standard Domain-Independent Dialogue Act Tagging for Conversational Agents

This paper proposes a methodology to map several publicly available corpora to a subset of the ISO standard, in order to create a large task-independent training corpus for DA classification, and shows the feasibility of using this corpus to train a domain-independent DA tagger by testing it on out-of-domain conversational data.

Dialogue act modeling for automatic tagging and recognition of conversational speech

A probabilistic integration of speech recognition with dialogue modeling is developed, to improve both speech recognition and dialogue act classification accuracy.

Coding Dialogs with the DAMSL Annotation Scheme

The slight revisions to DAMSL discussed here should increase accuracy on the next set of tests and produce a reliable, flexible, and comprehensive utterance annotation scheme.

Contextual Topic Modeling For Dialog Systems

This work extends previous work on neural topic classification and unsupervised topic keyword detection by incorporating conversational context and dialog act features, and shows that topical metrics such as topical depth are highly correlated with dialog evaluation metrics such as coherence and engagement, implying that conversational topic models can predict user satisfaction.

Transfer of Corpus-Specific Dialogue Act Annotation to ISO Standard: Is it worth it?

Compared to the legacy annotation scheme, on the Italian LUNA Human-Human corpus, the DiAML annotation scheme exhibits better cross-domain and data aggregation classification performance, while maintaining comparable in-domain performance.

DATE: A Dialogue Act Tagging Scheme for Evaluation of Spoken Dialogue Systems

This paper describes a dialogue act tagging scheme developed for the purpose of providing finer-grained quantitative dialogue metrics for comparing and evaluating DARPA COMMUNICATOR spoken dialogue systems and suggests that dialogue act metrics can ultimately support more focused qualitative analysis of the role of various dialogue strategy parameters.

The Influence of Context on Dialogue Act Recognition

A deep analysis of the influence of context information on dialogue act recognition, using an event-based classification approach with SVMs instead of the more common sequential approaches, such as HMMs.

Switchboard SWBD-DAMSL shallow-discourse-function annotation coders manual

When the authors speak about discourse or conversational knowledge, they can describe a conversation in terms of the high-level goals and plans of the participants and model sociolinguistic facts about conversation structure, such as how participants might expect one type of conversational unit to be responded to by another.

"I Like Your Shirt" - Dialogue Acts for Enabling Social Talk in Conversational Agents

The dialogue act set and the sequences are used in the dialogue system to provide a more knowledge-driven treatment of small talk than chatbots can offer.

Exploiting Sentence and Context Representations in Deep Neural Models for Spoken Language Understanding

This paper presents a deep learning architecture for the semantic decoder component of a Statistical Spoken Dialogue System that uses unaligned semantic annotations and distributed semantic representation learning to overcome the limitations of explicit delexicalisation.