Abstractive Summarization of Reddit Posts with Multi-level Memory Networks
- Byeongchang Kim, Hyunwoo Kim, Gunhee Kim
- Computer ScienceNorth American Chapter of the Association for…
- 2 November 2018
This work collects Reddit TIFU dataset, consisting of 120K posts from the online discussion forum Reddit, and proposes a novel abstractive summarization model named multi-level memory networks (MMN), equipped with multi- level memory to store the information of text from different levels of abstraction.
KLUE: Korean Language Understanding Evaluation
- Sungjoon Park, Jihyung Moon, Kyunghyun Cho
- Computer ScienceNeurIPS Datasets and Benchmarks
- 20 May 2021
This work introduces Korean Language Understanding Evaluation (KLUE), a collection of 8 Korean natural language understanding (NLU) tasks, including Topic Classification, SemanticTextual Similarity, Natural Language Inference, Named Entity Recognition, Relation Extraction, Dependency Parsing, Machine Reading Comprehension, and Dialogue State Tracking, and provides suitable evaluation metrics and fine-tuning recipes for pretrained language models for each task.
Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes
- Hyunwoo Kim, Byeongchang Kim, Gunhee Kim
- Computer ScienceConference on Empirical Methods in Natural…
- 18 September 2021
Taking inspiration from social cognition, a generative estimator is used to infer emotion cause words from utterances with no word-level label and a novel method based on pragmatics is introduced to make dialogue models focus on targeted words in the input during generation.
Curiosity-Bottleneck: Exploration By Distilling Task-Specific Novelty
- Youngjin Kim, Wontae Nam, Hyunwoo Kim, Ji-Hoon Kim, Gunhee Kim
- Computer ScienceInternational Conference on Machine Learning
- 24 May 2019
This work shows that Curiosity-Bottleneck learns an effective exploration strategy by robustly measuring the state novelty in distractive environments where state-of-the-art exploration methods often degenerate.
ProsocialDialog: A Prosocial Backbone for Conversational Agents
- Hyunwoo Kim, Youngjae Yu, Maarten Sap
- Computer ScienceConference on Empirical Methods in Natural…
- 25 May 2022
This work introduces ProsocialDialog, the first large-scale multi-turn dialogue dataset to teach conversational agents to respond to problematic content following social norms, and introduces a dialogue safety detection module, Canary, capable of generating RoTs given conversational context, and a socially-informed dialogue agent, Prost.
Will I Sound like Me? Improving Persona Consistency in Dialogues through Pragmatic Self-Consciousness
- Hyunwoo Kim, Byeongchang Kim, Gunhee Kim
- Computer ScienceConference on Empirical Methods in Natural…
- 13 April 2020
Inspired by social cognition and pragmatics, existing dialogue agents are endow with public self-consciousness on the fly through an imaginary listener to enforce dialogue agents to refrain from uttering contradiction and improve consistency of existing dialogue models.
SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization
- Hyunwoo Kim, Jack Hessel, Yejin Choi
- Computer SciencearXiv.org
- 20 December 2022
This work presents SODA: the first publicly available, million-scale high-quality social dialogue dataset, and trains COSMO: a generalizable conversation model that is significantly more natural and consistent on unseen datasets than best-performing conversation models (e.g., GODEL, BlenderBot-1, Koala, Vicuna).
How Robust are Fact Checking Systems on Colloquial Claims?
- Byeongchang Kim, Hyunwoo Kim, Seokhee Hong, Gunhee Kim
- Computer ScienceNorth American Chapter of the Association for…
- 1 June 2021
It is found that existing fact checking systems that perform well on claims in formal style significantly degenerate on colloquial claims with the same semantics, and it is shown that document retrieval is the weakest spot in the system even vulnerable to filler words, such as “yeah” and “you know”.
Public Self-consciousness for Endowing Dialogue Agents with Consistent Persona
- Hyunwoo Kim, Byeongchang Kim, Gunhee Kim
- Computer SciencearXiv.org
- 13 April 2020
This approach, based on the Rational Speech Acts framework, attempts to maintain consistency in an unsupervised manner requiring neither additional annotations nor pretrained external models to improve consistency in dialogue agents.