Share This Author
Embedding Entities and Relations for Learning and Inference in Knowledge Bases
It is found that embeddings learned from the bilinear objective are particularly good at capturing relational semantics and that the composition of relations is characterized by matrix multiplication.
A Diversity-Promoting Objective Function for Neural Conversation Models
This work proposes using Maximum Mutual Information (MMI) as the objective function in neural models, and demonstrates that the proposed MMI models produce more diverse, interesting, and appropriate responses, yielding substantive gains in BLEU scores on two conversational datasets and in human evaluations.
Learning deep structured semantic models for web search using clickthrough data
- Po-Sen Huang, Xiaodong He, Jianfeng Gao, L. Deng, A. Acero, Larry Heck
- Computer ScienceCIKM
- 27 October 2013
A series of new latent semantic models with a deep structure that project queries and documents into a common low-dimensional space where the relevance of a document given a query is readily computed as the distance between them are developed.
MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition
A benchmark task to recognize one million celebrities from their face images, by using all the possibly collected face images of this individual on the web as training data, which could lead to one of the largest classification problems in computer vision.
MS MARCO: A Human Generated MAchine Reading COmprehension Dataset
This new dataset is aimed to overcome a number of well-known weaknesses of previous publicly available datasets for the same task of reading comprehension and question answering, and is the most comprehensive real-world dataset of its kind in both quantity and quality.
DIALOGPT : Large-Scale Generative Pre-training for Conversational Response Generation
It is shown that conversational systems that leverage DialoGPT generate more relevant, contentful and context-consistent responses than strong baseline systems.
Stacked Attention Networks for Image Question Answering
- Zichao Yang, Xiaodong He, Jianfeng Gao, L. Deng, Alex Smola
- Computer ScienceIEEE Conference on Computer Vision and Pattern…
- 7 November 2015
A multiple-layer SAN is developed in which an image is queried multiple times to infer the answer progressively, and the progress that the SAN locates the relevant visual clues that lead to the answer of the question layer-by-layer.
Multi-Task Deep Neural Networks for Natural Language Understanding
A Multi-Task Deep Neural Network (MT-DNN) for learning representations across multiple natural language understanding (NLU) tasks that allows domain adaptation with substantially fewer in-domain labels than the pre-trained BERT representations.
Deep Reinforcement Learning for Dialogue Generation
- Jiwei Li, Will Monroe, Alan Ritter, Dan Jurafsky, Michel Galley, Jianfeng Gao
- Computer ScienceEMNLP
- 5 June 2016
This work simulates dialogues between two virtual agents, using policy gradient methods to reward sequences that display three useful conversational properties: informativity, non-repetitive turns, coherence, and ease of answering.
On the Variance of the Adaptive Learning Rate and Beyond
This work identifies a problem of the adaptive learning rate, suggests warmup works as a variance reduction technique, and proposes RAdam, a new variant of Adam, by introducing a term to rectify the variance of theadaptive learning rate.