• Publications
  • Influence
Multi-News: a Large-Scale Multi-Document Summarization Dataset and Abstractive Hierarchical Model
TLDR
This paper introduces Multi-News, the first large-scale MDS news dataset, and proposes an end-to-end model which incorporates a traditional extractive summarization model with a standard SDS model and achieves competitive results on MDS datasets. Expand
ScisummNet: A Large Annotated Corpus and Content-Impact Models for Scientific Paper Summarization with Citation Networks
TLDR
The first large-scale manually-annotated corpus for scientific papers is developed and released by enabling faster annotation and summarization methods that integrate the authors’ original highlights and the article’s actual impacts on the community are proposed, to create comprehensive, hybrid summaries. Expand
Medical Text Classification using Convolutional Neural Networks
TLDR
An approach to automatically classify clinical text at a sentence level using deep convolutional neural networks to represent complex features and outperforms several approaches widely used in natural language processing tasks by about 15%. Expand
What Should I Learn First: Introducing LectureBank for NLP Education and Prerequisite Chain Learning
TLDR
LectureBank is introduced, a dataset containing 1,352 English lecture files collected from university courses which are each classified according to an existing taxonomy as well as 208 manually-labeled prerequisite relation topics, which is publicly available. Expand
CONSIDERING TRAVELERS' RISK-TAKING BEHAVIOR IN DYNAMIC TRAFFIC ASSIGNMENT
Dynamic traffic assignment (DTA) has been a topic of substantial research during the past decade and the DTA models offer the potential to support effective evaluation and operation AdvancedExpand
TutorialBank: A Manually-Collected Corpus for Prerequisite Chains, Survey Extraction and Resource Recommendation
TLDR
This work introduces TutorialBank, a new, publicly available dataset which aims to facilitate NLP education and research and is notably the largest manually-picked corpus of resources intended for N LP education which does not include only academic papers. Expand
The Genome of the Beluga Whale (Delphinapterus leucas)
TLDR
The genome of the beluga whale was determined using DNA sequencing approaches that employed both microfluidic partitioning library and non-partitioned library construction to aid the understanding of the functional elements. Expand
Digital Gene Expression by Tag Sequencing on the Illumina Genome Analyzer
TLDR
This unit provides a protocol for performing digital gene expression profiling on the Illumina Genome Analyzer sequencing platform that increases utility while reducing both the cost and time required to generate gene expression profiles. Expand
A Social Bookmarking-Based People Search Service Building Communities of Practice with Collective Intelligence
TLDR
This paper attempts to propose an effective way to locate people with shared interests by using Internet resources bookmarked by the users, so that the similarity of interests between them can be analyzed. Expand
...
1
2
...