• Publications
  • Influence
Deep Keyphrase Generation
TLDR
We propose a generative model for keyphrase prediction with an encoder-decoder framework, which can capture the deep semantic meaning of the content with a deep learning method. Expand
Integrating Transformer and Paraphrase Rules for Sentence Simplification
TLDR
We propose a novel model based on a multi-layer and multi-head attention architecture and we pro- pose two innovative approaches to integrate the Simple PPDB (A Paraphrase Database for Simplification), an external paraphrase knowledge base for simplification that covers a wide range of real-world simplification rules. Expand
Generating Diverse Numbers of Diverse Keyphrases
TLDR
We propose a recurrent generative model that generates multiple keyphrases sequentially from a text, with specific modules that promote generation diversity. Expand
Does Order Matter? An Empirical Study on Generating Multiple Keyphrases as a Sequence
TLDR
In this paper, we propose several orderings for concatenation and inspect the important factors for training a successful keyphrase generation model, which can shed light on future research on this line of work. Expand
Automatic classification of citation function by new linguistic features
TLDR
We present some useful features by analyzing and finding unique linguistic patterns in citation context, which can be used for improving the applications of citation analysis. Expand
Knowledge-Based Content Linking for Online Textbooks
TLDR
This paper explores multiple knowledge-based content linking algorithms for connecting online educational resources and their usefulness in linking book subsections. Expand
Exploring Knowledge Learning in Collaborative Information Seeking Process
TLDR
Knowledge learning is recognized as an important component in people's search process. Expand
Towards an integrated clickstream data analysis framework for understanding web users' information behavior
TLDR
This paper provides an integrated framework for information scientists to employ in their exploitation of clickstream data, which could contribute to more comprehensive research on users’information behavior. Expand
Automatic ICD Code Assignment to Medical Text with Semantic Relational Tuples
TLDR
We propose an automatic feature extraction method by means of capturing semantic relational tuples. Expand
Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents
Faceted summarization provides briefings of a document from different perspectives. Readers can quickly comprehend the main points of a long document with the help of a structured outline. However,Expand
...
1
2
...