• Publications
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
MOTIVATION: Biomedical text mining is becoming increasingly important as the number of biomedical documents rapidly grows. With the progress in natural language processing, extracting valuable...
  • 453
  • 117
  • Open Access
Self-Attention Graph Pooling
Advanced methods of applying deep learning to structured data such as graphs have been proposed in recent years. In particular, studies have focused on generalizing convolutional neural networks to...
  • 100
  • 29
  • Open Access
Evaluating window joins over unbounded streams
We investigate algorithms for evaluating sliding window joins over pairs of unbounded streams. We introduce a unit-time-basis cost model to analyze the expected performance of these algorithms. Using...
  • 404
  • 28
  • Open Access
On schema matching with opaque column names and data values
Most previous solutions to the schema matching problem rely in some fashion upon identifying "similar" column names in the schemas to be matched, or by recognizing common domains in the data stored...
  • 254
  • 25
  • Open Access
Catching the boat with Strudel: experiences with a Web-site management system
The Strudel system applies concepts from database management systems to the process of building Web sites. Strudel's key idea is separating the management of the site's data, the creation and...
  • 365
  • 21
  • Open Access
Comparative study of name disambiguation problem using a scalable blocking-based framework
In this paper, we consider the problem of ambiguous author names in bibliographic citations, and comparatively study alternative approaches to identify and correct such name variants (e.g., "Vannevar...
  • 117
  • 11
  • Open Access
The Niagara Internet Query System
Many projections envision a future in which the Internet is populated with a vast number of Web-accessible XML files, a “World-Wide Database”. Recently, there has been a great deal of research into...
  • 222
  • 8
  • Open Access
STRUDEL: a Web site management system
The growth of the World-Wide Web has created a new kind of data management problem: building and maintaining Web sites. Building a Web site involves several tasks, such as choosing what information...
  • 162
  • 8
  • Open Access
Effective and scalable solutions for mixed and split citation problems in digital libraries
In this paper, we consider two important problems that commonly occur in bibliographic digital libraries, which seriously degrade their data qualities: Mixed Citation (MC) problem (i.e., citations of...
  • 99
  • 8
  • Open Access
Content-Aware Hierarchical Point-of-Interest Embedding Model for Successive POI Recommendation
Recommending a point-of-interest (POI) a user will visit next based on temporal and spatial context information is an important task in mobile-based applications. Recently, several POI recommendation...
  • 42
  • 8
  • Open Access