• Publications
  • Influence
Topic modeling with network regularization
TLDR
We formally define the problem of topic modeling with network structure (TMN). Expand
  • 382
  • 31
  • PDF
Mining advisor-advisee relationships from research publication networks
TLDR
Information network contains abundant knowledge about relationships among people or entities. Expand
  • 178
  • 9
  • PDF
A Combination Approach to Web User Profiling
TLDR
In this article, we study the problem of Web user profiling, which is aimed at finding, extracting, and fusing the “semantic”-based user profile from the Web. Expand
  • 132
  • 9
  • PDF
A general optimization framework for smoothing language models on graph structures
TLDR
We propose a general and unified optimization framework for smoothing language models on graph structures. Expand
  • 73
  • 8
  • PDF
Cross-Lingual Latent Topic Extraction
TLDR
In this paper, we propose a new topic model called Probabilistic Cross-Lingual Latent Semantic Analysis (PCLSA) which can be used to mine shared latent topics from unaligned text data in different languages. Expand
  • 76
  • 6
  • PDF
Topic Cube: Topic Modeling for OLAP on Multidimensional Text Databases
TLDR
In this paper, we propose a new data model called topic cube to combine OLAP with probabilistic topic modeling and enable OLAP on the dimension of text data in a multidimensional text database. Expand
  • 98
  • 5
  • PDF
ArnetMiner: An Expertise Oriented Search System for Web Community
TLDR
We present a novel expertise oriented search system for web community, which is available at http://www.arnetminer.org. Expand
  • 42
  • 4
  • PDF
Social Network Extraction of Academic Researchers
TLDR
We propose a unified approach to researcher social network extraction and a constraint-based probabilistic model to perform name disambiguation in integration. Expand
  • 131
  • 3
Structural Topic Model for Latent Topical Structure Analysis
TLDR
We propose a new topic model, Structural Topic Model, which simultaneously discovers topics and reveals the latent topical structures in text through explicitly modeling topical transitions with a latent first-order Markov chain. Expand
  • 26
  • 3
  • PDF
An epigenetic biomarker combination of PCDH17 and POU4F2 detects bladder cancer accurately by methylation analyses of urine sediment DNA in Han Chinese
To develop a routine and effectual procedure of detecting bladder cancer (BlCa), an optimized combination of epigenetic biomarkers that work synergistically with high sensitivity and specificity isExpand
  • 36
  • 2
  • PDF