Tag-Weighted Topic Model For Large-scale Semi-Structured Documents

  title={Tag-Weighted Topic Model For Large-scale Semi-Structured Documents},
  author={Shuangyin Li and Jiefei Li and Guan Huang and Ruiyang Tan and Rong Pan},
To date, there have been massive Semi-Structured Documents (SSDs) during the evolution of the Internet. These SSDs contain both unstructured features (e.g., plain text) and metadata (e.g., tags). Most previous works focused on modeling the unstructured text, and recently, some other methods have been proposed to model the unstructured text with specific tags. To build a general model for SSDs remains an important problem in terms of both model fitness and efficiency. We propose a novel method… CONTINUE READING


Publications citing this paper.

A survey of tag-based information retrieval

International Journal of Multimedia Information Retrieval • 2016
View 7 Excerpts
Highly Influenced

Similar Papers

Loading similar papers…