Corpus ID: 18779357

Tag-Weighted Topic Model For Large-scale Semi-Structured Documents

@article{Li2015TagWeightedTM,
  title={Tag-Weighted Topic Model For Large-scale Semi-Structured Documents},
  author={Shuangyin Li and Jiefei Li and Guan Huang and Ruiyang Tan and Rong Pan},
  journal={ArXiv},
  year={2015},
  volume={abs/1507.08396}
}
  • Shuangyin Li, Jiefei Li, +2 authors Rong Pan
  • Published in ArXiv 2015
  • Computer Science, Mathematics
  • To date, there have been massive Semi-Structured Documents (SSDs) during the evolution of the Internet. These SSDs contain both unstructured features (e.g., plain text) and metadata (e.g., tags). Most previous works focused on modeling the unstructured text, and recently, some other methods have been proposed to model the unstructured text with specific tags. To build a general model for SSDs remains an important problem in terms of both model fitness and efficiency. We propose a novel method… CONTINUE READING

    Create an AI-powered research feed to stay up to date with new papers like this posted to ArXiv

    2
    Twitter Mentions

    Citations

    Publications citing this paper.
    SHOWING 1-2 OF 2 CITATIONS

    A survey of tag-based information retrieval

    VIEW 7 EXCERPTS
    CITES BACKGROUND
    HIGHLY INFLUENCED

    Recurrent Attentional Topic Model

    VIEW 1 EXCERPT
    CITES METHODS

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 34 REFERENCES

    Correlated Topic Models

    VIEW 10 EXCERPTS
    HIGHLY INFLUENTIAL

    Latent Dirichlet Allocation

    VIEW 10 EXCERPTS
    HIGHLY INFLUENTIAL

    The Author-Topic Model for Authors and Documents

    VIEW 12 EXCERPTS
    HIGHLY INFLUENTIAL

    Modeling annotated data

    VIEW 7 EXCERPTS
    HIGHLY INFLUENTIAL