A new text representation scheme combining Bag-of-Words and Bag-of-Concepts approaches for automatic text classification

@article{Alahmadi2013ANT,
  title={A new text representation scheme combining Bag-of-Words and Bag-of-Concepts approaches for automatic text classification},
  author={Alaa Y. Alahmadi and Arash Joorabchi and Abdulhussain E. Mahdi},
  journal={2013 7th IEEE GCC Conference and Exhibition (GCC)},
  year={2013},
  pages={108-113}
}
  • Alaa Y. Alahmadi, Arash Joorabchi, Abdulhussain E. Mahdi
  • Published in
    7th IEEE GCC Conference and…
    2013
  • Computer Science
  • This paper introduces a new approach to creating text representations and apply it to a standard text classification collections. The approach is based on supplementing the well-known Bag-of-Words (BOW) representational scheme with a concept-based representation that utilises Wikipedia as a knowledge base. The proposed representations are used to generate a Vector Space Model, which in turn is fed into a Support Vector Machine classifier to categorise a collection of textual documents from two… CONTINUE READING

    Create an AI-powered research feed to stay up to date with new papers like this posted to ArXiv

    Citations

    Publications citing this paper.
    SHOWING 1-10 OF 12 CITATIONS

    Automatic Classifying Self-Admitted Technical Debt Using N-Gram IDF

    VIEW 7 EXCERPTS
    CITES METHODS
    HIGHLY INFLUENCED

    A Novel Ensemble Representation Learning method for Document Classification

    VIEW 1 EXCERPT
    CITES BACKGROUND

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 13 REFERENCES

    UsingWikipedia knowledge to improve text classification

    • P.Wang, J.Hu, H-J. Zeng, Z. Chen
    • Knowledge and Information Systems, vol. 19:pp
    • 2009
    VIEW 19 EXCERPTS
    HIGHLY INFLUENTIAL

    Stummee. Wordnet improves text document clustering

    • A. Hotho, S. Staab
    • SIGIR :Workshop on Semantic Web,
    • 2003
    VIEW 5 EXCERPTS
    HIGHLY INFLUENTIAL

    WordNet: a lexical database for English

    VIEW 5 EXCERPTS
    HIGHLY INFLUENTIAL

    Enhancing text clustering by leveraging Wikipedia semantics

    VIEW 15 EXCERPTS
    HIGHLY INFLUENTIAL

    An open-source toolkit for mining Wikipedia

    VIEW 2 EXCERPTS

    A Tutorial on Support Vector Machines for Pattern Recognition

    VIEW 2 EXCERPTS