Hierarchical text categorization and its application to bioinformatics

@inproceedings{Kiritchenko2006HierarchicalTC,
  title={Hierarchical text categorization and its application to bioinformatics},
  author={Svetlana Kiritchenko},
  year={2006}
}
In a hierarchical categorization problem, categories are partially ordered to form a hierarchy. In this dissertation, we explore two main aspects of hierarchical categorization: learning algorithms and performance evaluation. We introduce the notion of consistent hierarchical classification that makes classification results more comprehensible and easily interpretable for end-users. Among the previously introduced hierarchical learning algorithms, only a local top-down approach produces… Expand
Web page classification and hierarchy adaptation
TLDR
This dissertation investigates methods to improve classification from two aspects: feature enhancement and hierarchy adaptation, and investigates problems that impede user navigation in hierarchies, and proposes novel methods to facilitate efficient navigation. Expand
An Empirical Comparison of Flat and Hierarchical Performance Measures for Multi-Label Classification with Hierarchy Extraction
TLDR
Whether HMC measures do significantly improve quality assessment in this setting is verified and a proper measure that reflects the potential quality of extracted hierarchies in the best possible way is sought. Expand
A Study of Hierarchical and Flat Classification of Proteins
TLDR
The results show that exploiting the class hierarchy improves performance on the synthetic data but not in the case of the protein classification problems, and it is recommended that strong flat multiclass methods be used as a baseline to establish the benefit of exploiting class hierarchies in this area. Expand
Hierarchical Partitioning of the Output Space in Multi-label Data
TLDR
This work presents the general setup for a HOMER model and a simple extension of the algorithm that is suited for MLCs that output rankings, and provides a detailed analysis of the properties of the algorithms, both from an aspect of effectiveness and computational complexity. Expand
Multi-label Hierarchical Text Classification using the ACM Taxonomy
Many of the works of text classification involve t he attribution of each text a single class label from a predefined se t of classes, usually small and flat organized (flat classification). However,Expand
Grouping by association: using associative networks for document categorization
TLDR
This thesis describes a method of using associative networks for automatic document grouping, and extends the method to enable application to multilingual document libraries and compares it to other state-of- the-art methods for document grouping. Expand
Hybrid Intelligent Technique for Text Categorization
  • A. T. Sadiq, S. Abdullah
  • Computer Science
  • 2012 International Conference on Advanced Computer Science Applications and Technologies (ACSAT)
  • 2012
TLDR
The proposed system consists of three main steps: document representation, classifier construction and performance evaluation, which has achieved good results up to 96%, when applied to a number of test documents for each sub-category of main categories. Expand
Hierarchical Text Classification for News Articles Based-on Named Entities
TLDR
The experimental results show that the use of named entities improves the performance of hierarchical text classification for news articles. Expand
Global Model for Hierarchical Multi-Label Text Classification
TLDR
This paper formalizes this task as a structured prediction problem and proposes a global model that jointly outputs multiple labels and a decoding algorithm for it that finds an exact solution with dynamic programming. Expand
Hybrid Intelligent Techniques for Text Categorization
TLDR
This paper shows that the proposed system has achieved good results up to 96%, when applied to a number of test text documents for each sub-category of main categories. Expand
...
1
2
3
4
...

References

SHOWING 1-10 OF 187 REFERENCES
Learning and Evaluation in the Presence of Class Hierarchies: Application to Text Categorization
TLDR
The notion of consistent classification which takes into account the semantics of a class hierarchy is introduced and a novel global hierarchical approach that produces consistent classification is presented that significantly outperforms the corresponding "flat" approach. Expand
On learning hierarchical classifications
TLDR
The potential benefits of using a given hierarchy over base classes to learn accurate multi-category classifiers for these domains is investigated and the benefits of learning categorydiscriminants in a “hard” top-down fashion is explored and compared to a "soft” approach which shares training data among sibling categories. Expand
Hierarchical classification of Web content
  • S. Dumais, Hao Chen
  • Computer Science
  • SIGIR '00
  • 2000
TLDR
This paper explores the use of hierarchical structure for classifying a large, heterogeneous collection of web content using support vector machine (SVM) classifiers, which have been shown to be efficient and effective for classification, but not previously explored in the context of hierarchical classification. Expand
Hierarchical text classification and evaluation
  • Aixin Sun, Ee-Peng Lim
  • Mathematics, Computer Science
  • Proceedings 2001 IEEE International Conference on Data Mining
  • 2001
Hierarchical classification refers to the assignment of one or more suitable categories from a hierarchical category space to a document. While previous work in hierarchical classification focused onExpand
Exploiting Hierarchy in Text Categorization
TLDR
The structure that is present in the semantic space of topics is used in order to improve performance in text categorization: according to their meaning, topics can be grouped together into “meta-topics”, e.g., gold, silver, and copper are all metals. Expand
Text classification in a hierarchical mixture model for small training sets
TLDR
A hierarchical mixture model which extends the standard naive Bayes classifier and previous hierarchical approaches is presented and improved estimates of the term distributions are made by differentiation of words in the hierarchy according to their level of generality/specificity. Expand
Hierarchical Text Categorization Using Neural Networks
TLDR
The results show that the use of the hierarchical structure improves text categorization performance with respect to an equivalent flat model, and the optimized Rocchio algorithm achieves a performance comparable with that of the Hierarchical Mixture of Experts model. Expand
Functional Annotation of Genes Using Hierarchical Text Categorization
TLDR
A novel global hierarchical learning approach that takes into account the semantics of a class hierarchy is presented that significantly outperforms the corresponding “flat” approach, i.e. the approach that does not consider any hierarchical information. Expand
The Effect of Using Hierarchical Classifiers in Text Categorization
TLDR
It is shown that judicious use of a hierarchy can significantly improve both the speed and effectiveness of the categorization process and described a procedure for generating a hierarchy of classifiers that models the hierarchy structure. Expand
Hierarchically Classifying Documents Using Very Few Words
TLDR
This work proposes an approach that utilizes the hierarchical topic structure to decompose the classification task into a set of simpler problems, one at each node in the classification tree, which can be solved accurately by focusing only on a very small set of features, those relevant to the task at hand. Expand
...
1
2
3
4
5
...