Emotion Classification with Reduced Feature Set SGDClassifier, Random Forest and Performance Tuning

  title={Emotion Classification with Reduced Feature Set SGDClassifier, Random Forest and Performance Tuning},
  author={Kaushika Pal and Biraj V. Patel},
  journal={Computing Science, Communication and Security},
  pages={95 - 108}
  • Kaushika Pal, B. Patel
  • Published 26 March 2020
  • Computer Science
  • Computing Science, Communication and Security
Text Classification is vital and challenging due to varied kinds of data generated these days; emotions classification represented in form of text is more challenging due to diverse kind of emotional content and such content is growing on web these days. This research work is classifying emotions written in Hindi in form of poem with 4 categories namely Karuna, Shanta, Shringar and Veera. POS tagging is used on all the poem and then features are extracted by observing certain poetic features… 
1 Citations
A Survey: Accretion in Linguistic Classification of Indian Languages
  • Dip Patel
  • Linguistics
    Data Engineering for Smart Systems
  • 2021


An Efficient Method for Text Classification Task
The experimental results show that the LSTM classification model has better classification result than the traditional machine learning model, and the L STM model with the Attention mechanism has a certain degree of improvement compared with the classical text classification models.
Text Classification Based on Keywords with Different Thresholds
A text classification approach based on automatic keywords extraction with different thresholes for Vietnamese text documents, which belong to ten topics, downloaded from two electronic magazines vnexpress.net and vietnamnet.vn is proposed.
Poetic Features for Poem Recognition: A Comparative Study
Findings of this study suggest that the proposed features generate highly accurate classifiers, which can be used for poem classification in large databases such as World Wide Web.
Comparative Study between Traditional Machine Learning and Deep Learning Approaches for Text Classification
This work has created multiple classifiers for document classification and compared their accuracy on raw and processed data and is also exploring hierarchical classifier for classification of classes and subclasses.
A Semantics Aware Random Forest for Text Classification
SARF extracts the features used by trees to generate the predictions and selects a subset of the predictions for which the features are relevant to the predicted classes and evaluated its classification performance on real-world text datasets and assessed its competitiveness with state-of-the-art ensemble selection methods.
Do We Need More Training Samples For Text Classification?
In this article, feature selection as a technology to remove irrelevant features was found be able to prevent overfitting to a great extent and the kappa coefficient as a performance measure of classifiers could increase 11 points at the maximum.
Word embedding and cognitive linguistic models in text classification tasks
For model based on word embedding there are pretty vast opportunities for extension: from experimenting with different word embeddings and various distance metrics to more complicated architecture of layers and even promising state of the art artificial neural network models, activation functions and their modifications.
Text classification using Fuzzy TF-IDF and Machine Learning Models
This work compares several Machin Learning algorithms such as Naïve Bayes and its derivatives, SVM and Random forest classifiers, using the FTF-IDF representation, and calls sum attribute selection methods to improve the quality of the used classifiers.
Sentiment Analysis in a Resource Scarce Language:Hindi
A Sentiment Analysis in Hindi Language (SAHL) is proposed for reviews in movie domain and the effectiveness of the proposed approach is confirmed by extensive simulations performed on a large movie dataset.
Supervised learning Methods for Bangla Web Document Categorization
Empirical results support that all four methods for categorization of Bangla documents produce satisfactory performance with SVM attaining good result in terms of high dimensional and relatively noisy document feature vectors.