Corpus ID: 8750401

Unsupervised word embedding based polarity detection for tamil tweets

@article{Nivedhitha2016UnsupervisedWE,
  title={Unsupervised word embedding based polarity detection for tamil tweets},
  author={E. Nivedhitha and S. Sanjay and M. A. Kumar and K. Soman},
  journal={Control theory \& applications},
  year={2016},
  volume={9}
}
With the advent of technological advancements in the recent times, people, and Internet have become inseparable. People belonging to all categories access the internet. Micro blogs from twitter has real time information as on how the general audience feel. Extracting this information and finding out the intent of the general audience can be of great help for business and political organizations. Sentimental analysis is a sub-branch of Natural Language Processing which intends in finding out the… Expand

Figures, Tables, and Topics from this paper

ON SENTIMENT ANALYSIS IN TAMIL TEXTS
Sentiment Analysis (SA) is an application of Natural Language Processing (NLP) to analyse the sentiments expressed in the text. It classifies into categories of qualities and opinions such as good,Expand
Sentiment Analysis of Twitter Data on Demonetization Using Machine Learning Techniques
TLDR
An attempt is made to understand the public opinion on the recently implemented demonetization policy in India through sentiment analysis on twitter data set using machine learning approaches. Expand
Sentiment Analysis in Tamil Texts: A Study on Machine Learning Techniques and Feature Representation
TLDR
Basic features such as word count and punctuation count are used in addition to traditional features including Bag of Words and Term Frequency-Inverse Document Frequency included to check their influence in the prediction. Expand
Topic categorization of Tamil News Articles using PreTrained Word2Vec Embeddings with Convolutional Neural Network
  • R. S, A. R., S. Murugan, M.S. Julie
  • Computer Science
  • 2020 International Conference on Computational Intelligence for Smart Power System and Sustainable Energy (CISPSSE)
  • 2020
TLDR
It is found that CNN with pretrained embeddings gave better F1 score compare to TFIDF feature learned with Support Vector Machine (SVM), Naive Bayes (NB) algorithm. Expand
Sentiment Lexicon Expansion using Word2vec and fastText for Sentiment Prediction in Tamil texts
TLDR
A sentiment lexicon expansion method using Word2vec and fastText word embeddings along with rule-based Sentiment Analysis method, which uses expanded lexicons, lists of conjunctions and negational words to predict the sentiments expressed in Tamil texts is proposed. Expand
A journey of Indian languages over sentiment analysis: a systematic review
TLDR
This research work will assist researchers in finding the available resources such as annotated datasets, pre-processing linguistic and lexical resources in Indian languages for sentiment analysis and will also support in selecting the most suitable sentiment analysis technique in a specific domain along with relevant future research directions. Expand
A Novel Sentiment Classification of Product Reviews using Levenshtein Distance
TLDR
A Levenshtein distance based sentiment classification engine that reads the reviews across different websites for a given product, eventually providing a metric that would help the user in making an informed choice of product is proposed. Expand
Prosody Detection from Text Using Aggregative Linguistic Features
TLDR
The proposed prosody prediction model for multifarious Tamil text is developed to classify the text into three classes of high, mid and low using the generated aggregative linguistic feature score. Expand
Amrita-CEN@MSIR-FIRE2016: Code-Mixed Question Classification using BoWs and RNN Embeddings
TLDR
An embedding based Bag-ofWords method and Recurrent Neural Network to achieve an automatic question classification in the code-mixed BengaliEnglish text is presented and two systems that classify questions mostly at the sentence level are built. Expand

References

SHOWING 1-10 OF 23 REFERENCES
AMRITA-CEN@SAIL2015: Sentiment Analysis in Indian Languages
TLDR
A supervised algorithm is used for classifying the tweets into positive, negative and neutral labels using Naive Bayes classifier. Expand
Twitter, MySpace, Digg: Unsupervised Sentiment Analysis in Social Media
TLDR
The results demonstrate that the proposed algorithm, even though unsupervised, outperforms machine learning solutions in the majority of cases, overall presenting a very robust and reliable solution for sentiment analysis of informal communication on the Web. Expand
Simulated and Self-Sustained Classification of Twitter Data based on its Sentiment
TLDR
A methodology for naturally grouping the estimation of Twitter messages is presented, in blend with a Maximum Entropy, Naive Bayes and Decision Tree classifiers, and statistical representation of the classes has been depicted through KH Coder. Expand
SentiWordNet for Indian Languages
TLDR
Multiple computational techniques like, WordNet based, dictionary based, Dictionary based, corpus based or generative approaches for generating SentiWordNet(s) for Indian languages are proposed. Expand
AMRITA_CEN-NLP@SAIL2015: Sentiment Analysis in Indian Language Using Regularized Least Square Approach with Randomized Feature Learning
The present work is done as part of shared task in Sentiment Analysis in Indian Languages SAIL 2015, under constrained category. The task is to classify the twitter data into three polarityExpand
Labeling Emotion in Bengali Blog Corpus – A Fine Grained Tagging at Sentence Level
Emotion, the private state of a human entity, is becoming an important topic in Natural Language Processing (NLP) with increasing use of search engines. The present task aims to manually annotate theExpand
Supervised methods for domain classification of tamil documents
TLDR
Dinakarannewspaper dataset from EMILLE/CIIL Corpus has been utilized to experiment the ability of Machine Learning algorithms in Tamil domain classification. Expand
SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining
TLDR
SENTIWORDNET is a lexical resource in which each WORDNET synset is associated to three numerical scores Obj, Pos and Neg, describing how objective, positive, and negative the terms contained in the synset are. Expand
Sentimantics: Conceptual Spaces for Lexical Sentiment Polarity Representation with Contextuality
TLDR
A new way to represent sentiment knowledge in a Vector Space Model that can store dynamic prior polarity with varying contextual information is proposed. Expand
Sentiment Analysis using Support Vector Machines with Diverse Information Sources
This paper introduces an approach to sentiment analysis which uses support vector machines (SVMs) to bring together diverse sources of potentially pertinent information, including severalExpand
...
1
2
3
...