Tweets Classification using Corpus Dependent Tags, Character and POS N-grams

@inproceedings{GonzlezGallardo2015TweetsCU,
  title={Tweets Classification using Corpus Dependent Tags, Character and POS N-grams},
  author={Carlos E. Gonz{\'a}lez-Gallardo and Azucena Montes Rend{\'o}n and Gerardo Sierra and J. Antonio Nu{\~n}ez-Ju{\'a}rez and Adolfo Jonathan Salinas-L{\'o}pez and Juan Ek},
  booktitle={CLEF},
  year={2015}
}
This paper is part of the Author Profiling task at PAN 2015 contest; in witch participants had to predict the gender, age and personality traits of Twitter users in four different languages (Spanish, English, Italian and Dutch). Our approach takes into account stylistic features represented by character Ngrams and POS N-grams to classify tweets. The main idea of using character Ngrams is to extract as much information as possible that is encoded inside the tweet (emoticons, character flooding… CONTINUE READING
Highly Cited
This paper has 18 citations. REVIEW CITATIONS