• Corpus ID: 28742248

Twitter User Classification using Ambient Metadata

@article{Nagpal2014TwitterUC,
  title={Twitter User Classification using Ambient Metadata},
  author={Chirag Nagpal and Khushboo Singhal},
  journal={ArXiv},
  year={2014},
  volume={abs/1407.8499}
}
Microblogging websites, especially Twitter have become an important means of communication, in today's time. Often these services have been found to be faster than conventional news services. With millions of users, a need was felt to classify users based on ambient metadata associated with their user accounts. We particularly look at the effectiveness of the profile description field in order to carry out the task of user classification. Our results show that such metadata can be an effective… 
1 Citations

Figures and Tables from this paper

An exposition of the nature of volunteered geographical information and its suitability for integration into spatial data infrastructures
TLDR
Thissis (PhD)--University of Pretoria, 2016 is a posthumous publication based on a thesis presented at the 2016 South African Academy of Arts and Sciences (SAAS) convocation, where the author’s dissertation was presented as a stand-alone work.

References

SHOWING 1-7 OF 7 REFERENCES
A Machine Learning Approach to Twitter User Classification
TLDR
This paper automatically infer the values of user attributes such as political orientation or ethnicity by leveraging observable information such as the user behavior, network structure and the linguistic content of the user’s Twitter feed through a machine learning approach.
Characterizing Microblogs with Topic Models
TLDR
A scalable implementation of a partially supervised learning model (Labeled LDA) that maps the content of the Twitter feed into dimensions that correspond roughly to substance, style, status, and social characteristics of posts is presented.
Topical Keyphrase Extraction from Twitter
TLDR
A context-sensitive topical PageRank method for keyword ranking and a probabilistic scoring function that considers both relevance and interestingness of keyphrases for keyphrase ranking are proposed.
Named Entity Recognition in Tweets: An Experimental Study
TLDR
The novel T-ner system doubles F1 score compared with the Stanford NER system, and leverages the redundancy inherent in tweets to achieve this performance, using LabeledLDA to exploit Freebase dictionaries as a source of distant supervision.
Natural Language Processing with Python
This book offers a highly accessible introduction to natural language processing, the field that supports a variety of language technologies, from predictive text and email filtering to automatic
Scikit-learn: Machine Learning in Python
Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems. This package focuses on bringing
Introduction to Information Retrieval
  • R. Larson
  • Computer Science, Environmental Science
    J. Assoc. Inf. Sci. Technol.
  • 2010
TLDR
This chapter discusses Information Retrieval, the science and technology behind information retrieval and retrieval, and some of the techniques used in the retrieval of information.