Corpus ID: 16689521

Using Wide Range of Features for Author profiling

@inproceedings{Maharjan2015UsingWR,
  title={Using Wide Range of Features for Author profiling},
  author={Suraj Maharjan and T. Solorio},
  booktitle={CLEF},
  year={2015}
}
  • Suraj Maharjan, T. Solorio
  • Published in CLEF 2015
  • Computer Science
  • Predicting an author’s age, gender and personality traits by analyzing his/her documents is important in forensics, marketing and resolving authorship disputes. Our system combines different styles, lexicons, topics, familial tokens and different categories of character n-grams as features to build a logistic regression model for four different languages: English, Spanish, Italian and Dutch. With this model, we obtained global ranking scores of 0.6623, 0.6547, 0.7411, 0.7662 for English…
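
The abstract mentions "different categories of character n-grams" as one feature family. A minimal sketch of that idea follows, using only the Python standard library. The three categories (`punct`, `space`, `word`), the punctuation set, and all function names are hypothetical simplifications chosen for illustration; the abstract does not specify the paper's actual categories or implementation.

```python
from collections import Counter

# Assumption: a small, illustrative punctuation set (not taken from the paper).
PUNCT = set(".,;:!?'\"()-")


def char_ngrams(text, n=3):
    """Return all overlapping character n-grams of `text`."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def ngram_category(gram):
    """Assign an n-gram to one of three hypothetical categories."""
    if any(ch in PUNCT for ch in gram):
        return "punct"   # contains at least one punctuation mark
    if any(ch.isspace() for ch in gram):
        return "space"   # spans a word boundary
    return "word"        # lies entirely inside a word


def typed_ngram_counts(text, n=3):
    """Count (category, n-gram) pairs so each category is a separate feature."""
    counts = Counter()
    for gram in char_ngrams(text.lower(), n):
        counts[(ngram_category(gram), gram)] += 1
    return counts


features = typed_ngram_counts("My mom says hi, my dad too.")
```

Counts like these could then be turned into a feature vector and passed to a logistic regression classifier, for instance scikit-learn's `LogisticRegression`; that step is left out here to keep the sketch dependency-free.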
    11 Citations


    Overview of the 3rd Author Profiling Task at PAN 2015
    • 195
    Author Profiling en Social Media: Identificación de Edad, Sexo y Variedad del Lenguaje
    • F. Pardo
    • Computer Science
    • Proces. del Leng. Natural
    • 2017
    CAPS: A Cross-genre Author Profiling System
    • 11
    Discriminating between Similar Languages Using a Combination of Typed and Untyped Character N-grams and Words
    • 13
    Author Identification in Turkish Documents with Ridge Regression Analysis
    • 1
    A Survey On Authorship Attribution Approaches
