Nationality Classification Using Name Embeddings

@inproceedings{Ye2017NationalityCU,
  title={Nationality Classification Using Name Embeddings},
  author={Junting Ye and S. Han and Yifan Hu and B. Coskun and Meizhu Liu and H. Qin and S. Skiena},
  booktitle={Proceedings of the 2017 ACM on Conference on Information and Knowledge Management},
  year={2017}
}
  • Junting Ye, S. Han, +4 authors S. Skiena
  • Published 2017
  • Computer Science
  • Proceedings of the 2017 ACM on Conference on Information and Knowledge Management
  • Nationality identification unlocks important demographic information, with many applications in biomedical and sociological research. Existing name-based nationality classifiers use name substrings as features and are trained on small, unrepresentative sets of labeled names, typically extracted from Wikipedia. As a result, these methods achieve limited performance and cannot support fine-grained classification. We exploit the phenomenon of homophily in communication patterns to learn name…
    28 Citations

    The Secret Lives of Names?: Name Embeddings from Social Media
    • 3
    Name-Nationality Classification Technology under Keras Deep Learning
    Homophily and Nationality Assortativity Among the Most Cited Researchers' Social Network
    • Michal Vaanunu, C. Avin
    • Computer Science
    • 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)
    • 2018
    • 4
    Single Training Dimension Selection for Word Embedding with PCA
    • Yu Wang
    • Computer Science
    • EMNLP/IJCNLP
    • 2019
    • 1
    User-Level Race and Ethnicity Predictors from Twitter Text
    • 20
    The preeminence of ethnic diversity in scientific collaboration
    • 49
    Ethnic Diversity Increases Scientific Impact
    • 7

    References

    Ethnea -- an instance-based ethnicity classifier based on geo-coded author names in a large-scale bibliographic database
    • 17
    • Highly Influential
    Name-Ethnicity Classification and Ethnicity-Sensitive Name Matching
    • 27
    • Highly Influential
    The classification of ethnic status using name information.
    • 73
    • Highly Influential
    Name-ethnicity classification from open sources
    • In SIGKDD
    • 2009
    Generally useful ethnic search system: GUESS
    • In Annual Meeting of the American Names Society
    • 1976