Nationality Classification Using Name Embeddings

@article{Ye2017NationalityCU,
  title={Nationality Classification Using Name Embeddings},
  author={Junting Ye and Shuchu Han and Yifan Hu and Baris Coskun and Meizhu Liu and Hong Qin and Steven Skiena},
  journal={ArXiv},
  year={2017},
  volume={abs/1708.07903}
}
Nationality identification unlocks important demographic information, with many applications in biomedical and sociological research. Existing name-based nationality classifiers use name substrings as features and are trained on small, unrepresentative sets of labeled names, typically extracted from Wikipedia. As a result, these methods achieve limited performance and cannot support fine-grained classification. We exploit the phenomenon of homophily in communication patterns to learn name…

Citations

Publications citing this paper.

The Secret Lives of Names?: Name Embeddings from Social Media


Single Training Dimension Selection for Word Embedding with PCA

  • Yu Wang
  • Computer Science
  • EMNLP/IJCNLP
  • 2019

Ethnic Diversity Increases Scientific Impact


Homophily and Nationality Assortativity Among the Most Cited Researchers' Social Network

  • Michal Vaanunu, Chen Avin
  • Computer Science
  • 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)
  • 2018
