Corpus ID: 235790517

Bib2Auth: Deep Learning Approach for Author Disambiguation using Bibliographic Data

  title={Bib2Auth: Deep Learning Approach for Author Disambiguation using Bibliographic Data},
  author={Zeyd Boukhers and Nagaraj Bahubali and Abinaya Thulsi Chandrasekaran and Adarsh Anand and Soniya Manchenahalli Gnanendra Prasadand and Sriram Aralappa},
Author name ambiguity remains a critical open problem in digital libraries due to synonymy and homonymy of names. In this paper, we propose a novel approach to link author names to their real-world entities by relying on their co-authorship pattern and area of research. Our supervised deep learning model identifies an author by capturing his/her relationship with his/her co-authors and area of research, which is represented by the titles and sources of the target author’s publications. These… Expand

Figures and Tables from this paper


Two supervised learning approaches for name disambiguation in author citations
Two supervised learning approaches to disambiguate authors in the citations are investigated, one uses the naive Bayes probability model, a generative model; the other uses support vector machines (SVMs) and the vector space representation of citations, a discriminative model. Expand
Evaluating Co-authorship Networks in Author Name Disambiguation for Common Names
The goal of this study is to evaluate a method which uses co-authorship networks and analyze the effect of common names on it and show that despite the good performance of implemented method for most names, it should optimize for common names. Expand
A survey of author name disambiguation techniques: 2010–2016
This survey provides a detailed review of author name disambiguation techniques available in the literature, makes a comparison of these techniques at an abstract level and discusses their limitations. Expand
Effective self-training author name disambiguation in scholarly digital libraries
A novel two-step disambiguation method, SAND (Self-training Associative Name Disambiguator), that eliminates the need of any manual labeling effort and is as effective as, and in some cases superior to, supervised ones, without manually labeling any training example. Expand
Detecting Ambiguous Author Names in Crowdsourced Scholarly Data
This paper proposes an approach to detect ambiguous names at query time, which makes it applicable in the context of a social computing application, and explores two kinds of heuristic features based on citations and crowd sourced topics. Expand
Name2Vec: Personal Names Embeddings
This paper proposes to create name-embeddings by employing a Doc2Vec methodology, where each name is viewed as a document and each letter in the name is considered a word, and shows that the new proposed method can predict with high accuracy when a pair of names matches. Expand
A brief survey of automatic methods for author name disambiguation
A taxonomy for characterizing the current author name disambiguation methods described in the literature is proposed, a brief survey of the most representative ones is presented and several open challenges are discussed. Expand
Using Deep Learning Word Embeddings for Citations Similarity in Academic Papers
A corpus-based approach using deep learning word embeddings to compute more effective citation similarity is proposed and the final results demonstrate that deep learning based approach can enhance the effectiveness of citation similarity. Expand
Author name disambiguation in MEDLINE
This work test the hypothesis that the Author-ity model will suffice to disambiguate author names for the vast majority of articles in MEDLINE, a database that has each name on each article assigned to one of 6.7 million inferred author-individual clusters. Expand
A Web Service for Author Name Disambiguation in Scholarly Databases
A novel, web-based, RESTful API for searching disambiguated authors, using the PubMed database as a sample application and develops a novel algorithm that has a fast record-to-cluster match for record-based queries. Expand