Microsoft Academic Graph: When experts are not enough

@article{Wang2020MicrosoftAG,
  title={Microsoft Academic Graph: When experts are not enough},
  author={Kuansan Wang and Zhihong Shen and Chiyuan Huang and Chieh-Han Wu and Yuxiao Dong and Anshul Kanakia},
  journal={Quantitative Science Studies},
  year={2020},
  volume={1},
  pages={396-413}
}
An ongoing project explores the extent to which artificial intelligence (AI), specifically in the areas of natural language processing and semantic reasoning, can be exploited to facilitate the studies of science by deploying software agents equipped with natural language understanding capabilities to read scholarly publications on the web. The knowledge extracted by these AI agents is organized into a heterogeneous graph, called Microsoft Academic Graph (MAG), where the nodes and the edges… 
AIDA: a Knowledge Graph about Research Dynamics in Academia and Industry
Academia and industry share a complex, multifaceted, and symbiotic relationship. Analysing the knowledge flow between them, understanding which directions have the biggest potential, and
Link Prediction using Numerical Weights for Knowledge Graph Completion within the Scholarly Domain
Knowledge graphs (KGs) are widely used for modeling scholarly communication, informing scientometric analysis, and supporting a variety of intelligent services to explore the literature and predict
Link Prediction of Weighted Triples for Knowledge Graph Completion Within the Scholarly Domain
TLDR
The Weighted Triple Loss is proposed, a new loss function for KGE models that takes full advantage of the additional numerical weights on facts and it is even tolerant to incorrect weights, which enables to complete information about affiliation types, countries, and research topics.
Open Science Graphs Must Interoperate!
TLDR
This work describes the key motivations for i) the definition of a classification for OSGs to compare their features, identify commonalities and differences, and added value and for ii) thedefinition of an Interoperability Framework, specifically an information model and APIs that enable a seamless exchange of information across graphs.
Effective distributed representations for academic expert search
TLDR
It is observed that using contextual embeddings produced by a transformer model trained for sentence similarity tasks produces the most effective paper representations for document-centric expert retrieval, however, retrofitting the paper embedDings and using elaborate author contribution weighting strategies did not improve retrieval performance.
ResearchFlow: Understanding the Knowledge Flow Between Academia and Industry
TLDR
ResearchFlow, an approach that integrates semantic technologies and machine learning to quantifying the diachronic behaviour of research topics across academia and industry, is evaluated on the task of forecasting the impact of research Topics on the industrial sector and found that its granular characterization of topics improves significantly the performance with respect to alternative solutions.
The unknown knowns: a graph-based approach for temporal COVID-19 literature mining
TLDR
This is the first study to propose a holistic framework that includes encoding the scientific knowledge in a semantic graph, demonstrates an evolutionary examination of past and ongoing research and offers scientists with tools to generate new hypotheses and research directions through predictive modeling and deep machine learning techniques.
Application of graph theory in the library domain—Building a faceted framework based on a literature review
  • A. Lüschow
  • Computer Science
    Journal of Librarianship and Information Science
  • 2021
TLDR
The goal is to provide both researchers and libraries with a standard tool to classify scientific work, at the same time allowing for the identification of previously underrepresented areas where future research might be productive.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 70 REFERENCES
An Overview of Microsoft Academic Service (MAS) and Applications
TLDR
A knowledge driven, highly interactive dialog that seamlessly combines reactive search and proactive suggestion experience, and a proactive heterogeneous entity recommendation are demonstrated.
AMiner: Mining Deep Knowledge from Big Scholar Data
AMiner is the second generation of the ArnetMiner system. We focus on developing author-centric analytic and mining tools for gaining a deep understanding of the large and heterogeneous networks
An Analysis of the Microsoft Academic Graph
TLDR
The results show that the citation data and publication metadata correlate well with external datasets, and the MAG has very good coverage across different domains with a slight bias towards technical disciplines.
A Review of Microsoft Academic Services for Science of Science Studies
TLDR
The use of three key AI technologies that underlies its prowess in capturing scholarly communications with adequate quality and broad coverage are focused on, including a reinforcement learning approach to assessing scholarly importance for entities participating in scholarly communications, called the saliency, that serves both as an analytic and a predictive metric in MAS.
Dimensions: Building Context for Search and Evaluation
TLDR
The techniques used to create the Dimensions dataset, a new database that focussed on a different set of problems to other scholarly search systems, holds a representation of an n-partite graph associated with the heterogeneous research objects.
A Scalable Hybrid Research Paper Recommender System for Microsoft Academic
TLDR
There is a strong correlation between participant scores and the similarity rankings produced by the large scale hybrid paper recommender system but that additional focus needs to be put towards improving recommender precision, particularly for content based recommendations.
Sustainable computational science: the ReScience initiative
TLDR
ReScience is a peer-reviewed journal that targets computational research and encourages the explicit replication of already published research, promoting new and open-source implementations in order to ensure that the original research can be replicated from its description.
Citation analysis with microsoft academic
TLDR
It is postulated that Microsoft Academic has the potential to be used for full-fledged bibliometric analyses, and structured and rich metadata is found, which facilitates data retrieval, handling and processing.
The Google scholar experiment: How to index false papers and manipulate bibliometric indicators
TLDR
The results of an experiment undertaken to analyze Google Scholar's capacity to detect citation‐counting manipulation are shown and it is discussed how the effects of such an outburst could affect the future development of such products, at both the individual level and the journal level.
The Microsoft Academic Search challenges at KDD Cup 2013
TLDR
KDD Cup 2013 invited participants to tackle author name ambiguity by automatically determining which papers in an author profile are truly written by a given author, and by identifying which author profiles need to be merged because they belong to the same author.
...
1
2
3
4
5
...