• Corpus ID: 249017947

Overview of STEM Science as Process, Method, Material, and Data Named Entities

@article{DSouza2022OverviewOS,
  title={Overview of STEM Science as Process, Method, Material, and Data Named Entities},
  author={Jennifer D’Souza},
  journal={ArXiv},
  year={2022},
  volume={abs/2205.11863}
}
We are faced with an unprecedented production in scholarly publications worldwide. Stakeholders in the digital libraries posit that the document-based publishing paradigm has reached the limits of adequacy. In-stead, structured, machine-interpretable, fine-grained scholarly knowledge publishing as Knowledge Graphs (KG) is strongly advo-cated. In this work, we develop and analyze a large-scale structured dataset of STEM articles across 10 different disciplines, viz. Agriculture , Astronomy… 

References

SHOWING 1-10 OF 80 REFERENCES

The STEM-ECR Dataset: Grounding Scientific Entity References in STEM Scholarly Content to Authoritative Encyclopedic and Lexicographic Sources

TLDR
The findings cumulatively indicate that human annotation and automatic learning of multidisciplinary scientific concepts as well as their semantic disambiguation in a wide-ranging setting as STEM is reasonable.

Improving Access to Scientific Literature with Knowledge Graphs

TLDR
A scholarly knowledge graph can be used to give a condensed overview on the state-of-the-art addressing a particular research quest, for example as a tabular comparison of contributions according to various characteristics of the approaches.

SemEval-2021 Task 11: NLPContributionGraph - Structuring Scholarly NLP Contributions for a Research Knowledge Graph

TLDR
The SemEval-2021 Shared Task NLPContributionGraph tasks participants to develop automated systems that structure contributions from NLP scholarly articles in the English language at three levels of information granularity, i.e. at sentence-level, phrase- level, and phrases organized as triples toward Knowledge Graph (KG) building.

Domain-Independent Extraction of Scientific Concepts from Research Articles

TLDR
A set of generic scientific concepts that have been identified in a systematic annotation process are suggested and used to annotate a corpus of scientific abstracts from 10 domains of Science, Technology and Medicine at the phrasal level in a joint effort with domain experts.

Microsoft Academic Graph: When experts are not enough

TLDR
The design, schema, and technical and business motivations behind MAG are described and how MAG can be used in analytics, search, and recommendation scenarios are elaborated.

Sentence, Phrase, and Triple Annotations to Build a Knowledge Graph of Natural Language Processing Contributions—A Trial Dataset

TLDR
NlpContributionGraph data is integrated into the Open Research Knowledge Graph (ORKG), a next-generation KG-based digital library with intelligent computations enabled over structured scholarly knowledge, as a viable aid to assist researchers in their day-to-day tasks.

Pattern-based Acquisition of Scientific Entities from Scholarly Article Titles

TLDR
A rule-based approach for the automatic acquisition of salient scientific entities from Computational Linguistics (CL) scholarly article titles by selecting lexico-syntactic patterns that were easily recognizable, occurred frequently, and positionally indicated a scientific entity type.

EXTRACTING BIO‐MOLECULAR EVENTS FROM LITERATURE—THE BIONLP’09 SHARED TASK

TLDR
The BioNLP’09 shared task on event extraction received community‐wide participation, establishing the state‐of‐the‐art performance at fine‐grained event extraction as well as allowing the identification of remaining challenges and suggesting directions for future improvements.

Construction of the Literature Graph in Semantic Scholar

TLDR
This paper reduces literature graph construction into familiar NLP tasks, point out research challenges due to differences from standard formulations of these tasks, and report empirical results for each task.

SciREX: A Challenge Dataset for Document-Level Information Extraction

TLDR
SciREX is introduced, a document level IE dataset that encompasses multiple IE tasks, including salient entity identification and document level N-ary relation identification from scientific articles, and a neural model is developed as a strong baseline that extends previous state-of-the-art IE models to document-level IE.
...