Overview of STEM Science as Process, Method, Material, and Data Named Entities

@article{DSouza2022OverviewOS,
  title={Overview of STEM Science as Process, Method, Material, and Data Named Entities},
  author={Jennifer D’Souza},
  journal={ArXiv},
  year={2022},
  volume={abs/2205.11863}
}
We are faced with an unprecedented production in scholarly publications worldwide. Stakeholders in the digital libraries posit that the document-based publishing paradigm has reached the limits of adequacy. In-stead, structured, machine-interpretable, fine-grained scholarly knowledge publishing as Knowledge Graphs (KG) is strongly advo-cated. In this work, we develop and analyze a large-scale structured dataset of STEM articles across 10 different disciplines, viz. Agriculture , Astronomy… 

References

SHOWING 1-10 OF 80 REFERENCES
The STEM-ECR Dataset: Grounding Scientific Entity References in STEM Scholarly Content to Authoritative Encyclopedic and Lexicographic Sources
TLDR
The findings cumulatively indicate that human annotation and automatic learning of multidisciplinary scientific concepts as well as their semantic disambiguation in a wide-ranging setting as STEM is reasonable.
Improving Access to Scientific Literature with Knowledge Graphs
TLDR
A scholarly knowledge graph can be used to give a condensed overview on the state-of-the-art addressing a particular research quest, for example as a tabular comparison of contributions according to various characteristics of the approaches.
SemEval-2021 Task 11: NLPContributionGraph - Structuring Scholarly NLP Contributions for a Research Knowledge Graph
TLDR
The SemEval-2021 Shared Task NLPContributionGraph tasks participants to develop automated systems that structure contributions from NLP scholarly articles in the English language at three levels of information granularity, i.e. at sentence-level, phrase- level, and phrases organized as triples toward Knowledge Graph (KG) building.
Identifying Used Methods and Datasets in Scientific Publications
TLDR
This paper proposes an approach to identifying methods and datasets in texts that have actually been used by the authors and can be extended in such a way that it can identify mentions of any entity type (e.g., task).
Domain-Independent Extraction of Scientific Concepts from Research Articles
TLDR
A set of generic scientific concepts that have been identified in a systematic annotation process are suggested and used to annotate a corpus of scientific abstracts from 10 domains of Science, Technology and Medicine at the phrasal level in a joint effort with domain experts.
Microsoft Academic Graph: When experts are not enough
TLDR
The design, schema, and technical and business motivations behind MAG are described and how MAG can be used in analytics, search, and recommendation scenarios are elaborated.
Sentence, Phrase, and Triple Annotations to Build a Knowledge Graph of Natural Language Processing Contributions—A Trial Dataset
TLDR
NlpContributionGraph data is integrated into the Open Research Knowledge Graph (ORKG), a next-generation KG-based digital library with intelligent computations enabled over structured scholarly knowledge, as a viable aid to assist researchers in their day-to-day tasks.
Pattern-based Acquisition of Scientific Entities from Scholarly Article Titles
TLDR
A rule-based approach for the automatic acquisition of salient scientific entities from Computational Linguistics (CL) scholarly article titles by selecting lexico-syntactic patterns that were easily recognizable, occurred frequently, and positionally indicated a scientific entity type.
EXTRACTING BIO‐MOLECULAR EVENTS FROM LITERATURE—THE BIONLP’09 SHARED TASK
TLDR
The BioNLP’09 shared task on event extraction received community‐wide participation, establishing the state‐of‐the‐art performance at fine‐grained event extraction as well as allowing the identification of remaining challenges and suggesting directions for future improvements.
Construction of the Literature Graph in Semantic Scholar
TLDR
This paper reduces literature graph construction into familiar NLP tasks, point out research challenges due to differences from standard formulations of these tasks, and report empirical results for each task.
...
...