Ontology Extraction and Usage in the Scholarly Knowledge Domain

  title={Ontology Extraction and Usage in the Scholarly Knowledge Domain},
  author={Angelo Salatino and Francesco Osborne and Enrico Motta},
  booktitle={Applications and Practices in Ontology Design, Extraction, and Reasoning},
Ontologies of research areas have been proven to be useful in many application for analysing and making sense of scholarly data. In this chapter, we present the Computer Science Ontology (CSO), which is the largest ontology of research areas in the field of Computer Science, and discuss a number of applications that build on CSO, to support high-level tasks, such as topic classification, metadata extraction, and recommendation of books. 

Figures from this paper

CSO Classifier 3.0: a scalable unsupervised method for classifying documents in terms of research topics
This new version of the CSO Classifier includes a new component for discarding outlier topics and offers improved scalability, and is evaluated on a gold standard of manually annotated articles, demonstrating a significant improvement over alternative methods.
Dark Knowledge: Knowledge Graphs of What Cannot Happen or Should not Be Done
It is argued that in human thinking the main usage of logical reasoning is not making far-fetched strict logical conclusions but weeding out obviously wrong ideas and false information while generating new ideas is mostly intuitive, which can be used in constructing hybrid reasoning systems where neural networks play the role of intuition (generating possible solutions).


Deriving concept hierarchies from text
This paper presents a means of automatically deriving a hierarchical organization of concepts from a set of documents without use of training data or standard clustering techniques, using a type of co-occurrence known as subsumption.
Reducing the Effort for Systematic Reviews in Software Engineering
A novel Expert-Driven Automatic Methodology, EDAM, is introduced that allows researchers to skip the tedious tasks of keywording and manually classifying primary studies, thus freeing effort for the analysis and the discussion in SRs.
Improving Editorial Workflow and Metadata Quality at Springer Nature
The most recent version of Smart Topic Miner (STM), an ontology-driven application that assists the Springer Nature editorial team in annotating the volumes of all books covering conference proceedings in Computer Science, is presented.
Ontology-Based Recommendation of Editorial Products
Smart Book Recommender (SBR), an ontology-based recommender system developed by The Open University in collaboration with Springer Nature, which supports their Computer Science editorial team in selecting the products to market at specific venues, is created.
The Computer Science Ontology: A Large-Scale Taxonomy of Research Areas
The Computer Science Ontology (CSO), a large-scale, automatically generated ontology of research areas, which includes about 26K topics and 226K semantic relationships, is introduced by applying the Klink-2 algorithm on a very large dataset of 16M scientific articles.
Klink-2: Integrating Multiple Web Sources to Generate Semantic Topic Networks
Klink-2 is presented, a novel approach which improves on earlier work on automatic generation of semantic topic networks and addresses the aforementioned limitations by taking advantage of a variety of knowledge sources available on the web.
Exploring Scholarly Data with Rexplore
A novel tool, Rexplore, which integrates statistical analysis, semantic technologies, and visual analytics to provide effective support for exploring and making sense of scholarly data is developed and the results from a task-centric empirical evaluation are presented.
Forecasting the Spreading of Technologies in Research Communities
It is hypothesised that it is possible to learn typical technology propagation patterns from historical data and to exploit this knowledge to anticipate where a technology may be adopted next and to alert relevant stakeholders about emerging and relevant technologies in other fields.
A decade of Semantic Web research through the lenses of a mixed methods approach
This paper builds on a qualitative analysis of the main seminal papers, which adopt a top-down approach, and on quantitative results derived with three bottom-up data-driven approaches on a corpus of Semantic Web papers published between 2006 and 2015.
AI-KG: An Automatically Generated Knowledge Graph of Artificial Intelligence
The Artificial Intelligence Knowledge Graph (AI-KG), a large-scale automatically generated knowledge graph that describes 820K research entities, has been generated by applying an automatic pipeline that extracts entities and relationships using three tools: DyGIE++, Stanford CoreNLP, and the CSO Classifier.