Semantic Annotation of Scholarly Documents and Citations

@inproceedings{Ciancarini2013SemanticAO,
  title={Semantic Annotation of Scholarly Documents and Citations},
  author={Paolo Ciancarini and Angelo Di Iorio and Andrea Giovanni Nuzzolese and Silvio Peroni and Fabio Vitali},
  booktitle={AI*IA},
  year={2013}
}
Scholarly publishing is in the middle of a revolution based on the use of Web-related technologies as a medium of communication. In this paper we describe our ongoing study of semantic publishing and automatic annotation of scholarly documents, presenting several models and tools for the automatic annotation of structural and semantic components of documents. In particular, we focus on citations and their automatic classification obtained by CiTalO, a framework that combines ontology learning…
Characterising Citations in Scholarly Articles: An Experiment
TLDR
A comparison of the performance of different users is introduced, and strengths and difficulties that emerged when using that particular model to characterise citations of scholarly articles are shown.
On the Automated Generation of Scholarly Publishing Linked Datasets: The Case of CEUR-WS Proceedings
TLDR
A system is presented that automatically generates rich RDF datasets from CEUR-WS workshop proceedings and exposes them as Linked Data, modelled and published as an RDF graph.
Knowledge Extraction and Modeling from Scientific Publications
TLDR
This work proposes a platform to automatically extract, enrich and characterize several structural and semantic aspects of scientific publications, representing them as RDF datasets and presents a set of tailored Web visualizations to ease the exploration of the distinct facets of scientific knowledge extracted by the platform.
A Pattern-Based Ontology for Describing Publishing Workflows
TLDR
The Publishing workflow Ontology (PWO) is introduced, an OWL 2 DL ontology for the description of generic workflows that is particularly suitable for formalising typical publishing processes such as the publication of articles in journals.
Pattern Analysis of Citation-Anchors in Citing Documents for Accurate Identification of In-Text Citations
TLDR
A taxonomy and a workable system are proposed; the system uses a set of heuristics built from a detailed study and is applied to an unseen, diversified data set taken from the Journal of Universal Computer Science and CiteSeer.
Lessons Learned: The Complexity of Accurate Identification of in-Text Citations
TLDR
The accurate identification of in-text citations will help information retrieval systems, digital libraries and citation indexes, as well as highlighting the problems (mathematical ambiguities, wrong allotments, commonality in content and string variation) in identifying in-text citations from scientific documents.
Semantify CEUR-WS Proceedings: Towards the Automatic Generation of Highly Descriptive Scholarly Publishing Linked Datasets
TLDR
This paper presents a system that automatically generates rich RDF datasets from CEUR-WS workshop proceedings through a sequence of processing phases and presents its evaluation following the procedure set by the organizers of the ESWC2014 Semantic Publishing Challenge.
CAD: an algorithm for citation-anchors detection in research papers
TLDR
The paper proposes an algorithm, CAD, for identifying citation-anchors and their in-text citation frequency based on different rules, and shows that CAD improved the F-score by 44% and 37% on the J.UCS and CiteSeer datasets, respectively, over the contemporary technique.
Exploiting the Dr. Inventor Framework to Mine Scientific Publications
During the last decade the amount of scientific articles available online has substantially grown in parallel with the adoption of the Open Access publishing model. Nowadays researchers, as well as…
Important citation identification using sentiment analysis of in-text citations
TLDR
This research proposed an in-text citation sentiment analysis-based approach for binary classification which effectively enhanced the results of the state-of-the-art on the benchmark dataset.

References

SHOWING 1-10 OF 22 REFERENCES
Semantic publishing: the coming revolution in scientific journal publishing
Recent developments in Web technology can be used for semantic enhancement of scholarly journal articles, by aiding publication of data and metadata and providing ‘lively’ interactive access to…
FaBiO and CiTO: Ontologies for describing bibliographic resources and citations
TLDR
This article introduces the principles and architectures of two new ontologies central to the task of semantic publishing: FaBiO, the FRBR-aligned Bibliographic Ontology, an ontology for recording and publishing bibliographic records of scholarly endeavours on the Semantic Web, and CiTO, the Citation Typing Ontology.
Towards the Automatic Identification of the Nature of Citations
TLDR
An algorithm is proposed, called CiTalO, to infer automatically the function of citations by means of Semantic Web technologies and NLP techniques, and some strengths and limitations of this approach are discussed.
Knowledge Extraction Based on Discourse Representation Theory and Linguistic Frames
TLDR
This work defines a mapping between DRT and RDF/OWL for the production of quality linked data and ontologies, and presents FRED, an online tool for converting text into internally well-connected and linked-data-ready ontologies in web-service-acceptable time.
Faceted documents: describing document characteristics using semantic lenses
TLDR
This paper proposes eight different semantic lenses through which these facets may be viewed, and describes and exemplifies the ontologies by which these lenses may be implemented.
Recognising document components in XML-based academic articles
TLDR
An algorithm is proposed that allows us to identify the structural role of each element in a set of homogeneous scientific articles stored as XML files.
Mining Semantic Relations between Research Areas
TLDR
Klink is proposed, a new approach to automatically generating relations between research areas and populating a bibliographic ontology; it combines machine learning methods with external knowledge drawn from a number of resources, including Google Scholar and Wikipedia.
From Proteins to Fairytales: Directions in Semantic Publishing
There has been an ongoing discussion about how to improve the delivery of scientific content using online tools, especially by focusing on content reuse and social media. This column explores how…
The OntoWordNet Project: Extension and Axiomatization of Conceptual Relations in WordNet
TLDR
Preliminary results provide the conviction that a research program aiming to obtain a consistent, modularized, and axiomatized ontology from WordNet can be completed in acceptable time with the support of semi-automatic techniques.
Utopia documents: linking scholarly literature with research data
TLDR
Utopia Documents is a novel PDF reader that semantically integrates visualization and data-analysis tools with published research articles, and has been used to transform static document features into objects that can be linked, annotated, visualized and analyzed interactively.