The realization of the Semantic Web depends on the availability of a critical mass of metadata for web content, associated with the respective formal knowledge about the world. We claim that the Semantic Web, at its current stage of development, is in critical need of metadata generation and usage schemata that are specific, well-defined and…
The KIM platform provides a novel Knowledge and Information Management infrastructure and services for automatic semantic annotation, indexing, and retrieval of documents. It provides a mature infrastructure for scalable and customizable information extraction (IE) as well as annotation and document management, based on GATE. In order to provide basic…
The approach towards Semantic Web Information Extraction (IE) presented here is implemented in KIM, a platform for semantic indexing, annotation, and retrieval. It combines IE based on the mature text engineering platform GATE with Semantic Web-compliant knowledge representation and management. The cornerstone is automatic generation of named-entity…
This paper describes the development of the RussIE system in which we experimented with the creation of reusable processing components and language resources for a Russian Information Extraction system. The work was done as part of a multilingual project to adapt existing tools and resources for HLT to new domains and languages. The system was developed…
Here we present work on using spatial knowledge in conjunction with information extraction (IE). A considerable volume of location data was imported into a knowledge base (KB) with entities of general importance used for semantic annotation, indexing, and retrieval of text. The Semantic Web knowledge representation standards are used, namely RDF(S). An…
The Rich News system, which can automatically annotate radio and television news with the aid of resources retrieved from the World Wide Web, is described. Automatic speech recognition gives a temporally precise but conceptually inaccurate annotation model. Information extraction from related web news sites gives the opposite: conceptual accuracy but no…
The T2K experiment observes indications of $\nu_\mu \to \nu_e$ appearance in data accumulated with $1.43 \times 10^{20}$ protons on target. Six events pass all selection criteria at the far detector. In a three-flavor neutrino oscillation scenario with $|\Delta m^2_{23}| = 2.4 \times 10^{-3}\ \mathrm{eV}^2$, $\sin^2 2\theta_{23} = 1$ and $\sin^2 2\theta_{13} = 0$, the expected number of such events is $1.5 \pm 0.3$ (syst). …
This paper motivates the need for Semantic Web enabled language technology tools and introduces a set of freely available, customisable components which integrate data about language with Semantic Web data in the form of ontologies. We also argue for a closer integration between Natural Language Processing (NLP) and Semantic Web tools and infrastructures…
The second heart sound, S2, is generally believed to be composed of aortic (A2) and pulmonary (P2) components. Previously, the normalized splitting interval (NSI) between the A2 and P2 components has been shown to be proportional to the pulmonary artery pressure (PAP). A set of fully automated algorithms based on adaptive modeling of A2/P2 components using…