Sindice.com: a document-oriented lookup index for open linked data

@article{Oren2008SindicecomAD,
  title={Sindice.com: a document-oriented lookup index for open linked data},
  author={Eyal Oren and Renaud Delbru and Michele Catasta and Richard Cyganiak and Holger Stenzhorn and Giovanni Tummarello},
  journal={Int. J. Metadata Semant. Ontologies},
  year={2008},
  volume={3},
  pages={37--52}
}
Data discovery on the Semantic Web requires crawling and indexing of statements, in addition to the 'linked-data' approach of de-referencing resource URIs. Existing Semantic Web search engines are focused on database-like functionality, compromising on index size, query performance and live updates. We present Sindice, a lookup index over Semantic Web resources. Our index allows applications to automatically locate documents containing information about a given resource. In addition, we allow… 

SWSE: Objects before documents!

SWSE, a search engine over 1.1 billion statements published on the Semantic Web, is presented, which provides an easy-to-use end-user interface through which users can find and navigate an object-orientated information space.

Linked Data Indexing Methods: A Survey

This paper attempts to introduce advantages and disadvantages of the state-of-the-art solutions and discusses several issues related to the ongoing research effort - the proposal of an efficient querying framework over Linked Data.

SIREn: entity retrieval system for the web of data

It is demonstrated how SIREn can effectively answer queries over 10 billion triples on a single commodity machine.

A Node Indexing Scheme for Web Entity Retrieval

This paper presents an “entity retrieval system” designed to provide entity search capabilities over datasets as large as the entire Web of Data. It advocates the use of a node indexing scheme and shows that this scheme offers a good compromise between query expressiveness, query processing time and update complexity in comparison to three other indexing techniques.

LODatio: A Schema-Based Retrieval System for Linked Open Data at Web-Scale

Beyond classical search system functions such as retrieval, ranking, result set size estimation and providing result snippets, LODatio provides sophisticated support for the users in refining and expanding their information need.

Interactive Query Processing for Linked Data

This work has designed, implemented, and evaluated an approach using a breadth-first retrieval process to execute SPARQL queries over the Linked Data cloud, and special care was taken to support interactive applications, which depend on displaying results to users quickly.

Searching Linked Data and Services with a Single Query

This paper addresses the problem of searching data and services in the LOD based on the existing standards and techniques and introduces an approach that integrates data queries with data from service calls and compositions.

An Entity Name System for Linking Semantic Web Data

In this paper, we argue that realizing the vision of the Semantic Web as an open and global decentralized knowledge space would greatly benefit from the availability of a large-scale service which

Efficient querying of distributed linked data

The purpose of the ongoing research effort is to propose an efficient framework for querying Linked Data, which requires finding the compromise between storing data in local storages and accessing them directly on-demand in distributed data sources.

A Graph-based Approach to Indexing Semantic Web Data

This paper provides a means to index SW data in graph structures, which potentially benefit the graph exploration and ranking in SW querying.

References

SHOWING 1-10 OF 69 REFERENCES

Swoogle: a search and metadata engine for the semantic web

Swoogle is a crawler-based indexing and retrieval system for the Semantic Web. It extracts metadata for each discovered document, and computes relations between documents. Discovered documents are

Swoogle: Searching for Knowledge on the Semantic Web

Swoogle is an implemented system that discovers, analyzes and indexes knowledge encoded in semantic web documents on the Web, which helps human users and software systems to find relevant documents, terms and triples via its search and navigation services.

MultiCrawler: A Pipelined Architecture for Crawling and Indexing Semantic Web Data

This work contrasts the approach to conventional web crawlers, and describes and evaluates a five-step pipelined architecture to crawl and index data from both the traditional and the Semantic Web.

A taxonomy of web search

A taxonomy of web searches is explored, and how global search engines evolved to deal with web-specific needs is discussed.

Towards a scalable search and query engine for the web

This work presents a search engine that uses the RDF data model to enable interactive query answering over richly structured and interlinked data collected from many disparate sources on the Web.

Search Engines for Semantic Web Knowledge

The general issues underlying the indexing and retrieval of RDF based information are discussed and Swoogle, a crawler based search engine whose index contains information on over 1.3M RDF documents is described.

Enabling Semantic Web Communities with DBin: An Overview

An overview of the DBin Semantic Web information manager is given and how it enables users to create and experience the Semantic Web by exchanging RDF knowledge in P2P “topic” channels is described.

Cool URIs for the semantic web

This article discusses two strategies, called 303 URIs and hash URIs, and gives pointers to several web sites that use them, and briefly discusses why several other proposals have problems.

Characterizing the Semantic Web on the Web

A collection of Semantic Web documents from an estimated ten million available on the Web is harvested and analyzed, and a number of metrics, properties and usage patterns found to follow a power law distribution are described.

Tabulator: Exploring and Analyzing linked data on the Semantic Web

The Tabulator project is an attempt to demonstrate and utilize the power of linked RDF data with a user-friendly Semantic Web browser that is able to recognize and follow RDF links to other RDF resources based on the user’s exploration and analysis.