• Publications
  • Influence
Efficient query evaluation using a two-level retrieval process
TLDR
We present an efficient query evaluation method based on a two level approach: at the first level, our method iterates in parallel over query term postings and identifies candidate documents using an approximate evaluation taking into account only partial information on term occurrences and no query independent factors; at the second level, promising candidates are fully evaluated and their exact scores are computed. Expand
  • 382
  • 79
  • PDF
Web-a-where: geotagging web content
TLDR
We describe Web-a-Where, a system for associating geography with Web pages. Expand
  • 584
  • 49
  • PDF
Static index pruning for information retrieval systems
TLDR
We introduce static index pruning methods that significantly reduce the index size in information retrieval systems. Expand
  • 232
  • 20
  • PDF
Searching XML documents via XML fragments
TLDR
We present an extension of the vector space model for searching XML collections via XML fragments and ranking results by relevance. Expand
  • 232
  • 12
The connectivity sonar: detecting site functionality by structural patterns
TLDR
Web sites today serve many different functions, such as corporate sites, search engines, e-stores, and so forth. Expand
  • 151
  • 11
Social search and discovery using a unified approach
TLDR
This research explores new ways to augment the search and discovery of relations between Web 2.0 entities using multiple types and sources of social information using multifaceted search, which provides an efficient update mechanism for relations between objects. Expand
  • 63
  • 6
  • PDF
Automatic query wefinement using lexical affinities with maximal information gain
TLDR
This work describes an automatic query refinement technique, which focuses on improving precision of the top ranked documents. Expand
  • 94
  • 4
JuruXML - an XML Retrieval System at INEX'02
TLDR
We propose to extend the realm of XML by supporting the information needs of users wishing to query XML collections in a flexible way without knowing much about the documents structure. Expand
  • 54
  • 4
  • PDF
Juru at TREC 10 - Experiments with Index Pruning
TLDR
Evaluation de JURU, systeme Java de recherche d'information, dans les tâches Web de TREC, principalement dans le domaine ad-hoc This is the first year that Juru, a Java IR system developed over the past few years at the IBM Research Lab in Haifa, participated in TREC. Expand
  • 54
  • 3
  • PDF
PicASHOW: pictorial authority search by hyperlinks on the Web
TLDR
We describe PicASHOW, a fully automated WWW image retrieval system that is based on several link-structure analyzing algorithms that is able to retrieve relevant images even when those images are stored in files with meaningless names. Expand
  • 48
  • 3
  • PDF