Modern Information Retrieval - the concepts and technology behind search, Second edition
Contents: Preface; Acknowledgements; 1 Introduction; 2 User Interfaces for Search, by Marti Hearst; 3 Modeling; 4 Retrieval Evaluation; 5 Relevance Feedback and Query Expansion; 6 Documents: Languages & Properties, with Gonzalo Navarro and Nivio Ziviani; 7 Queries: Languages and Properties, with Marcos Gonçalves.
A new approach to text searching
We introduce a family of simple and fast algorithms for solving the classical string matching problem, string matching with don't care symbols and complement symbols, and multiple patterns.
Information Retrieval: Data Structures and Algorithms
An edited volume containing data structures and algorithms for information retrieval, including a disk with examples written in C.
FA*IR: A Fair Top-k Ranking Algorithm
We define and solve the Fair Top-k Ranking problem, in which we want to determine a subset of k candidates from a large pool of n ≫ k candidates, maximizing utility (i.e., select the "best" candidates).
Design and Implementation of Relevance Assessments Using Crowdsourcing
We introduce a methodology for crowdsourcing relevance assessments and the results of a series of experiments using TREC 8 with a fixed budget.
Improved query difficulty prediction for the web
Query performance prediction aims to predict whether a query will have a high or low average precision given retrieval from a particular collection.
Link analysis for Web spam detection
We propose link-based techniques for automatic detection of Web spam, a term referring to pages which use deceptive techniques to obtain undeservedly high scores in search engines. The use of Web...
Predicting The Next App That You Are Going To Use
We model the prediction of the next app as a classification problem and propose an effective personalized method to solve it that takes full advantage of human-engineered features and automatically derived features.
Using rank propagation and Probabilistic counting for Link-Based Spam Detection
This paper describes a link-based technique for automating the detection of Web spam, that is, pages using deceptive techniques.
Proteomic analysis of peach fruit mesocarp softening and chilling injury using difference gel electrophoresis (DIGE)
Background: Peach fruit undergoes a rapid softening process that involves a number of metabolic changes. Storing fruit at low temperatures has been widely used to extend its postharvest life. However,...