- Full text PDF available (8)
- This year (0)
- Last 5 years (2)
- Last 10 years (8)
Journals and Conferences
We address the problem of clustering the refinements of a user search query. The clusters computed by our proposed algorithm can be used to improve the selection and placement of the query suggestions proposed by a search engine, and can also serve to summarize the different aspects of information relevant to the original user query. Our algorithm clusters… (More)
Transmission of infectious diseases, propagation of information, and spread of ideas and influence through social networks are all examples of diffusion. In such cases we say that a contagion spreads through the network, a process that can be modeled by a cascade graph. Studying cascades and network diffusion is challenging due to missing data. Even a… (More)
This paper uncovers a new phenomenon in web search that we call domain bias --- a user's propensity to believe that a page is more relevant just because it comes from a particular domain. We provide evidence of the existence of domain bias in click activity as well as in human judgments via a comprehensive collection of experiments. We begin by studying the… (More)
Current speech synthesis technology is difficult to understand in everyday noise situations. Although there is a significant body of work on how humans modify their speech in noise, the results have yet to be implemented in a synthesizer. Algorithms capable of processing and incorporating these modifications may lead to improved speech intelligibility of… (More)
Analysis of a comprehensive set of features extracted from blogs for prediction of movie sales is presented. We use correlation, clustering and time-series analysis to study which features are best predictors.
We analyze URL and tag propagation on Twitter social network with 54 million nodes and 1.5 billion edges, one of the largest social networks studied in academia. We specifically focus on the interplay of external and network influences. We attribute propagation to the network whenever a user mentions a tag or URL after one of its neighbors has previously… (More)
We study the problem of object reconstruction based on lineage, using photographs as our driving application. In addition to standard forward reconstructions, our model allows inverse transformations, reconstructions that exploit properties (e.g., commutativity), and imperfect reconstructions. With these additions, our model provides many more options for… (More)
We have approximately one year’s worth of blog posts from  with over 12 million web blogs tracked. On average there are 500 thousand blog posts per day. In this project, we are attempting to extract from the blog data the set of features that are predictive of the movie gross sales, critics ratings, and viewers ratings (collected by sites like ).… (More)
Why running a startup is a lot like building a research lab.