You Wu

Learn More
Our news are saturated with claims of " facts " made from data. Database research has in the past focused on how to answer queries, but has not devoted much attention to discerning more subtle qualities of the resulting claims, e.g., is a claim " cherry-picking " ? This paper proposes a framework that models claims based on struc-tured data as parameterized(More)
Objects with multiple numeric attributes can be compared within any "subspace" (subset of attributes). In applications such as computational journalism, users are interested in claims of the form: <i>Karl Malone is one of the only two players in NBA history with at least 25,000 points, 12,000 rebounds, and 5,000 assists in one's career</i>. One challenge in(More)
BACKGROUND Adherence to disease-modifying therapies (DMTs) results in the reduction of the number and severity of relapses and delays the progression of multiple sclerosis (MS). Patients with lower adherence rates experience more inpatient visits and higher MS-related medical costs. Fingolimod, the first oral DMT approved by the US Food and Drug(More)
Towards computational journalism, we present FactWatcher, a system that helps journalists identify data-backed, attention-seizing facts which serve as leads to news stories. FactWatcher discovers three types of facts, including situational facts, one-of-the-few facts, and prominent streaks, through a unified suite of data model, algorithm framework, and(More)
Are you fed up with "lies, d---ned lies, and statistics" made up from data in our media? For claims based on structured data, we present a system to automatically assess the quality of claims (beyond their correctness) and counter misleading claims that cherry-pick data to advance their conclusions. The key insight is to model such claims as parameterized(More)
Purpose. Acute healthcare utilization of stroke and bleeding has been previously examined among patients with nonvalvular atrial fibrillation (NVAF). The long-term cost of such outcomes over several years is not well understood. Methods. Using 1999-2009 Medicare medical and enrollment data, we identified incident NVAF patients without history of stroke or(More)
The most effective way to explore data is through visualizing the results of exploration queries. For example, an exploration query could be an aggregate of some measures over time intervals, and a pattern or abnormality can be discovered through a time series plot of the query results. In this paper, we examine a special kind of exploration query, namely(More)
Our media is saturated with claims of &#8220;facts&#8221; made from data. Database research has in the past focused on how to answer queries, but has not devoted much attention to discerning more subtle qualities of the resulting claims, for example, is a claim &#8220;cherry-picking&#8221;? This article proposes a framework that models claims based on(More)
The increasing popularity of mobile device usage has ush-ered in many features in modern search engines that help users with various information needs. One of those needs is Knowledge Exploration, where related documents are returned in response to a user query, either directly through right-hand side knowledge panels or indirectly through navigable(More)
This paper presents our work on GetDressed, a contextually-aware clothing recommendation system. There is an unmet need for automated and personalized advice about the daily task of selecting an outfit. This task may seem small, but it has a real impact on people's lives and many people are unsatisfied with the outfits they select and the time and effort(More)
  • 1