To answer user queries, a data integration system employs a set of semantic mappings between the mediated schema and the schemas of data sources. In dynamic environments sources often undergo changes that invalidate the mappings. Hence, once the system is deployed, the administrator must monitor it over time to detect and repair broken mappings. Today...
We introduce Cimple, a joint project between the University of Illinois and the University of Wisconsin. Cimple aims to develop a software platform that can be rapidly deployed and customized to manage data-rich online communities. We first describe the envisioned working of such a software platform and our prototype, DBLife, which is a community portal...
The purpose of this pilot study was to evaluate the effectiveness of a family caregiver-focused intervention program (CARE) on the outcomes of hospitalized elders and their family caregivers. A randomized clinical trial was conducted with 49 family caregivers of hospitalized elders in a university medical center in upstate New York. Driven by...
When integrating data from multiple sources, a key task that online communities often face is to match the schemas of the data sources. Today, such matching often incurs a huge workload that overwhelms the relatively small set of volunteer integrators. In such cases, community members may not even volunteer to be integrators, due to the high workload, and...
Current approaches to developing information extraction (IE) programs have largely focused on producing precise IE results. As such, they suffer from three major limitations. First, it is often difficult to execute partially specified IE programs and obtain meaningful results, thereby producing a long "debug loop". Second, it often takes a long time...
The rapid growth of distributed data at enterprises and on the WWW has fueled significant interest in building data integration systems. Such a system provides users with a uniform query interface (called a mediated schema) to a multitude of data sources, thus freeing them from manually querying each individual source. To address some problems in the MOBS...
Building data integration systems today is largely done by hand, in a very labor-intensive and error-prone process. In this paper we describe a conceptually new solution to this problem: that of mass collaboration. The basic idea is to think about a data integration system as having a finite set of parameters whose values must be set. To build such a system...
In the 1960s, John Boyd recognized the importance of situation awareness for military operations and introduced the notion of the OODA loop (Observe, Orient, Decide, and Act). Today we realize that many applications have to deal with situation awareness: Customer Relationship Management, Human Capital Management, Supply Chain Management, patient care, power...
We address the problem of gray mail: messages that could reasonably be considered either spam or good. Email users often disagree on this mail, presenting serious challenges to spam filters in both model training and evaluation. In this paper, we propose four simple methods for detecting gray mail and compare their performance using recall-precision...