Data cleansing approaches have usually focused on detecting and fixing errors with little attention to scaling to big datasets. This presents a serious impediment since data cleansing often involves costly computations such as enumerating pairs of tuples, handling inequality joins, and dealing with user-defined functions. In this paper, we present …
We present NADEEF, an extensible, generic and easy-to-deploy data cleaning system. NADEEF distinguishes between a programming interface and a core to achieve generality and extensibility. The programming interface allows users to specify data quality rules by writing code that implements predefined classes. These classes uniformly define what is wrong with …
The purpose of this study was to assess the outcomes of treatment of femoral head osteonecrosis using free vascularised fibular grafting in patients with Hodgkin’s disease and non-Hodgkin’s lymphoma. We retrospectively reviewed seven patients (14 hips) with lymphoma who underwent free vascularised fibular grafting for osteonecrosis of the femoral head, …
BACKGROUND: As the current standard treatment for symptomatic cervical disc disease, anterior cervical decompression and fusion may result in progressive degeneration or disease of the adjacent segments. Cervical disc arthroplasty was theoretically designed to be an ideal substitute for fusion by preserving motion at the operative level and delaying adjacent …
PURPOSE: Although many total hip bearing implants are widely used all over the world, simultaneous comparisons across the numerous available bearing surfaces are rare. The purpose of this study was to compare the survivorship of total hip arthroplasty (THA) with six available bearing implants. METHODS: We conducted a systematic review of randomized …
Passive optical networks (PONs) are a prominent broadband access solution to tackle the "last mile" bottleneck in telecommunications infrastructure. Data transmission over standardized PONs is divided into time slots. For improving PON performance, a critical issue lies in resource management in the upstream transmission from multiple optical …
The rapid growth of data-intensive applications, including multimedia, e-business, e-learning, and Internet protocol television (IPTV), is driving the demand for higher data-storage capacity. Organizations want their huge amounts of data stored so that it is easily accessible and manageable. Furthermore, they require the critical data to be securely …
We establish a system model to analyze the stability of the predictor-based dynamic bandwidth allocation (PDBA) scheme over Ethernet passive optical networks (EPONs). We prove that an EPON system with PDBA is stable by proper pole placement as the traffic changes dynamically. Our analysis suggests a straightforward framework for designing the DBA algorithm …
Entity resolution (ER), the process of identifying and eventually merging records that refer to the same real-world entities, is an important and long-standing problem. We present Nadeef/Er, a generic and interactive entity resolution system, which is built as an extension over our open-source generalized data cleaning system Nadeef. Nadeef/Er provides a …