Ganesh Krishnan

Learn More
a b o r a t o r y Depar t ment of Com p u te rS c ie n c e s Pur due Uni ver s i t y 1398 De p a r t m e n t o f C om put er Sci ences W e stL afayette, IN 47907{1398 Ab s t r a c t We consider t he p robl emof d i s t r i b u t i n g p ot e n t i a l l y dangerous i n f o rm at i o n t o a n umber of c ompet i ng p art i e s .A s a p rim e e x a mpl e , w(More)
Entity matching (EM) has been a long-standing challenge in data management. Most current EM works focus only on developing matching algorithms. We argue that far more efforts should be devoted to building EM systems. We discuss the limitations of current EM systems, then present as a solution Magellan, a new kind of EM systems. Magellan is novel in four(More)
Big Data industrial systems that address problems such as classification, information extraction, and entity matching very commonly use hand-crafted rules. Today, however, little is understood about the usage of such rules. In this paper we explore this issue. We discuss how these systems differ from those considered in academia. We describe default(More)
Entity matching (EM) has been a long-standing challenge in data management. Most current EM works, however, focus only on developing matching algorithms. We argue that far more efforts should be devoted to building EM systems. We discuss the limitations of current EM systems, then present Magellan, a new kind of EM systems that addresses these limitations.(More)
BACKGROUND A large body of work in the clinical guidelines field has identified requirements for guideline systems, but there are formidable challenges in translating such requirements into production-quality systems that can be used in routine patient care. Detailed analysis of requirements from an implementation perspective can be useful in helping define(More)
Many works have applied crowdsourcing to entity matching (EM). While promising, these approaches are limited in that they often require a developer to be in the loop. As such, it is difficult for an organization to deploy multiple crowdsourced EM solutions, because there are simply not enough developers. To address this problem, a recent work has proposed(More)
To my Grandparents iv ACKNOWLEDGEMENTS This dissertation would not have been possible without the guidance and the help of several individuals who in one way or another contributed and extended their valuable assistance in the preparation and completion of this thesis. First and foremost, my utmost gratitude to my academic advisor, Dr. Madhavan Swaminathan,(More)