Witten and Frank's textbook was one of two books that I used for a data mining class in the Fall of 2001. The book covers all major methods of data mining that produce a knowledge representation as output. Knowledge representation is hereby understood as a representation that can be studied, understood, and interpreted by human beings, at...
Building a common controlled vocabulary is a formidable challenge in medical informatics. Because of the vast scale of medical data and the multiplicity of its interpretations, overlapping terminologies naturally arise in the practice of medical informatics [A. Rector, Clinical terminology: why is it so hard? Methods Inf. Med. 38 (1999) 239-252]. A major...
Comprehension of complex controlled vocabularies is often difficult. We present a method, facilitated by an object-oriented database, for depicting such a vocabulary (the Medical Entities Dictionary (MED) from the Columbia-Presbyterian Medical Center) in a schematic way which uses a sparse inheritance network of area classes. The resulting Object Oriented...
The notion of a part-whole relationship plays an important role when modeling data in many advanced application domains. It is therefore important that Object-Oriented Database (OODB) systems include support for this modeling primitive. We present a comprehensive part model for OODB systems. The model's foundation is a part-whole relationship that captures...
OBJECTIVE: Synonym-substitution algorithms have been developed for the purpose of matching source vocabulary terms with existing Unified Medical Language System (UMLS) terms during the integration process. A drawback is the possible explosion in the number of newly generated (potential) synonyms, which can tax computational and expert review resources...
The vision of the Semantic Web is to reduce manual discovery and usage of Web resources (documents and services) and to allow intelligent agents to automatically identify these Web resources, integrate them, and execute them to achieve the user's intended goals. The composed Web service is represented as a workflow, called a service flow. This paper...
The UMLS contains more than 100 source vocabularies and is growing via the integration of others. When integrating a new source, the source terms already in the UMLS must first be found. The easiest approach to this is simple string matching. However, string matching usually does not find all concepts that should be found. A new methodology, based on the...
OBJECTIVE: To develop and test an auditing methodology for detecting errors in medical terminologies satisfying systematic inheritance. This methodology is based on various abstraction taxonomies that provide high-level views of a terminology and highlight potentially erroneous concepts. DESIGN: Our auditing methodology is based on dividing concepts of a...
The Unified Medical Language System (UMLS) integrates many well-established biomedical terminologies. The UMLS Semantic Network (SN) can help orient users to the vast knowledge content of the UMLS Metathesaurus (META) via its abstract conceptual view. However, the SN itself is large and complex and may still be difficult to comprehend. Our technique...