John R. Talburt

Learn More
The topic of entity resolution (ER) has always been a key component of data and information quality. It is not surprising that Lee et al. [2006] point out “multiple sources of the same information” as one of the key root conditions of data quality problems. In particular, it is often the case that these sources reference entities such as customers,(More)
This paper describes a method for using new customer data integration and customer recognition technologies that have been developed in the private sector to solve the problem of anonymous entity resolution across multiple, non-shared data stores held in government agencies. In the method described, new commercial technologies that link records using(More)
This paper reports the results of evaluating a large sample of the 23,858 type-10 homographs found in Roget's International Thesaurus (3rd Ed.) as defined by the Bryan Model of abstract thesauri, of which Roget's is an instantiation. According to the Bryan model, two different entries in a thesaurus that have the same spelling are homographs (semantically(More)
This paper introduces the concept of asserted resolution as a technique for entity resolution. In asserted resolution trusted information sources are used to force the equivalence (or non-equivalence) of entity references and identity structures regardless of matching conditions. The paper proposes five specific forms of assertion to support entity identity(More)
This paper reports on a project to develop SOG (Synthetic Occupancy Generator), a system to create realistic, but synthetic residential occupancy (name and address) histories as input for Entity Resolution (ER) processes. ER processes are intended to link records referencing the same, or related, real-world entities. Most organizations use some type of ER(More)
Prolog meta-circular interpreters, i.e., interpreters for Prolog written in Prolog, perform at least two operations on an object program - they parse it and execute its instructions. There is a useful variant of the meta-circular interpreter, the meta-circular parser, which as its name suggests, parses an object program without executing its instructions.(More)