Learning Field Compatibilities to Extract Database Records from Unstructured Text


Named-entity recognition systems extract entities such as people, organizations, and locations from unstructured text. Rather than extract these mentions in isolation, this paper presents a record extraction system that assembles mentions into records (i.e. database tuples). We construct a probabilistic model of the compatibility between field values, then…


7 Figures and Tables

