Learn More
As online document collections continue to expand, both on the Web and in proprietary environments, the need for duplicate detection becomes more critical. Few users wish to retrieve search results consisting of sets of duplicate documents, whether identical duplicates or close matches. Our goal in this work is to investigate the phenomenon and determine(More)
This paper describes how an online directory of expert witnesses was created from jury verdict and settlement documents using text mining techniques. We have created an expert witness directory that contains over 100,000 expert profiles, based on approximately 300,000 jury verdict and settlement documents, publicly available professional license(More)
Medical terms occur across a wide variety of legal, medical, and news corpora. Documents containing these terms are of particular interest to legal professionals operating in such fields as medical malpractice, personal injury, and product liability. This paper describes a novel method of tagging medical terms in legal, medical, and news text that is very(More)
This paper describes using RDF/RDFS/XML to create and navigate a metadata model of relationships among entities in text. The metadata we create is roughly an order of magnitude smaller than the content being modeled, it provides the end-user with context sensitive information about the hyper-linked entities in focus. These entities at the core of the model(More)
  • 1