• Publications
  • Influence
Topic Detection and Tracking Pilot Study Final Report
Topic Detection and Tracking (TDT) is a DARPA-sponsored initiative to investigate the state of the art in finding and following new events in a stream of broadcast news stories. The TDT problemExpand
  • 1,112
  • 89
  • PDF
Topic detection and tracking: event-based information organization
TLDR
Topic Detection and Tracking: Event-based Information Organization brings together in one place state-of-the-art research in Topic Detection and tracking (TDT). Expand
  • 863
  • 79
  • PDF
A comparison of statistical significance tests for information retrieval evaluation
TLDR
Information retrieval (IR) researchers commonly use three tests of statistical significance: the Student's paired t-test, the Wilcoxon signed rank test, and the sign test. Expand
  • 609
  • 48
  • PDF
Retrieval and novelty detection at the sentence level
TLDR
This study investigates the more difficult two-part task defined by the TREC 2002 novelty track: given a topic and a group of documents relevant to that topic, find the relevant sentences from the documents, and 2) find the novel sentences from a collection of relevant sentences. Expand
  • 300
  • 43
  • PDF
UMass at TREC 2004: Novelty and HARD
TLDR
We investigated the use of clarification forms, fixed- and variable-length passage retrieval, and use of metadata. Expand
  • 204
  • 40
  • PDF
On-Line New Event Detection and Tracking
TLDR
We define and describe the related problems of new event detection and event tracking within a stream of broadcast news stories within a strict on-line setting-i.e., the system must make decisions about one story before looking at any subsequent stories. Expand
  • 455
  • 35
Automatic Query Expansion Using SMART: TREC 3
TLDR
The Smart information retrieval project emphasizes completely automatic approaches to the understanding and retrieval of large quantities of text. Expand
  • 611
  • 29
  • PDF
Text classification and named entities for new event detection
TLDR
New Event Detection is a challenging task that still offers scope for great improvement after years of effort. Expand
  • 393
  • 29
  • PDF
Introduction to topic detection and tracking
TLDR
The Topic Detection and Tracking program has been running for five years, starting with a pilot study and including yearly open and competitive evaluations since then. Expand
  • 283
  • 24
HARD Track Overview in TREC 2003: High Accuracy Retrieval from Documents
TLDR
The High Accuracy Retrieval from Documents (HARD) track explores methods for improving the accuracy of document retrieval systems. Expand
  • 217
  • 22
  • PDF