Squibs and Discussions: Reliability measurement without limits

Abstract

In computational linguistics, a reliability measurement of 0.8 on some statistic such as κ is widely thought to guarantee that hand-coded data is fit for purpose, with 0.67 to 0.8 tolerable, and lower values suspect. We demonstrate that the main use of such data, machine learning, can tolerate data with low reliability as long as any disagreement among…
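To make the thresholds above concrete, here is a minimal sketch of how such a reliability score might be computed and read against the conventional bands. It assumes Cohen's κ for two annotators, although the abstract only says "some statistic such as κ". The function names `cohen_kappa` and `conventional_band` and the toy labels in the usage note are hypothetical illustrations, not taken from the paper.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labelled the same items.

    Assumption: two annotators, nominal categories. The paper may use a
    different agreement statistic.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label sequences of equal length")
    n = len(labels_a)
    # Observed agreement: fraction of items given the same label by both.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement, estimated from each annotator's own label counts.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    p_e = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if p_e == 1.0:
        # Both annotators used a single identical label throughout.
        return 1.0
    return (p_o - p_e) / (1 - p_e)


def conventional_band(kappa):
    """Map a score onto the rule-of-thumb bands quoted in the abstract."""
    if kappa >= 0.8:
        return "fit for purpose"
    if kappa >= 0.67:
        return "tolerable"
    return "suspect"
```

For example, with the made-up labels `["x", "x", "y", "y"]` and `["x", "x", "y", "x"]`, observed agreement is 0.75 and chance agreement is 0.5, so κ is 0.5 and the conventional reading would be "suspect". The abstract's claim is that such a reading, taken alone, does not settle whether the data is usable for machine learning.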

3 Figures and Tables
