ALTO: Active Learning with Topic Overviews for Speeding Label Induction and Document Labeling


Effective text classification requires experts to annotate data with labels; these training data are time-consuming and expensive to obtain. If you know what labels you want, active learning can reduce the number of labeled documents needed. However, establishing the label set remains difficult. Annotators often lack the global knowledge needed to induce a…


