Experiments in Automatic Library of Congress Classification

Abstract

This article presents the results of research into the automatic selection of Library of Congress Classification numbers based on the titles and subject headings in MARC records. The method used in this study was based on partial match retrieval techniques using various elements of new records (i.e., those to be classified) as “queries,” and a test database of classification clusters generated from previously classified MARC records. Sixty individual methods for automatic classification were tested on a set of 283 new records, using all combinations of four different partial match methods, five query types, and three representations of search terms. The results indicate that if the best method for a particular case can be determined, then up to 88% of the new records may be correctly classified. The single method with the best accuracy was able to select the correct classification for about 46% of the new records.
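
The experimental design described above can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not name the four partial match methods, the five query types, or the three term representations, so the labels below are hypothetical placeholders. The scoring rule (a simple count of shared terms between the query and each classification cluster) is an assumed stand-in for a partial match technique, and the class labels and records are invented toy data.

```python
from collections import Counter, defaultdict
from itertools import product

# Hypothetical placeholder labels: the abstract gives only the counts (4, 5, 3).
MATCH_METHODS = [f"match_{i}" for i in range(1, 5)]
QUERY_TYPES = [f"query_{i}" for i in range(1, 6)]
TERM_FORMS = [f"terms_{i}" for i in range(1, 4)]

# Every combination of the three factors: 4 * 5 * 3 = 60 configurations.
CONFIGURATIONS = list(product(MATCH_METHODS, QUERY_TYPES, TERM_FORMS))


def tokenize(text):
    """Lowercase the text and strip basic punctuation from each word."""
    words = (w.strip(".,;:()").lower() for w in text.split())
    return [w for w in words if w]


def build_clusters(classified_records):
    """Pool the terms of all records that share a class number."""
    clusters = defaultdict(Counter)
    for class_number, text in classified_records:
        clusters[class_number].update(tokenize(text))
    return clusters


def classify(query_text, clusters):
    """Return the class whose cluster shares the most terms with the query.

    Assumed scoring rule (shared-term count), not one of the paper's methods.
    """
    query_terms = set(tokenize(query_text))
    scores = {c: len(query_terms & set(terms)) for c, terms in clusters.items()}
    return max(scores, key=scores.get)


# Invented toy data: (class label, title plus subject headings).
records = [
    ("CLASS_A", "Information retrieval systems"),
    ("CLASS_A", "Automatic indexing of documents"),
    ("CLASS_B", "Birds of North America"),
]
clusters = build_clusters(records)
predicted = classify("Retrieval of indexed documents", clusters)
```

In this sketch a new record's title acts as the query, and each cluster stands for all previously classified records under one class number. Evaluating a configuration would then amount to running `classify` over the new records and counting how often the selected class matches the one assigned by a cataloger.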

DOI: 10.1002/(SICI)1097-4571(199203)43:2%3C130::AID-ASI3%3E3.0.CO;2-S

13 Figures and Tables

Statistics

[Figure: citations per year, 1995 to 2017]

85 Citations

Semantic Scholar estimates that this publication has 85 citations based on the available data.

Cite this paper

@article{Larson1992ExperimentsIA, title={Experiments in Automatic Library of Congress Classification}, author={Ray R. Larson}, journal={JASIS}, year={1992}, volume={43}, pages={130-148} }