Unsupervised Segmentation of Bibliographic Elements with Latent Permutations

Abstract

This paper introduces a novel approach for large-scale unsupervised segmentation of bibliographic elements. Our problem is to segment a word token sequence representing a citation into subsequences each corresponding to a different bibliographic element, e.g. authors, paper title, journal name, publication year, etc. Obviously, each bibliographic element… (More)
DOI: 10.1007/978-3-642-24396-7_20

Topics

5 Figures and Tables