Corpus ID: 63815682

Automatic extraction of bibliography with machine learning

@inproceedings{2003AutomaticEO,
  title={Automatic extraction of bibliography with machine learning},
  author={武 阿辺川 and 英嗣 難波 and 大也 高村 and 学 奥村},
  year={2003}
}
2 Citations

Topics from this paper

Empirical Evaluation of CRF-Based Bibliography Extraction from Reference Strings
TLDR
An empirical evaluation of a CRF-based bibliography parser developed for reference strings of research papers using a conditional random field to estimate the correct bibliographic label such as an author's name and a title for each token in a reference string. Expand
Bibliographic element extraction from scanned documents using conditional random fields
TLDR
An automatic bibliographic element extraction method for academic articles scanned with OCR markup that labels text blocks as predetermined bibliographical elements and further labels the characters in each labeled text block if necessary, and extracts each author name strings from the authorspsila text block. Expand

References

SHOWING 1-4 OF 4 REFERENCES
Template mining for the extraction of citation from digital documents
TLDR
This study hypothesizes that the template mining technique can be used to extract citation information from printed and digital full-text articles so that universal or semi-universal citation databases can be automatically established before too long in the future. Expand
Building Domain-Specific Search Engines with Machine Learning Techniques
TLDR
New research in reinforcement learning, information extraction and text classification that enables efficient spidering, identifying informative text segments, and populating topic hierarchies is described. Expand
Digital Libraries and Autonomous Citation Indexing
TLDR
Digital libraries incorporating ACI can help organize scientific literature and may significantly improve the efficiency of dissemination and feedback and speed the transition to scholarly electronic publishing. Expand
Learning Hidden Markov Model Structure for Information Extraction
TLDR
It is demonstrated that a manually-constructed model that contains multiple states per extraction field outperforms a model with one state per field, and the use of distantly-labeled data to set model parameters provides a significant improvement in extraction accuracy. Expand