Automatic extraction of titles from general documents using machine learning

@inproceedings{Hu2005AutomaticEO,
  title={Automatic extraction of titles from general documents using machine learning},
  author={Yunhua Hu and Hang Li and Yunbo Cao and Dmitriy Meyerzon and Qinghua Zheng},
  booktitle={Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '05)},
  year={2005},
  pages={145--154}
}
In this paper, we propose a machine learning approach to title extraction from general documents. By general documents, we mean documents that can belong to any one of a number of specific genres, including presentations, book chapters, technical papers, brochures, reports, and letters. Previously, methods have been proposed mainly for title extraction from research papers. It has not been clear whether it could be possible to conduct automatic title extraction from general documents. As a case…