The Acquisition and Sentence Alignment for Academic Bilingual Resources Based on Web Paper Libraries

@article{Sun2009TheAA,
  title={The Acquisition and Sentence Alignment for Academic Bilingual Resources Based on Web Paper Libraries},
  author={Yueheng Sun and Rui Men and Weijie Ni},
  journal={2009 International Conference on Research Challenges in Computer Science},
  year={2009},
  pages={45-48}
}
This paper presents an approach for acquiring academic bilingual resources from the web paper libraries. By analyzing the structured information of web pages, we first implement a customized crawler to download these pages including paper details, and then use a parser to transfer them into XML format. Based on the classic statistical method for sentence… CONTINUE READING