How OCR Performance can Impact on the Automatic Extraction of Dictionary Content Structures
@inproceedings{Khemakhem2019HowOP, title={How OCR Performance can Impact on the Automatic Extraction of Dictionary Content Structures}, author={Mohamed Khemakhem and Ioana Galleron and Geoffrey G. Williams and Laurent Romary and Pedro Javier Ortiz Su{\'a}rez}, year={2019} }
HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.
Geoffrey Williams is a Professor of Applied Linguistics at the University of South Brittany and a researcher at UMR 5316, Litt & Arts at the University Grenoble Alpes.
His research is focused on enriching lexical and encyclopedic legacy resources using deep learning models.
His research is focused on parsing lexical and encyclopedic legacy resources using standard-based machine learning models.
Ioana Galleron is a professor of French literature and Digital Humanities at Sorbonne-Nouvelle and UMR 8094 LATTICE of CNRS.
Laurent Romary is a senior researcher at Inria, team ALMAnaCH, and works on data modelling and standards in humanities computing.
Pedro Ortiz Suárez is a PhD candidate at Inria, team ALMAnaCH (Paris) and