Comparing "parallel passages" in digital archives

@article{Harris2020ComparingP,
  title={Comparing "parallel passages" in digital archives},
  author={Martyn Harris and M. Levene and Dell Zhang and D. Levene},
  journal={Journal of Documentation},
  year={2020},
  volume={76},
  pages={271--289}
}
The purpose of this paper is to present a language-agnostic approach to facilitate the discovery of “parallel passages” stored in historic and cultural heritage digital archives. The authors explore a novel and relatively simple approach, using a character-based statistical language model combined with a tailored version of the Basic Local Alignment Search Tool (BLAST) to extract exact and approximate string patterns shared between groups of documents. The approach is applicable to a wide range of languages…
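
As a rough illustration of the seed-and-extend idea behind BLAST-style matching, the sketch below finds exact character sequences shared by two texts. It is not the authors' implementation: it covers only exact matches, leaves out the statistical language model and approximate matching entirely, and the function name `shared_passages` and the parameters `k` and `min_len` are hypothetical choices made for this example.

```python
from collections import defaultdict


def shared_passages(a: str, b: str, k: int = 5, min_len: int = 12):
    """Return sorted (start_a, start_b, length) triples for exact shared substrings.

    Seed-and-extend: index every character k-gram of `a`, look up each
    k-gram of `b`, then grow every hit left and right while characters agree.
    Illustrative only; `k` and `min_len` are arbitrary assumed defaults.
    """
    # Map each character k-gram of `a` to the positions where it occurs.
    index = defaultdict(list)
    for i in range(len(a) - k + 1):
        index[a[i:i + k]].append(i)

    found = set()
    for j in range(len(b) - k + 1):
        for i in index.get(b[j:j + k], ()):
            # Extend the seed to the left while the preceding characters match.
            s_a, s_b = i, j
            while s_a > 0 and s_b > 0 and a[s_a - 1] == b[s_b - 1]:
                s_a -= 1
                s_b -= 1
            # Extend the seed to the right while the following characters match.
            e_a, e_b = i + k, j + k
            while e_a < len(a) and e_b < len(b) and a[e_a] == b[e_b]:
                e_a += 1
                e_b += 1
            # Keep only passages long enough to be interesting; the set
            # collapses the duplicates produced by overlapping seeds.
            if e_a - s_a >= min_len:
                found.add((s_a, s_b, e_a - s_a))
    return sorted(found)
```

Because the seeds are character k-grams rather than words, the sketch needs no tokenizer, which is one reason a character-based method can stay language-agnostic. Each returned triple can be turned back into text with `a[start_a:start_a + length]`.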