A Python Toolkit for Universal Transliteration

Abstract

We describe ScriptTranscriber, an open-source toolkit for extracting transliterations in comparable corpora from languages written in different scripts. The system includes various methods for extracting potential terms of interest from raw text, for providing guesses on the pronunciations of terms, and for comparing two strings as possible transliterations…
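As a rough illustration of the last step, comparing two strings as possible transliterations, the sketch below scores two guessed pronunciations by normalized edit distance. It is not ScriptTranscriber's API or its actual comparison method. The function names `edit_distance` and `similarity` and the phoneme lists are hypothetical, chosen only for this example.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (strings or phoneme lists)."""
    # prev[j] holds the distance between the first i-1 items of a and the first j items of b.
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def similarity(a, b):
    """Map edit distance to a score in [0, 1]; 1.0 means identical sequences."""
    if not a and not b:
        return 1.0
    return 1.0 - edit_distance(a, b) / max(len(a), len(b))


# Hypothetical pronunciation guesses for a term in two scripts (illustrative only).
source_phones = ["m", "o", "s", "k", "v", "a"]
target_phones = ["m", "a", "s", "k", "v", "a"]
score = similarity(source_phones, target_phones)
print(f"similarity: {score:.3f}")
```

A pair scoring above some chosen threshold would be kept as a transliteration candidate. Both the threshold and the uniform substitution cost are assumptions of this sketch. A phonetically informed cost, where similar sounds are cheaper to substitute, would be a natural refinement.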

4 Figures and Tables
