• Publications
  • Influence
Automatic Keyboard Layout Design for Low-Resource Latin-Script Languages
TLDR
We present our approach to automatically designing and implementing keyboard layouts on mobile devices for typing low-resource languages written in the Latin script. Expand
  • 4
  • 1
  • PDF
Writing Across the World's Languages: Deep Internationalization for Gboard, the Google Keyboard
TLDR
We describe how and why we have been adding support for hundreds of language varieties from around the globe, and we describe the trends we see. Expand
  • 8
  • PDF
Language ID in the Wild: Unexpected Challenges on the Path to a Thousand-Language Web Text Corpus
TLDR
Large text corpora are increasingly important for a wide variety of Natural Language Processing (NLP) tasks, and automatic language identification (LangID) is a core technology needed to collect such datasets in a multilingual context. Expand
  • 3
  • PDF
Mining Large-Scale Low-Resource Pronunciation Data From Wikipedia
TLDR
We report on a system we built to mine pronunciation data set in 819 languages from loosely structured tables within Wikipedia. Expand