Jonathan Pool

Learn More
In a multilingual Semantic Web, authors might write in precise, expressive varieties of diverse languages. Do such controlled languages exist? Of 41 candidates, just 4 were (1) designed for multiple domains and genres and (2) documented enough for evaluation. A sample of Web statements on health and human rights revealed limited expressivity or precision in(More)
PanLex, a project of The Long Now Foundation, aims to enable the translation of lexemes among all human languages in the world. By focusing on lexemic translations, rather than grammatical or corpus data, it achieves broader lexical and language coverage than related projects. The PanLex database currently documents 20 million lexemes in about 9,000(More)
PanLex is a lemmatic translation resource which combines a large number of translation dictionaries and other translin-gual lexical resources. It currently covers 1353 language varieties and 12M expressions , but aims to cover all languages and up to 350M expressions. This paper describes the resource and current applications of it, as well as lextract, a(More)
  • Jonathan Pool, James Coleman, Probal Dasˆgupto, Rüdiger Eichholz, David D Laitin, Klaus Schubert +1 other
  • 1991
78 Antagonists in the ancient controversy over world multilingualism agree that a successful artificial language must overcome a coordination problem: to motivate learners when few speak the language. It is believed a takeoff point must be reached, after which the spread of such a language would be self-sustaining. This problem may also frustrate other(More)
Statistical MT is limited by reliance on large parallel corpora. We propose Lemmatic MT, a new paradigm that extends MT to a far broader set of languages, but requires substantial manual encoding effort. We present PANLINGUAL TRANSLATOR, a prototype Lemmatic MT system with high translation adequacy on 59% to 99% of sentences (average 84%) on a sample of 6(More)
The need for translation among the world's thousands of natural languages makes information access and communication costly. One possible solution is lemmatic communication: A human sender encodes a message into sequences of lemmata (dictionary words), a massively multilingual lexical translation engine translates them into lemma sequences in a target(More)
Like other linguistically diverse institutions, the European Union promotes conflicting values as it chooses official languages. Increased activism and the admission of new states threaten a language-policy crisis. One approach to a solution is to analyze more carefully than before the universe of possible language regimes for institutions like the EU and(More)
The world is losing some of its 7,000 languages. Hypothesizing that language attrition might subside if all languages were intertranslatable, the PanLex project supports panlingual lexical translation by integrating all known lexical translations. Semantic Web technologies can flexibly represent and reason with the content of its database and interlink it(More)