Catherine Chavula

  • Citations Per Year
Learn More
The current enormous amount of data on the Semantic Web and its increasing uptake raises the question of how this data can be accessed in several languages. OWL provides limited support for multilingualism through the use of an annotation property. However, it is known that more expressive models are required for linguistically demanding applications. Among(More)
Despite the availability of massive open information and efforts to promote multilingualism on the Web, content in Bantu languages remains negligible. Additionally, Information Retrieval (IR) systems, such as the Google search engine, use algorithms that work well with languages that have the most content. Similarities across related languages such as(More)
Unsupervised morphological segmentation is attractive for low density languages that have little linguistic description, such as many of the Bantu languages. However, techniques that cluster morphologically related words use string similarity metrics that are more suited for languages that have simple morphological systems. This paper proposes a weighted(More)
South Africa is a linguistically diverse country: it is a home to 11 official languages of which nine, excluding English and Afrikaans, are Resource Scarce Languages (RSLs). Accordingly, many South Africans struggle to access information written in their native languages on the Web. Unfortunately, lack of access to information hinders social economic(More)
  • 1