Addressing structural and linguistic heterogeneity in the Web

Jacobo Rouces, Gerard de Melo, Katja Hose
AI Commun.
An increasing number of structured knowledge bases have become available on the Web, enabling many new forms of analyses and applications. However, the fact that the data is being published by different parties with different vocabularies and ontologies means that there is a high degree of heterogeneity and no common schema. At the same time, the abundance of different human languages across unstructured data presents a similar problem, because most text mining tools only cater to the English…