• Corpus ID: 13915840

Bill Gates is not a Parking Meter: Philosophical Quality Control in Automated Ontology-building

@inproceedings{Legg2012BillGI,
  title={Bill Gates is not a Parking Meter: Philosophical Quality Control in Automated Ontology-building},
  author={Catherine Legg and Samuel Sarjant},
  year={2012}
}
The somewhat old-fashioned concept of philosophical categories is revived and put to work in automated ontology building. We describe a project harvesting knowledge from Wikipedia’s category network in which the principled ontological structure of Cyc was leveraged to furnish an extra layer of accuracychecking over and above more usual corrections which draw on automated measures of semantic relatedness. 
2 Citations

Massive Ontology Interface

An evaluation by 30 users comparing MOI with OpenCyc's original interface showed significant improvements in user understanding of the ontology, although full testing of the interface's social elements lies in the future.

Reasoning-Supported Quality Assurance for Knowledge Bases

A sophisticated framework and the corresponding tool support for partially automating the inspection of ontologies with respect to accuracy is developed and it is shown that this estimate can effectively be learned on-the-fly over the course.

References

SHOWING 1-10 OF 30 REFERENCES

"All You Can Eat" Ontology-Building: Feeding Wikipedia to Cyc

This work describes methods used to add 35K new concepts mined from Wikipedia to collections in ResearchCyc entirely automatically and shows how Cyc itself can be leveraged for ontological quality control by ‘feeding’ it assertions one by one, enabling it to reject those that contradict its other knowledge.

Deriving a Large-Scale Taxonomy from Wikipedia

A large scale taxonomy containing a large amount of subsumption is derived using methods based on connectivity in the network and lexicosyntactic matching to label the semantic relations between categories in Wikipedia.

Yago: a core of semantic knowledge

YAGO builds on entities and relations and currently contains more than 1 million entities and 5 million facts, which includes the Is-A hierarchy as well as non-taxonomic relations between entities (such as HASONEPRIZE).

Integrating Cyc and Wikipedia: Folksonomy meets rigorously defined common-sense

This work maps categories from the largest manually-built ontology, Cyc, onto Wikipedia articles describing corresponding concepts, drawing both on Wikipedia’s rich but chaotic hyperlink structure and Cyc's carefully defined taxonomic and common-sense knowledge.

Mining Domain-Specific Thesauri from Wikipedia: A Case Study

It is shown how the classic thesaurus structure of terms and links can be mined automatically from Wikipedia, and it is found that Wikipedia contains a substantial proportion of its concepts and semantic relations.

Decoding Wikipedia Categories for Knowledge Acquisition

The results support the idea that Wikipedia category names are a rich source of useful and accurate knowledge.

Mining Meaning from Wikipedia

Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language

  • P. Resnik
  • Computer Science
    J. Artif. Intell. Res.
  • 1999
This article presents a measure of semantic similarity in an IS-A taxonomy based on the notion of shared information content that performs better than the traditional edge-counting approach.

Ontologies on the Semantic Web

  • C. Legg
  • Computer Science
    Annu. Rev. Inf. Sci. Technol.
  • 2007
There has been a convergence between the Semantic Web research community and an older tradition with roots in classical Artificial Intelligence (AI) research whose goal is to develop a formal ontology.