Analyzing and visualizing the semantic coverage of Wikipedia and its authors

@article{Holloway2007AnalyzingAV,
  title={Analyzing and visualizing the semantic coverage of Wikipedia and its authors},
  author={Todd Holloway and Miran Bozicevic and Katy B{\"o}rner},
  journal={ArXiv},
  year={2007},
  volume={abs/cs/0512085}
}
This article presents a novel analysis and visualization of English Wikipedia data. Our specific interest is the analysis of basic statistics, the identification of the semantic structure and the age of the categories in this free online encyclopedia, and the content coverage of its highly productive authors. © 2007 Wiley Periodicals, Inc. Complexity: 12: 30–40, 2007 

Figures and Tables from this paper

Wikipedia category visualization using radial layout
TLDR
The design of an information visualization tool is presented that produces overview diagrams of Wikipedia's articles distributed according to category relationships, and examples of visualizing English Wikipedia are shown.
Automatically assigning Wikipedia articles to macro-categories
TLDR
This paper modified an existing approach, based on the shortest paths between categories, in order to account for the direction of the hierarchy, and presents a technique which leverages this rich and disordered graph to assign each article to one or more topics.
A link-based visual search engine for Wikipedia
TLDR
HMpara, a new search engine that aims to make Wikipedia easier to explore, works on top of the encyclopedia's existing link structure, abstracting away from document content and allowing users to navigate the resource at a higher level.
Identification of Wikipedia categories associations based on articles similarities
TLDR
The evaluation of the proposed method indicate it allows to reconstruct already existing associations in category structure as well as introduce new significant relations.
Assigning Wikipedia articles to macro-categories
TLDR
This paper modified an existing approach, based on the shortest paths between categories, in order to account for the direction of the hierarchy, and presents a technique which leverages this rich and disordered graph to assign each article to one or more topics.
Network Analysis of Wikipedia
TLDR
The data suggest at least three stages of growth, the last of which has only recently emerged, and how growth depends upon infrastructure and internal links is considered.
Wikipedia research and tools: Review and comments
TLDR
An overview of Wikipedia and wiki research and tools is given, which serves to describe some key areas of research.
Topic Calculation and Clustering: An Application to Wikipedia
TLDR
A method for finding related Wikipedia articles is proposed, which relies on a framework that clusters documents into semantically-calculated topics and selects the closest documents which could enrich the "See Also" section.
Quantitative data and graphics on lexical specificity and index readability: the case of Wikipedia
TLDR
To what extent collaboratively produced Wikipedia entries are readable and standardized in a way not very dissimilar from those produced by experts in the Encyclopaedia Britannica Online is shown.
A method for category similarity calculation in Wikis
TLDR
A method to calculate similarities between categories is presented, illustrated by a calculation for the top-level categories in the Simple English version of Wikipedia.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 36 REFERENCES
Evaluating authoritative sources using social networks: an insight from Wikipedia
TLDR
It is believed that the approach presented here could be used to improve the authoritativeness of content found in Wikipedia and similar sources and approaches the problem of quality Wikipedia content from a social network point of view.
Wikipedias: collaborative web-based encyclopedias as complex networks.
TLDR
An analysis of Wikipedias in several languages as complex networks is presented, showing that many network characteristics are common to different language versions of Wikipedia, such as their degree distributions, growth, topology, reciprocity, clustering, assortativity, path lengths, and triad significance profiles.
Studying cooperation and conflict between authors with history flow visualizations
TLDR
This paper investigates the dynamics of Wikipedia, a prominent, thriving wiki, and focuses on the relevance of authorship, the value of community surveillance in ameliorating antisocial behavior, and how authors with competing perspectives negotiate their differences.
Mapping the backbone of science
TLDR
A new map representing the structure of all of science, based on journal articles, is presented, including both the natural and social sciences, including biochemistry, which appears as the most interdisciplinary discipline in science.
Knowledge Mining With VxInsight: Discovery Through Interaction
TLDR
A set of properties that such a presentation should have is discussed, and the design and functionality of VxInsight, a visualization tool built to these principles are described.
Studying the emerging global brain: Analyzing and visualizing the impact of co-authorship teams
TLDR
A novel weighted graph representation is presented that encodes coupled author-paper networks as a weighted co-authorship graph that indicates a drift toward a more cooperative, global collaboration process as the main drive in the production of scientific knowledge.
ThemeRiver: Visualizing Thematic Changes in Large Document Collections
TLDR
The ThemeRiver visualization depicts thematic variations over time within a large collection of documents and uses a river metaphor to convey several key notions, allowing a user to discern patterns that suggest relationships or trends.
The plane with parallel coordinates
TLDR
A new duality betweenbounded and unbounded convex sets and hstars (a generalization of hyperbolas) and between Convex Unions and Intersections is found and motivates some efficient ConveXity algorithms and other results inComputational Geometry.
...
1
2
3
4
...