Analyzing and visualizing the semantic coverage of Wikipedia and its authors
@article{Holloway2007AnalyzingAV, title={Analyzing and visualizing the semantic coverage of Wikipedia and its authors}, author={Todd Holloway and Miran Bozicevic and Katy B{\"o}rner}, journal={ArXiv}, year={2007}, volume={abs/cs/0512085} }
This article presents a novel analysis and visualization of English Wikipedia data. Our specific interest is the analysis of basic statistics, the identification of the semantic structure and the age of the categories in this free online encyclopedia, and the content coverage of its highly productive authors. © 2007 Wiley Periodicals, Inc. Complexity: 12: 30–40, 2007
Figures and Tables from this paper
203 Citations
Wikipedia category visualization using radial layout
- Computer ScienceInt. Sym. Wikis
- 2011
The design of an information visualization tool is presented that produces overview diagrams of Wikipedia's articles distributed according to category relationships, and examples of visualizing English Wikipedia are shown.
Automatically assigning Wikipedia articles to macro-categories
- Computer Science
- 2011
This paper modified an existing approach, based on the shortest paths between categories, in order to account for the direction of the hierarchy, and presents a technique which leverages this rich and disordered graph to assign each article to one or more topics.
A link-based visual search engine for Wikipedia
- Computer ScienceJCDL '11
- 2011
HMpara, a new search engine that aims to make Wikipedia easier to explore, works on top of the encyclopedia's existing link structure, abstracting away from document content and allowing users to navigate the resource at a higher level.
Identification of Wikipedia categories associations based on articles similarities
- Computer Science
- 2013
The evaluation of the proposed method indicate it allows to reconstruct already existing associations in category structure as well as introduce new significant relations.
Assigning Wikipedia articles to macro-categories
- Computer Science
- 2011
This paper modified an existing approach, based on the shortest paths between categories, in order to account for the direction of the hierarchy, and presents a technique which leverages this rich and disordered graph to assign each article to one or more topics.
Network Analysis of Wikipedia
- Economics
- 2008
The data suggest at least three stages of growth, the last of which has only recently emerged, and how growth depends upon infrastructure and internal links is considered.
Wikipedia research and tools: Review and comments
- Geology
- 2012
An overview of Wikipedia and wiki research and tools is given, which serves to describe some key areas of research.
Topic Calculation and Clustering: An Application to Wikipedia
- Computer Science2008 Seventh Mexican International Conference on Artificial Intelligence
- 2008
A method for finding related Wikipedia articles is proposed, which relies on a framework that clusters documents into semantically-calculated topics and selects the closest documents which could enrich the "See Also" section.
Quantitative data and graphics on lexical specificity and index readability: the case of Wikipedia
- Linguistics
- 2009
To what extent collaboratively produced Wikipedia entries are readable and standardized in a way not very dissimilar from those produced by experts in the Encyclopaedia Britannica Online is shown.
A method for category similarity calculation in Wikis
- EngineeringInt. Sym. Wikis
- 2010
A method to calculate similarities between categories is presented, illustrated by a calculation for the top-level categories in the Simple English version of Wikipedia.
References
SHOWING 1-10 OF 36 REFERENCES
Evaluating authoritative sources using social networks: an insight from Wikipedia
- Computer Science, SociologyOnline Inf. Rev.
- 2006
It is believed that the approach presented here could be used to improve the authoritativeness of content found in Wikipedia and similar sources and approaches the problem of quality Wikipedia content from a social network point of view.
Wikipedias: collaborative web-based encyclopedias as complex networks.
- Computer SciencePhysical review. E, Statistical, nonlinear, and soft matter physics
- 2006
An analysis of Wikipedias in several languages as complex networks is presented, showing that many network characteristics are common to different language versions of Wikipedia, such as their degree distributions, growth, topology, reciprocity, clustering, assortativity, path lengths, and triad significance profiles.
Studying cooperation and conflict between authors with history flow visualizations
- Computer ScienceCHI
- 2004
This paper investigates the dynamics of Wikipedia, a prominent, thriving wiki, and focuses on the relevance of authorship, the value of community surveillance in ameliorating antisocial behavior, and how authors with competing perspectives negotiate their differences.
Mapping the backbone of science
- Computer ScienceScientometrics
- 2005
A new map representing the structure of all of science, based on journal articles, is presented, including both the natural and social sciences, including biochemistry, which appears as the most interdisciplinary discipline in science.
Knowledge Mining With VxInsight: Discovery Through Interaction
- Computer ScienceJournal of Intelligent Information Systems
- 2004
A set of properties that such a presentation should have is discussed, and the design and functionality of VxInsight, a visualization tool built to these principles are described.
Studying the emerging global brain: Analyzing and visualizing the impact of co-authorship teams
- Computer ScienceComplex.
- 2005
A novel weighted graph representation is presented that encodes coupled author-paper networks as a weighted co-authorship graph that indicates a drift toward a more cooperative, global collaboration process as the main drive in the production of scientific knowledge.
ThemeRiver: Visualizing Thematic Changes in Large Document Collections
- Environmental ScienceIEEE Trans. Vis. Comput. Graph.
- 2002
The ThemeRiver visualization depicts thematic variations over time within a large collection of documents and uses a river metaphor to convey several key notions, allowing a user to discern patterns that suggest relationships or trends.
The plane with parallel coordinates
- MathematicsThe Visual Computer
- 2005
A new duality betweenbounded and unbounded convex sets and hstars (a generalization of hyperbolas) and between Convex Unions and Intersections is found and motivates some efficient ConveXity algorithms and other results inComputational Geometry.