Examining Wikipedia With a Broader Lens: Quantifying the Value of Wikipedia's Relationships with Other Large-Scale Online Communities

@article{Vincent2018ExaminingWW,
  title={Examining Wikipedia With a Broader Lens: Quantifying the Value of Wikipedia's Relationships with Other Large-Scale Online Communities},
  author={Nicholas Vincent and Isaac L. Johnson and Brent J. Hecht},
  journal={Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems},
  year={2018}
}
The extensive Wikipedia literature has largely considered Wikipedia in isolation, outside of the context of its broader Internet ecosystem. Very recent research has demonstrated the significance of this limitation, identifying critical relationships between Google and Wikipedia that are highly relevant to many areas of Wikipedia-based research and practice. This paper extends this recent research beyond search engines to examine Wikipedia's relationships with large-scale online communities… 

Figures and Tables from this paper

TWikiL - The Twitter Wikipedia Link Dataset
Recent research has shown how strongly connected Wikipedia and other web applications are. For example, search engines rely heavily on surfacing Wikipedia links to satisfy their users’ information
On the Value of Wikipedia as a Gateway to the Web
TLDR
A detailed analysis of usage logs gathered from Wikipedia users’ client devices sheds light on Wikipedia’s role not only as an important source of information, but also as a high-traffic gateway to the broader Web ecosystem.
Nitpicking online knowledge representations of governmental leadership. The case of Belgian prime ministers in Wikipedia and Wikidata.
TLDR
This article makes an evidence-based contribution to the study of the fundamental challenges that mark the formalisation of knowledge in the humanities by ‘nitpicking’ knowledge representations of governments and governmental leadership in Wikipedia and Wikidata.
Representation of Music Creators on Wikipedia, Differences in Gender and Genre
Wikipedia is not only the world’s largest online encyclope- dia and among the most frequented websites, but provides important data leveraged by many popular services and prod- ucts. Since Wikipedia
Wikipedia Text Reuse: Within and Without
TLDR
To discover reuse beyond verbatim copy and paste, state-of-the-art text reuse detection technology is employed for the first time to process the entire Wikipedia as part of a distributed retrieval pipeline.
Measuring the Importance of User-Generated Content to Search Engines
TLDR
A rigorous audit of the extent to which Google leverages Wikipedia and other user-generated content to respond to queries shows that Wikipedia appears in over 80% of results pages for some query types and is by far the most prevalent individual content source across all query types.
Investigating the Effects of Google's Search Engine Result Page in Evaluating the Credibility of Online News Sources
TLDR
A study in which participants were instructed to do lateral reading for credibility assessment by inspecting Google's search engine result page (SERP) of unfamiliar news sources, and indicates that there are widespread inconsistencies in the coverage and quality of information included in Knowledge Panels.
Templates and Trust-o-meters: Towards a widely deployable indicator of trust in Wikipedia
TLDR
This work identifies and addresses three key challenges: empirically determining which metrics from prior and existing community approaches most impact reader trust; validating indicator placements and designs that are both compact yet noticed by readers; and demonstrating that such indicators can not only lower trust but also increase perceived trust in the system when appropriate.
Understanding Wikipedia as a Resource for Opportunistic Learning of Computing Concepts
TLDR
It is concluded that Wikipedia is not well suited as a resource for the opportunistic learning of new computing concepts, and information sharing practices in on-line programmer communities are recommended to better account for the learning needs of the users.
Detecting and Gauging Impact on Wikipedia Page Views
TLDR
The authors' models effectively estimate the impact of page preview roll-out, but do not detect a significant change following the video campaign in India and the utility of other geographies or language editions for predicting page views from a given area on a given language edition is discussed.
...
...

References

SHOWING 1-10 OF 77 REFERENCES
The Substantial Interdependence of Wikipedia and Google: A Case Study on the Relationship Between Peer Production Communities and Information Technologies
TLDR
Evidence is found that Google’s critical role in providing readership to Wikipedia is in jeopardy and researchers and practitioners should give deeper consideration to the interdependence between peer production communities and the information technologies that use and surface their content.
Drawing a data-driven portrait of Wikipedia editors
TLDR
Using aggregated log data from Yahoo!'s browser toolbar in order to analyze Wikipedians' editing behavior in the context of their online lives beyond Wikipedia finds that Wikipedia editors search more, read more news, play more games, and, perhaps surprisingly, are more immersed in popular culture.
Determining the Influence of Reddit Posts on Wikipedia Pageviews
TLDR
It is shown that the week in which a post references a specific Wikipedia article is associated with a substantial increase in pageviews relative to prior and successive weeks, and functional PCA is applied to the dataset in order to characterize pageview dynamics.
Who does what: Collaboration patterns in the wikipedia and their impact on article quality
TLDR
It is shown that the quality of Wikipedia articles is not only dependent on the different types of contributors but also on how they collaborate, and various patterns of collaboration based on the provenance or, more specifically, who does what to Wikipedia articles are identified.
Making peripheral participation legitimate: reader engagement experiments in wikipedia
TLDR
A set of field experiments using the Article Feedback Tool, a system designed to elicit lightweight contributions from Wikipedia's readers, show both qualitatively and quantitatively that peripheral contributors add value to an open collaboration community as long as the cost of identifying low quality contributions remains low.
It's a Man's Wikipedia? Assessing Gender Inequality in an Online Encyclopedia
TLDR
This paper presents and applies a computational method for assessing gender bias on Wikipedia along multiple dimensions and finds that while women on Wikipedia are covered and featured well in many Wikipedia language editions, the way women are portrayed starkly differs from the way men are portrayed.
What's in a Name? Understanding the Interplay between Titles, Content, and Communities in Social Media
TLDR
This paper studies resubmissions, i.e., content that has been submitted multiple times, with multiple titles, to multiple different communities to understand how to better target social media content: by using the right title, for the right community, at the right time.
The Wikipedia Gender Gap Revisited: Characterizing Survey Response Bias with Propensity Score Estimation
TLDR
Revised estimates are constructed for several of the Wikimedia Foundation and United Nations University at Maastricht claims about Wikipedia editors that the proportion of female US adult editors was 27.5% higher than the original study reported.
Misalignment Between Supply and Demand of Quality Content in Peer Production Communities
TLDR
An analytical framework for studying the relationship between content production and consumption in peer production communities is introduced and it is observed that there is an excess of high-quality content about certain specific topics, and that the majority of articles with insufficient quality are in a stable state.
Measuring self-focus bias in community-maintained knowledge repositories
TLDR
Two methods to quantify self- focus are outlined, one of which is very computationally inexpensive, and empirical evidence for the existence of self-focus is presented using a "hyperlingual" approach that examines 15 different language editions of Wikipedia.
...
...