Measuring article quality in wikipedia: models and evaluation

@inproceedings{Hu2007MeasuringAQ,
  title={Measuring article quality in wikipedia: models and evaluation},
  author={Meiqun Hu and Ee-Peng Lim and Aixin Sun and Hady W. Lauw and Ba-Quy Vuong},
  booktitle={CIKM '07},
  year={2007}
}
Wikipedia has grown to be the world largest and busiest free encyclopedia, in which articles are collaboratively written and maintained by volunteers online. Despite its success as a means of knowledge sharing and collaboration, the public has never stopped criticizing the quality of Wikipedia articles edited by non-experts and inexperienced contributors. In this paper, we investigate the problem of assessing the quality of articles in collaborative authoring of Wikipedia. We propose three… Expand

Figures, Tables, and Topics from this paper

Measuring Quality of Collaboratively Edited Documents: The Case of Wikipedia
  • Quang-Vinh Dang, C. Ignat
  • Computer Science
  • 2016 IEEE 2nd International Conference on Collaboration and Internet Computing (CIC)
  • 2016
TLDR
An automatic assessment method of Wikipedia articles quality is presented by analyzing their content in terms of their format features and readability scores and results show improvements both in Terms of accuracy and information gain compared with other existing approaches. Expand
Structure-Based Features for Predicting the Quality of Articles in Wikipedia
TLDR
A quality model integrating both temporal and structural features captured from the implicit peer review process enabled by Wikipedia is proposed and a generic HITS-like framework is developed and able to capture both the quality of the content and the authority of the associated authors. Expand
Measuring article quality in Wikipedia using the collaboration network
TLDR
This work gives a generic formulation of the Mutual Reinforcement principle held between articles quality and authors authority and take explicitly advantage of the co-edits graph generated by individuals to tackle the problem of automatically establishing the quality of Wikipedia articles. Expand
Measuring article quality in Wikipedia: Lexical clue model
TLDR
A lexical clue based measuring method is proposed to assess article quality in Wikipedia inspired by the idea that the good articles have more regular statistic features on lexical usage than the primary ones due to the more revise by more people. Expand
Statistical measure of quality in Wikipedia
TLDR
This study model the evolution of content quality in Wikipedia articles in order to estimate the fraction of time during which articles retain high-quality status and assess the quality of Wikipedia's featured and non-featured articles. Expand
WikiLyzer: Interactive Information Quality Assessment in Wikipedia
Digital libraries and services enable users to access large amounts of data on demand. Yet, quality assessment of information encountered on the Internet remains an elusive open issue. For example,Expand
QuWi: quality control in Wikipedia
TLDR
Preliminary results demonstrate that the proposed algorithm seems to appropriately identify high and low quality articles, and that high quality authors produce more long-lived contributions than low quality authors. Expand
Statistical Measure of the Effectiveness of the Open Editing Model of Wikipedia
Wikipedia is commonly viewed as the main online encyclopedia. Its content quality, however, has often been questioned due to the open nature of its editing model. A high– quality contribution by anExpand
A Multi-view Approach for the Quality Assessment of Wiki Articles
TLDR
This work proposed to group the indicators in semantically meaningful views of quality and investigated a new approach to combine these views based on a meta-learning method, known as stacking, and demonstrated that it is possible to use this approach in collaborative encyclopedias such as Wikipedia and Wikia. Expand
On ranking controversies in wikipedia: models and evaluation
TLDR
Three models are proposed to identify controversial articles in Wikipedia, namely the Basic model and two Controversy Rank (CR) models, which draw clues from collaboration and edit history instead of interpreting the actual articles or edited content. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 32 REFERENCES
Measuring Wikipedia
Wikipedia, an international project that uses Wiki software to collaboratively create an encyclopaedia, is becoming more and more popular. Everyone can directly edit articles and every edit isExpand
Measuring Qualities of Articles Contributed by Online Communities
TLDR
Two models, namely basic model and peer review model, are developed for measuring the qualities of Wikipedia articles and the authorities of their contributors and show that the two models can effectively determine the articles' qualities and contributors' authorities. Expand
Wikipedia as Participatory Journalism: Reliable Sources? Metrics for evaluating collaborative media as a news resource
  • A. Lih
  • Computer Science, Political Science
  • 2004
TLDR
This study examines the growth of Wikipedia and analyzes the crucial technologies and community policies that have enabled the project to prosper, and establishes a set of metrics based on established encyclopedia taxonomies and analyzed the trends in Wikipedia being used as a source. Expand
Identifying document topics using the Wikipedia category network
TLDR
It is shown that even a fairly simple algorithm that exploits only the titles and categories of Wikipedia articles can characterize documents by Wikipedia categories surprisingly well. Expand
A content-driven reputation system for the wikipedia
TLDR
The results show that the notion of reputation has good predictive value: changes performed by low-reputation authors have a significantly larger than average probability of having poor quality, as judged by human observers, and of being later undone, as measured by the algorithms. Expand
Puppy smoothies: Improving the reliability of open, collaborative wikis
TLDR
This paper provides a practical proposal for improving user confidence in wiki information by coloring the text of a wiki article based on the venerability of the text on the philosophy that bad information is less likely to survive a collaborative editing process over large numbers of edits. Expand
Semantic Wikipedia
TLDR
This paper provides an extension to be integrated in Wikipedia, that allows the typing of links between articles and the specification of typed data inside the articles in an easy-to-use manner, and presents the design, implementation, and possible uses of this extension. Expand
Quality in Internet Collective Goods : Zealots and Good Samaritans in the Case of Wikipedia
One important innovation in information and communication technology developed over the past decade was organizational rather than merely technological. Open source production is remarkable becauseExpand
On the Evolution of Wikipedia
TLDR
This paper aims to provide a first, extensive study of the user behavior on Wikipedia and its evolution, finding that the evolution and updates of Wikipedia is governed by a self-similar process, not by the Poisson process that has been observed for the general Web. Expand
Discovering missing links in Wikipedia
TLDR
The method is proposed consists of two steps: first, a cluster of highly similar pages around a given page is computed, and then candidate links from those similar pages that might be missing on the given page are identified. Expand
...
1
2
3
4
...