The Detection and effect of social events on Wikipedia data-set for studying human preferences

@article{Assuied2022TheDA,
  title={The Detection and effect of social events on Wikipedia data-set for studying human preferences},
  author={Julien Assuied and Y{\'e}rali Gandica},
  journal={ArXiv},
  year={2022},
  volume={abs/2205.06998}
}
Several studies have used Wikipedia (WP) data-set to analyse worldwide human preferences by languages. However, those studies could suffer from bias related to exceptional social circumstances. Any massive event promoting the exceptional edition of WP can be defined as a source of bias. In this article, we follow a procedure for detecting outliers. Our study is based on 12 languages and 13 different categories. Our methodology defines a parameter, which is language-depending instead of being… 

Figures and Tables from this paper

References

SHOWING 1-5 OF 5 REFERENCES

Linguistic neighbourhoods: explaining cultural borders on Wikipedia through multilingual co-editing activity

This study sheds light on how culture is reflected in the collective process of archiving knowledge on Wikipedia, and demonstrates that cross-lingual interconnections on Wikipedia are not dominated by one powerful language.

Multiple Texts as a Limiting Factor in Online Learning: Quantifying (Dis-)similarities of Knowledge Networks

The article develops a hybrid model of intra- and intertextual similarity of different parts of the information landscape and tests this model on the example of 35 languages and corresponding Wikipedias, and goes beyond existing approaches by examining their structural and semantic aspects intra-and intertextually.

Robust statistics for outlier detection

An overview of several robust methods and outlier detection tools for univariate, low‐dimensional, and high‐dimensional data such as estimation of location and scatter, linear regression, principal component analysis, and classification are presented.

Not in the other language

    Population's preferences trough Wikipedia edits

    • Fifth International Conference on Social Networks Analysis, Management and Security (SNAMS)
    • 2018