WikiContradiction: Detecting Self-Contradiction Articles on Wikipedia

  title={WikiContradiction: Detecting Self-Contradiction Articles on Wikipedia},
  author={Cheng-Mao Hsu and Cheng-te Li and Diego S{\'a}ez-Trumper and Yi-Zhan Hsu},
  journal={2021 IEEE International Conference on Big Data (Big Data)},
While Wikipedia has been utilized for fact-checking and claim verification to debunk misinformation and disinformation, it is essential to either improve article quality and rule out noisy articles. Self-contradiction is one of the low-quality article types in Wikipedia. In this work, we propose a task of detecting self-contradiction articles in Wikipedia. Based on the "self-contradictory" template, we create a novel dataset for the self-contradiction detection task. Conventional contradiction… 

Figures and Tables from this paper


Contradiction Detection with Contradiction-Specific Word Embedding
A tailored neural network to learn contradiction-specific word embedding (CWE), which can separate antonyms in the opposite ends of a spectrum and performs comparably with the top-performing system in accuracy of three-category classification.
Finding Contradictions in Text
It is demonstrated that a system for contradiction needs to make more fine-grained distinctions than the common systems for entailment, and is argued for the centrality of event coreference and therefore incorporate such a component based on topicality.
Automated Fact-Checking of Claims from Wikipedia
Automated fact checking is becoming increasingly vital as both truthful and fallacious information accumulate online. Research on fact checking has benefited from large-scale datasets such as FEVER
Contradiction Detection for Rumorous Claims
This work identifies two different contexts in which contradiction emerges: its broader form can be observed across independently posted tweets and its more specific form in threaded conversations, and how the two scenarios differ in terms of central elements of argumentation: claims and conversation structure.
WikiCheck: An End-to-end Open Source Automatic Fact-Checking API based on Wikipedia
A new fact-checking system is presented, the WikiCheck API that automatically performs a facts validation process based on the Wikipedia knowledge base and is comparable to SOTA solutions in terms of accuracy and can be used on low-memory CPU instances.
WikiWrite: Generating Wikipedia Articles Automatically
This work proposes Wiki-Write, a system capable of generating content for new Wikipedia articles automatically and proposes a novel abstractive summarization technique that uses a two-step integer-linear programming (ILP) model to synthesize the assigned content in each section and rewrite the content to produce a well-formed informative summary.
A Computational Approach to Finding Contradictions in User Opinionated Text
A typology of contradictions was proposed, and then the state-of-art deep learning models were adopted and enhanced by three methods of incorporating sentiment analysis to demonstrate its usefulness in understanding contradiction semantically and quantitatively in a large amount of user opinionated text.
FEVER: a Large-scale Dataset for Fact Extraction and VERification
This paper introduces a new publicly available dataset for verification against textual sources, FEVER, which consists of 185,445 claims generated by altering sentences extracted from Wikipedia and subsequently verified without knowledge of the sentence they were derived from.
Monolingual Social Media Datasets for Detecting Contradiction and Entailment
A new 3-way judgement Recognizing Textual Entailment (RTE) resource that originates in the Social Media domain is reported on, and a semi-automatic creation method for the special purpose of information verification, which draws on manually established rumourous claims reported during crisis events is explained.
Predicting Contradiction Intensity: Low, Strong or Very Strong?
Results showed that variation of ratings, variation of polarities, and reviews quantity are the best predictors of contradiction intensity and J48 was the most effective learning approach for this type of classification.