CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection

@inproceedings{Blaschke2020CyberWallEAS,
  title={CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection},
  author={Verena Blaschke and M. V. Korniyenko and Sam Tureski},
  booktitle={SEMEVAL},
  year={2020}
}
This paper describes our participation in the SemEval-2020 task Detection of Propaganda Techniques in News Articles. We participate in both subtasks: Span Identification (SI) and Technique Classification (TC). We use a bi-LSTM architecture in the SI subtask and train a complex ensemble model for the TC subtask. Our architectures are built using embeddings from BERT in combination with additional lexical features and extensive label post-processing. Our systems achieve a rank of 8 out of 35… 

SemEval-2020 Task 11: Detection of Propaganda Techniques in News Articles

TLDR
The results and the main findings of SemEval-2020 Task 11 on Detection of Propaganda Techniques in News Articles are presented and the system submissions and the methods they used are discussed.

Multilingual Disinformation Detection for Digital Advertising

TLDR
A machine learning model based on multilingual text embeddings that determines whether the page mentions a topic of interest, then estimates the likelihood of the content being malicious, creating a shortlist of publishers that will be reviewed by human experts.

NNPCov19: Artificial Neural Network-Based Propaganda Identification on Social Media in COVID-19 Era

TLDR
An improved artificial neural network algorithm is proposed to classify tweets into propagandistic and nonpropagandistic classes; it outperforms other machine learning algorithms, with 77.15% accuracy, 77% recall, and 79% precision.

References

SemEval-2020 Task 11: Detection of Propaganda Techniques in News Articles

TLDR
The results and the main findings of SemEval-2020 Task 11 on Detection of Propaganda Techniques in News Articles are presented and the system submissions and the methods they used are discussed.

Neural Architectures for Fine-Grained Propaganda Detection in News

TLDR
This system uses multi-granularity and multi-tasking neural architectures to jointly perform both sentence-level and fragment-level propaganda detection, and investigates different ensemble schemes such as majority-voting, relax-voting, etc. to boost overall system performance.

Fine-Grained Propaganda Detection with Fine-Tuned BERT

TLDR
This paper presents the winning solution of the Fragment Level Classification (FLC) task in the Fine Grained Propaganda Detection competition at the NLP4IF’19 workshop, and investigates the attention heads in the model, which provide insights into what the model learns, as well as aspects for potential improvements.

Findings of the NLP4IF-2019 Shared Task on Fine-Grained Propaganda Detection

TLDR
This work presents the shared task on Fine-Grained Propaganda Detection, which was organized as part of the NLP4IF workshop at EMNLP-IJCNLP 2019 and saw 12 teams submit systems for the FLC task, and 14 teams eventually submitted a system description paper.

JUSTDeep at NLP4IF 2019 Task 1: Propaganda Detection using Ensemble Deep Learning Models

TLDR
This research paper provides an ensemble deep learning model using BiLSTM, XGBoost, and BERT to detect propaganda, and it shows a significant performance improvement over the baseline model.

Fine-Grained Analysis of Propaganda in News Articles

TLDR
A novel task: performing fine-grained analysis of texts by detecting all fragments that contain propaganda techniques as well as their type is proposed, and a novel multi-granularity neural network is designed that outperforms several strong BERT-based baselines.

On Sentence Representations for Propaganda Detection: From Handcrafted Features to Word Embeddings

TLDR
This work explores the task of sentence-level propaganda detection, and experiments with both handcrafted features and learned dense semantic representations in an attempt to curb the influence of class distribution on the system’s performance.

Divisive Language and Propaganda Detection using Multi-head Attention Transformers with Deep Learning BERT-based Language Models for Binary Classification

On the NLP4IF 2019 sentence level propaganda classification task, we used a BERT language model that was pre-trained on Wikipedia and BookCorpus as team ltuorp ranking #1 of 26. It uses deep learning

Detection of Propaganda Using Logistic Regression

TLDR
A Logistic Regression-based tool that automatically classifies whether a sentence is propagandistic or not is developed, using features like TF-IDF, BERT vector, sentence length, readability grade level, emotion feature, LIWC feature and emphatic content feature to help differentiate these two categories.

Proppy: A System to Unmask Propaganda in Online News

We present proppy, the first publicly available real-world, real-time propaganda detection system for online news, which aims at raising awareness, thus potentially limiting the impact of propaganda