• Corpus ID: 233365095

TeamUNCC@LT-EDI-EACL2021: Hope Speech Detection using Transfer Learning with Transformers

  title={TeamUNCC@LT-EDI-EACL2021: Hope Speech Detection using Transfer Learning with Transformers},
  author={Khyati Mahajan and Erfan Al-Hossami and Samira Shaikh},
In this paper, we describe our approach towards utilizing pre-trained models for the task of hope speech detection. We participated in Task 2: Hope Speech Detection for Equality, Diversity and Inclusion at LT-EDI-2021 @ EACL2021. The goal of this task is to predict the presence of hope speech, along with the presence of samples that do not belong to the same language in the dataset. We describe our approach to fine-tuning RoBERTa for Hope Speech detection in English and our approach to fine… 

Figures and Tables from this paper

CURAJ_IIITDWD@LT-EDI-ACL 2022: Hope Speech Detection in English YouTube Comments using Deep Learning Techniques
Several deep learning based models such as DNN (dense or fully connected neural network), CNN (Convolutional Neural Network), Bi-LSTM (Bidirectional Long Short Term Memory Network), and GRU(Gated Recurrent Unit) were employed to identify the hopeful comments in English-language YouTube comments.
LPS@LT-EDI-ACL2022:An Ensemble Approach about Hope Speech Detection
The task shared by sponsor about Hope Speech Detection for Equality, Diversity, and Inclusion at LT-EDI-ACL-2022.The goal of this task is to identify whether a given comment contains hope speech or
CIC@LT-EDI-ACL2022: Are transformers the only hope? Hope speech detection for Spanish and English comments
Hope is an inherent part of human life and essential for improving the quality of life. Hope increases happiness and reduces stress and feelings of helplessness. Hope speech is the desired outcome
Overlapping Word Removal is All You Need: Revisiting Data Imbalance in Hope Speech Detection
It is shown that introducing focal loss as part of Multilingual-BerT’s (M-BERT) training process mitigates the ef- fect of class imbalance and improves overall F1-Macro by 0.11, and that overlapping word removal based on pre-processing, though simple, improves F 1- Macro by0.28.
Findings of the Shared Task on Hope Speech Detection for Equality, Diversity, and Inclusion
The shared task of hope speech detection for Tamil, English, and Malayalam languages was conducted as a part of the EACL 2021 workshop on Language Technology for Equality, Diversity, and Inclusion.
Multilingual hope speech detection in English and Dravidian languages
A multilingual hope speech dataset that promotes equality, diversity and inclusion (EDI) in English, Tamil, Malayalam and Kannada is presented that contains data collected from the LGBTQIA+ community, persons with disabilities and women in science, engineering, technology and management (STEM).
Hope speech detection in YouTube comments
This work created a multilingual dataset to recognize and encourage positivity in the comments, and proposes a novel custom deep network architecture, which uses a concatenation of embedding from T5-Sentence.
Pilot Recommender System Enabling Students to Indirectly Help Each Other and Foster Belonging Through Reflections
Without a sense of belonging, students may become disheartened and give up when faced with new challenges. Moreover, with the sudden growth of remote learning due to COVID-19, it may be even more
Clustering Students’ Short Text Reflections: A Software Engineering Course Case Study
Student reflections can provide instructors with beneficial knowledge regarding their progress in the course, what challenges they are facing, and how the instructor can provide more effectively to
Overview of the Shared Task on Hope Speech Detection for Equality, Diversity, and Inclusion
An overview of the findings and results from the shared task on hope speech detection for Tamil, Malayalam, Kannada, English and Spanish languages conducted in the second workshop on Language Technology for Equality, Diversity and Inclusion organised as a part of ACL 2022 is reported.


HopeEDI: A Multilingual Hope Speech Detection Dataset for Equality, Diversity, and Inclusion
A Hope Speech dataset for Equality, Diversity and Inclusion (HopeEDI) containing user-generated comments from the social media platform YouTube with 28,451, 20,198 and 10,705 comments in English, Tamil and Malayalam, respectively, manually labelled as containing hope speech or not is constructed.
IIITT@LT-EDI-EACL2021-Hope Speech Detection: There is always hope in Transformers
This paper portrays the work for the Shared Task on Hope Speech Detection for Equality, Diversity, and Inclusion at LT-EDI 2021- EACL 2021 and works with several transformer-based models to classify social media comments as hope speech or not hope speech in English, Malayalam, and Tamil languages.
HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection
HateXplain is introduced, the first benchmark hate speech dataset covering multiple aspects of the issue, and it is observed that existing state-of-the-art models, which utilize the human rationales for training, perform better in re- ducing unintended bias towards target communities.
SemEval-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter
The paper describes the organization of the SemEval 2019 Task 5 about the detection of hate speech against immigrants and women in Spanish and English messages extracted from Twitter, and provides an analysis and discussion about the participant systems and the results they achieved in both subtasks.
Making Monolingual Sentence Embeddings Multilingual Using Knowledge Distillation
An easy and efficient method to extend existing sentence embedding models to new languages by using the original (monolingual) model to generate sentence embeddings for the source language and then training a new system on translated sentences to mimic the original model.
Hate Speech Detection: A Solved Problem? The Challenging Case of Long Tail on Twitter
This work proposes novel Deep Neural Network structures serving as effective feature extractors, and explores the usage of background information in the form of different word embeddings pre-trained from unlabelled corpora to address the very challenging nature of identifying hate speech on the social media.
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
A benchmark of nine diverse NLU tasks, an auxiliary dataset for probing models for understanding of specific linguistic phenomena, and an online platform for evaluating and comparing models, which favors models that can represent linguistic knowledge in a way that facilitates sample-efficient learning and effective knowledge-transfer across tasks.
ETHOS: a multi-label hate speech detection dataset
‘ETHOS’ (multi-labEl haTe speecH detectiOn dataSet), a textual dataset with two variants: binary and multi-label, based on YouTube and Reddit comments validated using the Figure-Eight crowdsourcing platform, and the annotation protocol used to create this dataset is presented.
Transformers: State-of-the-Art Natural Language Processing
Transformers is an open-source library that consists of carefully engineered state-of-the art Transformer architectures under a unified API and a curated collection of pretrained models made by and available for the community.
Overview of the HASOC Track at FIRE 2020: Hate Speech and Offensive Language Identification in Tamil, Malayalam, Hindi, English and German
This paper presents the HASOC track and its two parts, creating test collections for languages with few resources and English for comparison, and presents the tasks, the data and the main results.