TeamUNCC@LT-EDI-EACL2021: Hope Speech Detection using Transfer Learning with Transformers
@inproceedings{Mahajan2021TeamUNCCLTEDIEACL2021HS, title={TeamUNCC@LT-EDI-EACL2021: Hope Speech Detection using Transfer Learning with Transformers}, author={Khyati Mahajan and Erfan Al-Hossami and Samira Shaikh}, booktitle={LTEDI}, year={2021} }
In this paper, we describe our approach towards utilizing pre-trained models for the task of hope speech detection. We participated in Task 2: Hope Speech Detection for Equality, Diversity and Inclusion at LT-EDI-2021 @ EACL2021. The goal of this task is to predict the presence of hope speech, along with the presence of samples that do not belong to the same language in the dataset. We describe our approach to fine-tuning RoBERTa for Hope Speech detection in English and our approach to fine…
10 Citations
CURAJ_IIITDWD@LT-EDI-ACL 2022: Hope Speech Detection in English YouTube Comments using Deep Learning Techniques
- Computer ScienceLTEDI
- 2022
Several deep learning based models such as DNN (dense or fully connected neural network), CNN (Convolutional Neural Network), Bi-LSTM (Bidirectional Long Short Term Memory Network), and GRU(Gated Recurrent Unit) were employed to identify the hopeful comments in English-language YouTube comments.
LPS@LT-EDI-ACL2022:An Ensemble Approach about Hope Speech Detection
- PsychologyLTEDI
- 2022
The task shared by sponsor about Hope Speech Detection for Equality, Diversity, and Inclusion at LT-EDI-ACL-2022.The goal of this task is to identify whether a given comment contains hope speech or…
CIC@LT-EDI-ACL2022: Are transformers the only hope? Hope speech detection for Spanish and English comments
- LinguisticsLTEDI
- 2022
Hope is an inherent part of human life and essential for improving the quality of life. Hope increases happiness and reduces stress and feelings of helplessness. Hope speech is the desired outcome…
Overlapping Word Removal is All You Need: Revisiting Data Imbalance in Hope Speech Detection
- Computer ScienceArXiv
- 2022
It is shown that introducing focal loss as part of Multilingual-BerT’s (M-BERT) training process mitigates the ef- fect of class imbalance and improves overall F1-Macro by 0.11, and that overlapping word removal based on pre-processing, though simple, improves F 1- Macro by0.28.
Findings of the Shared Task on Hope Speech Detection for Equality, Diversity, and Inclusion
- Computer ScienceLTEDI
- 2021
The shared task of hope speech detection for Tamil, English, and Malayalam languages was conducted as a part of the EACL 2021 workshop on Language Technology for Equality, Diversity, and Inclusion.
Multilingual hope speech detection in English and Dravidian languages
- Computer ScienceInternational journal of data science and analytics
- 2022
A multilingual hope speech dataset that promotes equality, diversity and inclusion (EDI) in English, Tamil, Malayalam and Kannada is presented that contains data collected from the LGBTQIA+ community, persons with disabilities and women in science, engineering, technology and management (STEM).
Hope speech detection in YouTube comments
- Computer ScienceSocial network analysis and mining
- 2022
This work created a multilingual dataset to recognize and encourage positivity in the comments, and proposes a novel custom deep network architecture, which uses a concatenation of embedding from T5-Sentence.
Pilot Recommender System Enabling Students to Indirectly Help Each Other and Foster Belonging Through Reflections
- EducationLAK
- 2022
Without a sense of belonging, students may become disheartened and give up when faced with new challenges. Moreover, with the sudden growth of remote learning due to COVID-19, it may be even more…
Clustering Students’ Short Text Reflections: A Software Engineering Course Case Study
- Education
- 2021
Student reflections can provide instructors with beneficial knowledge regarding their progress in the course, what challenges they are facing, and how the instructor can provide more effectively to…
Overview of the Shared Task on Hope Speech Detection for Equality, Diversity, and Inclusion
- Computer ScienceLTEDI
- 2022
An overview of the findings and results from the shared task on hope speech detection for Tamil, Malayalam, Kannada, English and Spanish languages conducted in the second workshop on Language Technology for Equality, Diversity and Inclusion organised as a part of ACL 2022 is reported.
References
SHOWING 1-10 OF 33 REFERENCES
HopeEDI: A Multilingual Hope Speech Detection Dataset for Equality, Diversity, and Inclusion
- Computer SciencePEOPLES
- 2020
A Hope Speech dataset for Equality, Diversity and Inclusion (HopeEDI) containing user-generated comments from the social media platform YouTube with 28,451, 20,198 and 10,705 comments in English, Tamil and Malayalam, respectively, manually labelled as containing hope speech or not is constructed.
IIITT@LT-EDI-EACL2021-Hope Speech Detection: There is always hope in Transformers
- Computer ScienceLTEDI
- 2021
This paper portrays the work for the Shared Task on Hope Speech Detection for Equality, Diversity, and Inclusion at LT-EDI 2021- EACL 2021 and works with several transformer-based models to classify social media comments as hope speech or not hope speech in English, Malayalam, and Tamil languages.
HateXplain: A Benchmark Dataset for Explainable Hate Speech Detection
- Computer ScienceAAAI
- 2021
HateXplain is introduced, the first benchmark hate speech dataset covering multiple aspects of the issue, and it is observed that existing state-of-the-art models, which utilize the human rationales for training, perform better in re- ducing unintended bias towards target communities.
SemEval-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter
- Computer Science*SEMEVAL
- 2019
The paper describes the organization of the SemEval 2019 Task 5 about the detection of hate speech against immigrants and women in Spanish and English messages extracted from Twitter, and provides an analysis and discussion about the participant systems and the results they achieved in both subtasks.
Making Monolingual Sentence Embeddings Multilingual Using Knowledge Distillation
- Computer ScienceEMNLP
- 2020
An easy and efficient method to extend existing sentence embedding models to new languages by using the original (monolingual) model to generate sentence embeddings for the source language and then training a new system on translated sentences to mimic the original model.
Hate Speech Detection: A Solved Problem? The Challenging Case of Long Tail on Twitter
- Computer ScienceSemantic Web
- 2019
This work proposes novel Deep Neural Network structures serving as effective feature extractors, and explores the usage of background information in the form of different word embeddings pre-trained from unlabelled corpora to address the very challenging nature of identifying hate speech on the social media.
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
- Computer ScienceBlackboxNLP@EMNLP
- 2018
A benchmark of nine diverse NLU tasks, an auxiliary dataset for probing models for understanding of specific linguistic phenomena, and an online platform for evaluating and comparing models, which favors models that can represent linguistic knowledge in a way that facilitates sample-efficient learning and effective knowledge-transfer across tasks.
ETHOS: a multi-label hate speech detection dataset
- Computer ScienceComplex & Intelligent Systems
- 2022
‘ETHOS’ (multi-labEl haTe speecH detectiOn dataSet), a textual dataset with two variants: binary and multi-label, based on YouTube and Reddit comments validated using the Figure-Eight crowdsourcing platform, and the annotation protocol used to create this dataset is presented.
Transformers: State-of-the-Art Natural Language Processing
- Computer ScienceEMNLP
- 2020
Transformers is an open-source library that consists of carefully engineered state-of-the art Transformer architectures under a unified API and a curated collection of pretrained models made by and available for the community.
Overview of the HASOC Track at FIRE 2020: Hate Speech and Offensive Language Identification in Tamil, Malayalam, Hindi, English and German
- EducationFIRE
- 2020
This paper presents the HASOC track and its two parts, creating test collections for languages with few resources and English for comparison, and presents the tasks, the data and the main results.