• Corpus ID: 236493282

Term Expansion and FinBERT fine-tuning for Hypernym and Synonym Ranking of Financial Terms

@article{Chopra2021TermEA,
  title={Term Expansion and FinBERT fine-tuning for Hypernym and Synonym Ranking of Financial Terms},
  author={Ankush Chopra and Sohom Ghosh},
  journal={ArXiv},
  year={2021},
  volume={abs/2107.13764}
}
Hypernym and synonym matching are one of the mainstream Natural Language Processing (NLP) tasks. In this paper, we present systems that attempt to solve this problem. We designed these systems to participate in the FinSim-3, a shared task of FinNLP workshop at IJCAI-2021. The shared task is focused on solving this problem for the financial domain. We experimented with various transformer based pre-trained embeddings by fine-tuning these for either classification or phrase similarity tasks. We… 

Figures and Tables from this paper

References

SHOWING 1-10 OF 26 REFERENCES

The FinSim-2 2021 Shared Task: Learning Semantic Similarities for the Financial Domain

The FinSim-2 is a second edition of FinSim Shared Task on Learning Semantic Similarities for the Financial Domain, colocated with the FinWeb workshop. FinSim-2 proposed the challenge to automatically

The FinSim 2020 Shared Task: Learning Semantic Representations for the Financial Domain

The FinSim 2020 shared task, colocated with the FinNLP workshop, offered the challenge to automatically learn effective and precise semantic models for the financial domain, the first time such a task was proposed for the Financial domain.

GOAT at the FinSim-2 task: Learning Word Representations of Financial Data with Customized Corpus

The experimental results demonstrate that these customized word embeddings can effectively improve the classification performance and achieve better results than the direct utilization of the provided word embedDings.

Anuj at the FinSim Task: Anuj@FINSIM¡VLearning Semantic Representation of Financial Domain with Investopedia

A hybrid approach that uses various NLP, machine learning, and deep learning models to develop a financial terms classifier is explained and use of a financial domain encyclopedia called Investopedia is explained to enrich terms for better context.

IITK at the FinSim Task: Hypernym Detection in Financial Domain via Context-Free and Contextualized Word Embeddings

This paper presents the approaches for the FinSim 2020 shared task on "Learning Semantic Representations for the Financial Domain", to classify financial terms into the most relevant hypernym (or top-level) concept in an external ontology.

L3i_LBPAM at the FinSim-2 task: Learning Financial Semantic Similarities with Siamese Transformers

The proposed methods outperformed the authors' baselines and proved the robustness of the models based on textual similarity siamese network.

SemEval-2018 Task 9: Hypernym Discovery

This paper put forward this task as a complementary benchmark for modeling hypernymy, a problem which has traditionally been cast as a binary classification task, taking a pair of candidate words as input.

TCS_WITM_2021 @FinSim-2: Transformer based Models for Automatic Classification of Financial Terms

A transformer-based BERT architecture that captures such contextual information from a set of domain specific raw documents and then performs a classification task to segregate domain terms into fixed number of class labels is proposed.

SemEval-2016 Task 14: Semantic Taxonomy Enrichment

Only one participating system outperformed the second, morecompetitive baseline that attaches a new term to the first word in its gloss with the appropriate part of speech, which indicates that techniques must be adapted to exploit the structure of glosses.

PolyU-CBS at the FinSim-2 Task: Combining Distributional, String-Based and Transformers-Based Features for Hypernymy Detection in the Financial Domain

The results show that a simple logistic regression classifier, when trained on a combination of word embeddings, semantic and string similarity metrics and BERT-derived probabilities, achieves a strong performance (above 90%) in financial hypernymy detection.