• Corpus ID: 226226405

Effective Approach to Develop a Sentiment Annotator For Legal Domain in a Low Resource Setting

  title={Effective Approach to Develop a Sentiment Annotator For Legal Domain in a Low Resource Setting},
  author={Gathika Ratnayaka and Nisansa de Silva and Amal Shehan Perera and Ramesh Pathirana},
  booktitle={Pacific Asia Conference on Language, Information and Computation},
Analyzing the sentiments of legal opinions available in Legal Opinion Texts can facilitate several use cases such as legal judgement prediction, contradictory statements identification and party-based sentiment analysis. However, the task of developing a legal domain specific sentiment annotator is challenging due to resource constraints such as lack of domain specific labelled data and domain expertise. In this study, we propose novel techniques that can be used to develop a sentiment… 

Tables from this paper

Unified Legal Party Based Sentiment Analysis Pipeline

This study discusses the unification of two such models capable of doing the two task into a single pipeline to perform party based sentiment analysis efficiently.

Identifying Legal Party Members from Legal Opinion Documents using Natural Language Processing

This study proposes a novel deep learning methodology which can be effectively used to find a solution to the problem of identifying legal party members in legal documents and introduces a novel data set which is annotated with legal party information by an expert in the legal domain.

Context Sensitive Verb Similarity Dataset for Legal Information Extraction

LeCoVe is a verb similarity dataset intended towards facilitating the process of identifying verbs with similar meanings in a legal domain specific context and it is demonstrated that a neural network model, which was developed by combining semantic, syntactic, and contextual features that can be obtained from the outputs of embedding models, can perform comparatively well, even in a low resource scenario.


This research created a dataset which consists 2000 legal opinion text fetched from court cases in order to train the models and to conduct the experiments, and developed a rule-based model which is primarily built around a phrase-level sentiment annotator and using rationally built rules to perform Party-Based Sentiment Analysis.

Legal Case Winning Party Prediction With Domain Specific Auxiliary Models

This paper discusses a novel approach for predicting the winning party of a current court case by training an analytical model on a corpus of prior court cases which is then run on the prepared text on theCurrent court case.

Some Languages are More Equal than Others: Probing Deeper into the Linguistic Disparity in the NLP World

Linguistic disparity in the NLP world is a problem that has been widely acknowledged recently. However, different facets of this problem, or the reasons behind this disparity are seldom discussed

Billions of Parameters Are Worth More Than In-domain Training Data: A case study in the Legal Case Entailment Task

The 3B-parameter zero-shot model outperforms all models, including ensembles, in the COLIEE 2021 test set and also achieves the best performance of a single model in theCOLIEE 2022 competition, second only to the ensemble composed of the 3B model itself and a smaller version of the same model.



Fast Approach to Build an Automatic Sentiment Annotator for Legal Domain using Transfer Learning

This study proposes a novel way of identifying the sentiment of the phrases used in the legal domain using a transfer learning approach which can be used for other domain adaptation tasks as well.

Identifying Transferable Information Across Domains for Cross-domain Sentiment Classification

This paper proposes that words that do not change their polarity and significance represent the transferable (usable) information across domains for cross-domain sentiment classification and presents a novel approach based on χ2 test and cosine-similarity between context vector of words to identify polarity preserving significant words across domains.

Shift-of-Perspective Identification within Legal Cases

This study focused on the identification of sentences in legal opinion texts which convey different perspectives on a certain topic or entity by combining several approaches based on semantic analysis, open information extraction, and sentiment analysis to achieve the objective.

Opinion mining in legal blogs

A survey into the scope and utility of opinion mining in legal Weblogs (a.k.a. blawgs) is performed and a language modeling approach deployed for both subjectivity analysis and polarity analysis is examined.

Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank

A Sentiment Treebank that includes fine grained sentiment labels for 215,154 phrases in the parse trees of 11,855 sentences and presents new challenges for sentiment compositionality, and introduces the Recursive Neural Tensor Network.

Fine-grained Sentiment Classification using BERT

A promising deep learning model called BERT is used to solve the fine-grained sentiment classification task and it is shown that this model outperforms other popular models for this task without sophisticated architecture.

Sentiment in short strength detection informal text

A new algorithm is partly filled with a new algorithm, SentiStrength, to extract sentiment strength from informal English text, using new methods to exploit the de facto grammars and spelling styles of cyberspace.

A New ANEW: Evaluation of a Word List for Sentiment Analysis in Microblogs

This work wanted to examine how well ANEW and other word lists performs for the detection of sentiment strength in microblog posts in comparison with a new word list specifically constructed for microblogs.

SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining

This work discusses SENTIWORDNET 3.0, a lexical resource explicitly devised for supporting sentiment classification and opinion mining applications, and reports on the improvements concerning aspect (b) that it embodies with respect to version 1.0.

Synergistic union of Word2Vec and lexicon for domain specific semantic similarity

This study introduces a domain specific semantic similarity measure that was created by the synergistic union of word2vec, a word embedding method that is used for semantic similarity calculation and lexicon based (lexical) semantic similarity methods.