Corpus ID: 236087214

Unsupervised Identification of Relevant Prior Cases

  title={Unsupervised Identification of Relevant Prior Cases},
  author={Shivangi Bithel and Sumitra S Malagi},
Document retrieval has taken its role in almost all domains of knowledge understanding including the legal domain. Precedent refers to a court decision that is considered as authority for deciding subsequent cases involving identical or similar facts, or similar legal issues. In this work, we are proposing different unsupervised approaches to solve the task of identifying relevant precedents to a given query case. Our proposed approaches are using word embeddings like word2vec, doc2vec and… Expand

Figures and Tables from this paper


IR-BERT: Leveraging BERT for Semantic Search in Background Linking for News Articles
The empirically show that employing a language model benefits the approach in understanding the context as well as the background of the query article, and a diversity measure is proposed to evaluate the effectiveness of the various approaches in retrieving a diverse set of documents. Expand
PARADE: Passage Representation Aggregation for Document Reranking
We present PARADE, an end-to-end Transformer-based model that considers document-level context for document reranking. PARADE leverages passage-level relevance representations to predict a documentExpand
A Summary of the COLIEE 2019 Competition
The evaluation of the 6th Competition on Legal Information Extraction/Entailment (COLIEE 2019) consists of four tasks: two on case law and two on statute law, which attempts to confirm whether a particular statute applies to a yes/no question. Expand
CUSAT NLP@AILA-FIRE2019: Similarity in Legal Texts using Document Level Embeddings
A working model developed for the track Artificial Intelligence for Legal Assistance in Forum for Information Retrieval Evaluation, 2019 (AILA-FIRE2019) is explained, using an embedding model approach to represent legal texts in a semantic vector space. Expand
FIRE 2019 AILA Track: Artificial Intelligence for Legal Assistance
The FIRE 2019 AILA track focused on creating a framework for evaluating different methods of retrieving relevant prior/precedent cases and statutes given a factual scenario. There were two tasks forExpand
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Sentence-BERT (SBERT), a modification of the pretrained BERT network that use siamese and triplet network structures to derive semantically meaningful sentence embeddings that can be compared using cosine-similarity is presented. Expand
n.d.]. IITP in COLIEE@ICAIL 2019: Legal Information Retrieval usingBM25 and BERT
  • ([n. d.]). https://www.researchgate
  • 2019
Improvements to BM25 and Language Models Examined
This investigation finds that once trained (using particle swarm optimization) there is very little difference in performance between these functions, that relevance feedback is effective, that stemming is effective and that it remains unclear which function is best over-all. Expand
Score-safe term-dependency processing with hybrid indexes
This work presents a hybrid approach which leverages score-safe processing and suffix-based self-indexing structures in order to provide efficient and effective top-k document retrieval. Expand
Lower-bounding term frequency normalization
This paper proposes a general and efficient method to introduce a sufficiently large lower bound for TF normalization which can be shown analytically to fix or alleviate the problem of very long documents being overly penalized. Expand