ANOPPI: A Pseudonymization Service for Finnish Court Documents

@inproceedings{Oksanen2019ANOPPIAP,
  title={ANOPPI: A Pseudonymization Service for Finnish Court Documents},
  author={Arttu Oksanen and Minna Tamper and J. Tuominen and Aki Hietanen and Eero Hyv{\"o}nen},
  booktitle={International Conference on Legal Knowledge and Information Systems},
  year={2019}
}
To comply with the EU General Data Protection Regulation (GDPR) publishing court judgments online requires that personal data contained in them must be disguised. However, anonymizing the documents manually is a costly and time-consuming procedure. This paper presents Anoppi service for automatic and semi-automatic pseudonymization of Finnish court judgments. Utilizing both statisticsand rule-based named entity recognition methods and morphological analysis, Anoppi is able to automatically… 

Figures from this paper

A Pseudonymization Tool for Legal Documents for Linked Data Publication and Use on the Semantic Web

Evaluation shows that ANOPPI performs well with different types of documents, however, further improving the performance of the named entity recognition and disambiguation methods would enhance the usefulness of the software.

An Anonymization Tool for Open Data Publication of Legal Documents

Evaluation shows that ANOPPI performs well with different types of documents, however, further improving the performance of the named entity recognition and disambiguation methods would enhance the usefulness of the software.

A Tool for Pseudonymization of Textual Documents for Digital Humanities Research and Publication

Evaluation shows that Anoppi performs well with different types of documents, however, further improving the performance of the named entity recognition and disambiguation methods would enhance the usefulness of the software and motivate organizations to bring AnOppi into use.

LawSampo: A Semantic Portal on a Linked Open Data Service for Finnish Legislation and Case Law

The semantic portal prototype LawSampo is presented for serving end users with legal data from the Linked Open Data service Semantic Finlex of the Finnish Ministry of Justice, to aggregate heterogeneous distributed data and represent it as a harmonized knowledge graph on top of which intelligent usecase application perspectives can be created.

Challenges and Open Problems of Legal Document Anonymization

This paper aims to summarize and highlight the open and symmetrical problems from the fields of structured and unstructured text anonymization and the possible methods for anonymizing legal documents discussed and illustrated by case studies from the Hungarian legal practice.

Extending the Finnish Linked Data Infrastructure with Natural Language Processing Services in FIN-CLARIAH

This poster paper introduces work in FIN-CLARIAH relating to the idea of integrating natural language processing (NLP) tools with the Linked Open Data (LOD) Infrastructure for Digital Humanities in Finland (LODI4DH) and presents a plan for NLP services to be opened as part of the linked Data Finland (LDF.fi) platform.

Building a Production-Ready Multi-Label Classifier for Legal Documents with Digital-Twin-Distiller

The proposed paper shows a solution where this multi-label classification problem is decomposed into more than a hundred binary classification problems, and could increase the e-discoverability of the documents by about 50%.

Publishing and Using Legislation and Case Law as Linked Open Data on the Semantic Web

This paper argues for the idea of publishing legislation and case law as Linked Open Data (LOD) on the Semantic Web, to cater several user groups, including the general public, legislators, lawyers,

Modeling and Publishing Finnish Person Names as a Linked Open Data Ontology

An ontology and a Linked Open Data service of tens of thousands of Finnish person names, extracted from contemporary and historical name registries, intended for Named Entity Recognition and Linking in automatic annotation and data anonymization tasks, as well as for enriching data in, e.g., genealogical research.

References

SHOWING 1-9 OF 9 REFERENCES

Anonymization of Court Orders

An anonymization tool that was commissioned by and specified together with Schultz, a publishing company specialized in Danish law related publications attains a reassuringly good recall, makes almost no chunk errors and reduces the found entity designators to a nearly correct set of entities that the input text refers to, minimizing the time needed for manual check and post-editing.

Semantic Finlex: Transforming, Publishing, and Using Finnish Legislation and Case Law As Linked Open Data on the Web

Semantic Finlex is presented, a national in-use data resource and service for publishing Finnish legislation and related case law as Linked Open Data for legal applications to use and methods and tools under development to automatically annotate legal texts and to anonymize case law documents prior to their publication on the Web.

Building the essential resources for Finnish: the Turku Dependency Treebank

The final version of a publicly available treebank of Finnish, the Turku Dependency Treebank is presented and the first open source Finnish dependency parser is presented, trained on the newly introduced treebank.

On-Line Publication of Court Decisions in the EU: Report of the Policy Group of the Project ‘Building on the European Case Law Identifier’

This report contains an extensive comparative research on the on-line publication of court decisions in Europe. It focuses on three main themes – policy and practices with regard to on-line

Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling

By using simulated annealing in place of Viterbi decoding in sequence models such as HMMs, CMMs, and CRFs, it is possible to incorporate non-local structure while preserving tractable inference.

Visualizing and Analyzing Networks of Named Entities in Biographical Dictionaries for Digital Humanities Research

This paper shows how named entity extraction and network analysis can be used to examine biographies individually and in groups to aid historians in biographical and prosopographical research. For

Anonymisointipalvelut. tarve ja toteutusvaihtoehdot, 2017. Liikenne- ja viestintäministeriön julkaisuja 7/2017

  • 2017

Free access to legislation in Finland: Principles, practices and prospects

  • Law via the Internet. Free Access – Quality of Information – Effectiveness of Rights
  • 2009