• Corpus ID: 27981152

Translation Inference across Dictionaries via a Combination of Graph-based Methods and Co-occurrence Statistics

@inproceedings{Proisl2017TranslationIA,
  title={Translation Inference across Dictionaries via a Combination of Graph-based Methods and Co-occurrence Statistics},
  author={Thomas Proisl and Philipp Heinrich and Stefan Evert and Besim Kabashi},
  booktitle={LDK Workshops},
  year={2017}
}
This system description explains how to use several bilingual dictionaries and aligned corpora in order to create translation candidates for novel language pairs. It proposes (1) a graph-based approach which does not depend on cyclical translations and (2) a combination of this method with a collocation-based model using the multilingually aligned Europarl corpus. 

Figures and Tables from this paper

Title Inferring translation candidates for multilingual dictionary generation with multi-way neural machine translation
TLDR
This work explored the use of multi-way neural machine translation trained on corpora of languages from the same family and trained additionally with a relatively small human-validated dictionary to infer new translation candidates and showed not only that new dictionary entries can be identified and extracted from the translation model, but also that the expected precision and recall of the resulting dictionary can be adjusted by using different thresholds.
Inferring translation candidates for multilingual dictionary generation with multi-way neural machine translation
This publication has emanated from research supported in part by a research grant from Science Foundation Ireland (SFI) under Grant Number SFI/12/RC/2289, co-funded by the European Regional
NUIG at TIAD 2021: Cross-lingual word embeddings for translation inference
This work has received funding from the EU’s Horizon 2020 Research and Innovation programme through the ELEXIS project under grant agreement No. 731015.
Cross-Dictionary Linking at Sense Level with a Double-Layer Classifier
TLDR
This work presents a system for linking dictionaries at the sense level, which is part of a wider programme aiming to extend current lexical resources and to create new ones by automatic means, by addressing the task as a binary classification problem using standard Machine Learning methods.

References

SHOWING 1-5 OF 5 REFERENCES
Leveraging RDF Graphs for Crossing Multiple Bilingual Dictionaries
The experiments presented here exploit the properties of the Apertium RDF Graph, principally cycle density and nodes’ degree, to automatically generate new translation relations between words, and
Translating Collocations for Bilingual Lexicons: A Statistical Approach
TLDR
A program named Champollion is described which, given a pair of parallel corpora in two different languages and a list of collocations in one of them, automatically produces their translations, to provide a tool for compiling bilingual lexical information above the word level in multiple languages, for different domains.
Construction of a Bilingual Dictionary Intermediated by a Third Language
TLDR
A method to treat equivalencies from inappropriate words derived as a result of ambiguity in the third language by utilizing the structures of dictionaries to measure the nearness of the meanings of words is proposed.
From Dictionary to Database: Creating a Global Multi-Language Series
TLDR
K Dictionaries is developing a global dictionary series for learners and general users, which includes over twenty languages to date, and the DTD has undergone numerous amendments and improvements over the time to solidify the structure and enrich it.
Parallel Data, Tools and Interfaces in OPUS
TLDR
New data sets and their features, additional annotation tools and models provided from the website and essential interfaces and on-line services included in the OPUS project are reported.