But Dictionaries Are Data Too

@inproceedings{Brown1993ButDA,
  title={But Dictionaries Are Data Too},
  author={Peter F. Brown and Stephen Della Pietra and Vincent J. Della Pietra and Meredith J. Goldsmith and Jan Hajic and Robert L. Mercer and Surya Mohanty},
  booktitle={HLT},
  year={1993}
}
Although empiricist approaches to machine translation depend vitally on data in the form of large bilingual corpora, bilingual dictionaries are also a source of information. We show how to model at least a part of the information contained in a bilingual dictionary so that we can treat a bilingual dictionary and a bilingual corpus as two facets of a unified collection of data from which to extract values for the parameters of a probabilistic machine translation system. We give an algorithm for…

Citations

Publications citing this paper.
SHOWING 1-10 OF 40 CITATIONS

Using Parallel Texts and Lexicons for Verbal Word Sense Disambiguation

CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Improving IBM Word Alignment Model 1

CITES BACKGROUND & METHODS
HIGHLY INFLUENCED

Models of Translational Equivalence among Words

  • Computational Linguistics
  • 2000
CITES BACKGROUND & METHODS
HIGHLY INFLUENCED

Learning Tractable Word Alignment Models with Complex Constraints

  • Computational Linguistics
  • 2010
CITES BACKGROUND & METHODS
HIGHLY INFLUENCED

References

Publications referenced by this paper.
SHOWING 1-3 OF 3 REFERENCES

HarperCollins Robert French Dictionary

B. T. Atkins, A. Duval, +5 authors M.-N. Lamy
  • 1990
HIGHLY INFLUENTIAL