Corpus ID: 23661160

About the creation of a parallel bilingual corpora of web-publications

@article{Lande2008AboutTC,
  title={About the creation of a parallel bilingual corpora of web-publications},
  author={D. V. Lande and V. V. Zhygalo},
  journal={ArXiv},
  year={2008},
  volume={abs/0807.0311}
}
  • D. V. Lande, V. V. Zhygalo
  • Published 2008
  • Computer Science
  • ArXiv
  • The algorithm of the creation texts parallel corpora was presented. The algorithm is based on the use of "key words" in text documents, and on the means of their automated translation. Key words were singled out by means of using Russian and Ukrainian morphological dictionaries, as well as dictionaries of the translation of nouns for the Russian and Ukrainianlanguages. Besides, to calculate the weights of the terms in the documents, empiric-statistic rules were used. The algorithm under… CONTINUE READING

    Citations

    Publications citing this paper.

    References

    Publications referenced by this paper.
    SHOWING 1-5 OF 5 REFERENCES

    Term-Weighting Approaches in Automatic Text Retrieval

    VIEW 4 EXCERPTS
    HIGHLY INFLUENTIAL

    Term-Weighting Approaches // Automatic Text Retrieval. Information Processing and Management

    • G Salton, C Buckley
    • Term-Weighting Approaches // Automatic Text Retrieval. Information Processing and Management
    • 1988