Languages are born, evolve and, eventually, die. During this evolution their spelling rules (and sometimes their syntactic and semantic ones) change, making older documents harder to use. In Portugal, a pair of political agreements with Brazil forced significant changes in the way the Portuguese language is written. In this article we will detail these two…
This document presents the TerminUM project and the work done in its statistical word aligner workbench (NATools). It shows a variety of alignment methods for parallel corpora and discusses the resulting terminological dictionaries and their use: evaluation of sentence translations; construction of a multi-level navigation system for linguistic studies or…
According to recent research, nearly 95 percent of corporate information is stored in documents. Further studies indicate that companies spend between 6 and 10 percent of their gross revenues producing and distributing documents in several ways: web and CD-ROM publishing, database storage and retrieval, and printing. In this context documents exist in some…
Abstract: In this work we present the Procura-PALvras (P-PAL) project, whose main goal is to develop an electronic tool that provides information on objective and subjective psycholinguistic indices of European Portuguese (EP) words. P-PAL will be made freely available to the scientific community in a user-friendly format…
Besides source code, the fundamental source of information about Open Source Software lies in documentation and other non-source-code files, such as README, INSTALL, or HowTo files, commonly available in the software ecosystem. These documents, written in natural language, provide valuable information during the software development stage, but also in future…
Abstract: Parallel corpora are rich sources of translation resources. This document presents a methodology for extracting bilingual noun phrases (terminology candidates) from parallel corpora using translation rules. The models proposed in this work specify the changes in word order…
The analysis of business and financial news has become a popular area of research because it makes it possible to infer the future prospects of companies, economies, and economic actors in general from information contained in the media. The classical approaches rely on a "coarse" polarity classification of a news story; however, this may not be an optimal solution…
Multilingual resources are useful for linguistic studies, translation, and many other tasks. Unfortunately, these resources are difficult to obtain and organize. In this document we describe a set of tools designed to help in the task of mining bilingual resources from the web, from a specific site, from a file system, from a list of URLs, or from a…