Elisabete Ranchhod

Using standard methods and formats established at LADL and adopted by several European research teams to construct large-coverage electronic dictionaries and grammars, we elaborated for Portuguese a set of lexical resources that were implemented in INTEX. We describe the main features of such linguistic data, refer to their maintenance and extension, and…
In this paper I briefly present Linguateca, an infrastructure project for Portuguese which is ten years old, and show how it provides several possibilities to study grammatical and semantic differences between varieties of the language. After a short history of Portuguese corpus linguistics, presenting the main projects in and sketch some ideas…
The paper describes large-scale linguistic resources for Portuguese, mainly computational lexicons and grammars, developed by LabEL. These resources are formalized and applied to texts by means of finite-state techniques, which are increasingly acknowledged in Natural Language Processing. On the one hand, it illustrates methods of lexical representation for simple words…
In this paper, we present the current state of development of a large-scale lexicon built at LabEL for Portuguese. We concentrate on multiword expressions (MWEs), particularly on multiword nouns, (i) illustrating their most relevant morphological features, and (ii) pointing out the methods and techniques adopted to generate the inflected forms from…
We report experiments about the syntactic variations of support verb constructions, a special type of multiword expressions (MWEs) containing predicative nouns. In these expressions, the noun can occur with or without the verb, with no clear-cut semantic difference. We extracted from a large French corpus a set of examples of the two situations and derived…
1 Presentation. Linguateca arose when the Portuguese science and technology authorities identified natural language processing applied to Portuguese as a strategic area [33]. After a survey of the main obstacles to the development of the area, the need became clear for public, freely available resources and for…
SUMMARY In this paper we present the state of development of DIGRAMA, a system of electronic dictionaries and grammars of Portuguese. We briefly describe the general characteristics of the system, then concentrate on the analysis of two dictionary modules: DIGRAS and DIGRAF. The former is a dictionary of simple words, the latter is a dictionary of inflected…
Multiword lexical units (ULM, from the Portuguese "unidades lexicais multipalavra") have traditionally been regarded as idiosyncratic and marginal linguistic objects. Theoretical linguistics has paid them little attention. In grammars, references to them are scarce and inconsistent. In dictionaries, only ULM written with a hyphen constitute lexical entries (bem-estar, guarda-costas,…