Elisabete Ranchhod

Learn More
Using standard methods and formats established at LADL, and adopted by several European research teams to construct large-coverage electronic dictionaries and grammars, we elaborated for Portuguese a set of lexlcal resources, that were implemented in IN'rEX We describe the main features of such linguistic data, refer to their mmntenance and extension, and(More)
In this paper I present briefly Linguateca 1 , an infrastructure project for Portuguese which is ten years old, and will show how it provides several possibilities to study grammatical and semantical differences between varieties of the language. After a short history of Portuguese corpus linguistics, presenting the main projects in and sketch some ideas(More)
The paper describes Portuguese large-scale linguistic resources, mainly computational lexicons and grammars, developed by LabEL. These resources are formalized and applied to texts by means of finite-state techniques, more and more acknowledged in Natural Language Processing. On the one hand, it illustrates methods on lexical representation for simple words(More)
In this paper, we present the current state of development of a large-scale lexicon built at LabEL 1 for Portuguese. We will concentrate on multiword expressions (MWE), particularly on multiword nouns, (i) illustrating their most relevant morphological features, and (ii) pointing out the methods and techniques adopted to generate the inflected forms from(More)
We report experiments about the syntactic variations of support verb constructions, a special type of multiword expressions (MWEs) containing predicative nouns. In these expressions, the noun can occur with or without the verb, with no clear-cut semantic difference. We extracted from a large French corpus a set of examples of the two situations and derived(More)
1 Apresentação A Linguateca nasceu da identificação do processamento da linguagem natural aplicado ao português como uma área estratégica, por parte das autoridades portuguesas em Ciência e Tecnologia [33]. Após um recenseamento dos principais entraves ao desenvolvimento da área, ficou clara a necessidade da existência de recursos públicos e gratuitos e da(More)
As unidades lexicais multipalavra (ULM) foram tradicionalmente consideradas como objectos linguísticos idiossincráticos e marginais. A linguística teórica dedicou-lhes pouca atenção. Nas gramáticas, as referências são escassas e pouco consistentes. Nos dicionários, só as ULM grafadas com hífen constituem entradas lexicais (bem-estar, guarda-costas,(More)
Like common noun phrases, proper names contain ambiguous conjoined phrases that make their delimitation and classification difficult in text. This paper presents a finite-state approach to the disambiguation of Portuguese candidate proper name strings containing the coordinating conjunction e (and). In such name strings, the conjunction can denote a(More)