The increasing use of methods in natural language processing (NLP) that are based on huge corpora requires that the lexical, morpho-syntactic and syntactic homogeneity of texts be mastered. We have developed a methodology and associated tools for text calibration or "profiling" within the ELRA benchmark called "Contribution to the construction of…
We describe a novel biotope at 633 to 762 m depth on a vertical wall in the Whittard Canyon, an extensive canyon system reaching from the shelf to the deep sea on Ireland's continental margin. We explored this wall with an ROV and compiled a photomosaic of the habitat. The assemblage contributing to the biotope was dominated by large limid bivalves, Acesta…
In 2009, the Marine Biodiscovery Laboratory was set up at the Marine Institute with funds from the Marine Institute and the Beaufort Marine Biodiscovery Research Programme. The Marine Biodiscovery Laboratory has already processed over 130 marine specimens from coastal zones and from the deep sea (≤3,000 m) within the Marine Irish Exclusive Economic Zone.
Very large corpora are increasingly exploited to improve Natural Language Processing (NLP) systems. This, however, implies that the lexical, morpho-syntactic and syntactic homogeneity of the data used is mastered. This control in turn requires the development of tools aimed at text calibration or profiling. We are implementing such profiling tools and…
OCEAN is a tool for a posteriori visual data mining that uses the output of a text miner to help users better explore a document space. Clustered documents are transformed into a hierarchical 3D representation analogous to reconfigurable disk trees. An intermediary document representation allows for interface customization and offers a generic approach to 3D…
Extensible Markup Language (XML) is playing an increasingly important role in the exchange of a wide variety of data on the Web and elsewhere. It is a simple, very flexible text format, used to annotate data by means of markup. XML documents can be checked for syntactic well-formedness and semantic coherence through DTD and schema validation, which makes…
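The well-formedness check mentioned in this abstract can be illustrated with a minimal sketch. It assumes Python's standard-library parser and is not taken from the cited work; the function name `is_well_formed` is hypothetical. It covers only syntactic well-formedness. DTD and schema validation are left out, since `xml.etree.ElementTree` does not validate.

```python
import xml.etree.ElementTree as ET


def is_well_formed(xml_text: str) -> bool:
    """Return True if xml_text parses as well-formed XML (hypothetical helper)."""
    try:
        # fromstring raises ParseError on any well-formedness violation,
        # such as a mismatched or unclosed tag.
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False


if __name__ == "__main__":
    print(is_well_formed("<doc><item id='1'>text</item></doc>"))
    print(is_well_formed("<doc><item>text</doc>"))
```

The first call parses cleanly, while the second has an unclosed `item` element, so the parser rejects it.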
In order to acquire the concepts related to a professional domain, 4600 web pages are structured into 900 hierarchical clusters by TEMIS. The design of the 3D interface for exploring these classes is described together with a qualitative evaluation by six professionals. The interface is defined through an XML language for virtual world design. Scene graph…