Learn More
Web information is ephemeral. Several organizations around the world are struggling to archive information from the web before it vanishes. However, users demand efficient and effective search mechanisms to access the already vast collections of historical information held by web archives. The Portuguese Web Archive is the largest full-text searchable web(More)
Today many companies decide to select free/open source software (F/OSS) products for various reasons, for example, economical or quality reasons. For many areas of application, they can choose from a variety of packages provided by different communities. Introducing a software tool into a company – either for supporting a certain business process or for the(More)
This paper reports the participation of the University of Lisbon at the 2007 GeoCLEF task. We adopted a novel approach for GIR, focused on handling geographic features and feature types on both queries and documents, generating signatures with multiple geographic concepts as a scope of interest. We experimented new query expansion and text mining(More)
The web was invented to quickly exchange data between scientists, but it became a crucial communication tool to connect the world. However, the web is extremely ephemeral. Most of the information published online becomes quickly unavailable and is lost forever. There are several initiatives worldwide that struggle to archive information from the web before(More)
This paper reports the participation of the XLDB Group from the University of Lisbon at the 2007 GeoCLEF task. We adopted a novel approach for GIR, focused on handling geographic features and feature types on both queries and documents, generating geographic signatures with multiple geographic concepts as a scope of interest. We experimented new query(More)
The dissertation report presents the development of a geographic information search system which implements geographic signatures, a novel approach for the modeling of the geographic information present in documents. The goal of the project was to determine if the information with geographic semantics present in documents, captured as geographic signatures,(More)
Today many companies decide to select Free/Open Source software (F/OSS) products for various purposes, for example because of economical or quality reasons. For many areas of application they can choose from a variety of packages provided by different communities. Introducing a software tool into a company – either for supporting a certain business process(More)
The web became a mass means of publication that has been replacing printed media. However, its information is extremely ephemeral. Currently, most of the information available on the web is less than 1 year old. There are several initiatives worldwide that struggle to archive information from the web before it vanishes. However, search mechanisms to access(More)