Skip to search form
Skip to main content
Skip to account menu
Semantic Scholar
Semantic Scholar's Logo
Search 225,317,619 papers from all fields of science
Search
Sign In
Create Free Account
Portuguese Web Archive
The Portuguese Web Archive (PWA) is the national Web archive of Portugal. Its mission is to periodically archive contents of national interest…
Expand
Wikipedia
(opens in a new tab)
Create Alert
Alert
Related topics
Related topics
3 relations
List of Web archiving initiatives
Web archiving
World Wide Web
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
Review
2013
Review
2013
Cross-lingual web spam classification
A. Garzó
,
B. Daróczy
,
Tamás Kiss
,
Dávid Siklósi
,
A. Benczúr
The Web Conference
2013
Corpus ID: 13164121
While Web spam training data exists in English, we face an expensive human labeling procedure if we want to filter a Web domain…
Expand
2013
2013
Acquiring and providing access to historical web collections
Daniel Gomes
,
David Cruz
,
J. Miranda
,
Miguel Costa
,
Simão Fontes
International Conference on Digital Preservation
2013
Corpus ID: 583905
Every day, unique valuable information that describes our current days disappears from the web. National archives or libraries…
Expand
Review
2012
Review
2012
The Portuguese Web Archive: an overview
Fccn Daniel Gomes
2012
Corpus ID: 109428124
The Portuguese Web Archive preserves and provides access to information published on the web of main interest to the Portuguese…
Expand
2012
2012
Creating a searchable web archive ( Technical Report )
Daniel Gomes
,
David Cruz
,
J. Miranda
,
Miguel Costa
,
Simão Fontes
2012
Corpus ID: 19330539
The web became a mass means of publication that has been replacing printed media. However, its information is extremely ephemeral…
Expand
2011
2011
The Portuguese Web Archive and other tools for historical research
Fccn Daniel Gomes
2011
Corpus ID: 131026803
2009
2009
An Updated Portrait of the Portuguese Web
J. Miranda
,
Daniel Gomes
2009
Corpus ID: 15299874
This study presents an updated characterization of the Portuguese Web derived from a crawl of 48 million contents belonging to…
Expand
By clicking accept or continuing to use the site, you agree to the terms outlined in our
Privacy Policy
(opens in a new tab)
,
Terms of Service
(opens in a new tab)
, and
Dataset License
(opens in a new tab)
ACCEPT & CONTINUE