We describe a generalized toolset developed by the Perseus Project to manage XML documents in the context of a large, heterogeneous digital library. The system manages multiple DTDs through mappings from elements in the DTD to abstract document structures. The abstraction of document metadata, both structural and descriptive, facilitates the development of…
The Perseus Project is exploring the problems that a general digital library for the humanities would face. Support from the Digital Library Initiative, and particularly from NSF and NEH, has allowed us to capitalize on a decade of work developing a digital library on Greco-Roman antiquity and to explore systematically the problems raised by other domains within the…
The Perseus Digital Library (http://www.perseus.tufts.edu) has three different kinds of search tools, each of which presents its results in a different way. Two of them, the word-search tool and the Perseus Lookup Tool, allow users to request searches of the texts and the databases of the digital library. The third search tool works transparently on the…
Corpus editions can only be useful to scholars when users know what to expect of the texts. We argue for text quality indicators, both general and domain-specific.
The Open Archives Initiative has gained success by aiming between complex federation schemes and low-functionality web crawling. Much information still remains hidden inside documents catalogued by OAI metadata. We discuss how subdocument information can be exposed by data providers and exploited by service providers. We discuss services for citation…