Querying databases of annotated speech

@article{Cassidy2000QueryingDO,
  title={Querying databases of annotated speech},
  author={S. Cassidy and Steven Bird},
  journal={Proceedings 11th Australasian Database Conference. ADC 2000 (Cat. No.PR00528)},
  year={2000},
  pages={12-20}
}
  • S. Cassidy, Steven Bird
  • Published 2000
  • Computer Science
  • Proceedings 11th Australasian Database Conference. ADC 2000 (Cat. No.PR00528)
Annotated speech corpora are databases consisting of signal data along with time-aligned symbolic 'transcriptions'. Such databases are typically multidimensional, heterogeneous and dynamic. These properties present a number of tough challenges for representation and query. The temporal nature of the data adds an additional layer of complexity. This paper presents and harmonises two independent efforts to model annotated speech databases, one at Macquarie University, and one at the University of…
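As a rough illustration of what "time-aligned symbolic transcriptions" and a temporal query over them might look like, here is a minimal Python sketch. It is not the Emu or annotation-graph model from the paper: the `Annotation` class, the `dominated_by` helper, the tier names, and the sample labels and times are all hypothetical, invented only for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    """One labelled, time-aligned span on a named tier (hypothetical model)."""
    tier: str     # illustrative tier name, e.g. "word" or "phoneme"
    label: str
    start: float  # seconds from the start of the signal
    end: float


def dominated_by(annotations, child_tier, parent_tier, parent_label):
    """Return child-tier annotations whose span lies inside a matching parent span."""
    parents = [
        a for a in annotations
        if a.tier == parent_tier and a.label == parent_label
    ]
    return [
        c for c in annotations
        if c.tier == child_tier
        and any(p.start <= c.start and c.end <= p.end for p in parents)
    ]


# Invented sample data: two words, each with phoneme-level segments.
corpus = [
    Annotation("word", "she", 0.00, 0.25),
    Annotation("word", "had", 0.25, 0.55),
    Annotation("phoneme", "sh", 0.00, 0.12),
    Annotation("phoneme", "iy", 0.12, 0.25),
    Annotation("phoneme", "hv", 0.25, 0.33),
    Annotation("phoneme", "ae", 0.33, 0.47),
    Annotation("phoneme", "dcl", 0.47, 0.55),
]

phonemes_in_she = [a.label for a in dominated_by(corpus, "phoneme", "word", "she")]
phonemes_in_had = [a.label for a in dominated_by(corpus, "phoneme", "word", "had")]
```

The sketch expresses the hierarchy between tiers purely through temporal inclusion. That is only one possible design choice, and a real system may store such relations explicitly.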
Querying Linguistic Trees
TLDR
This work describes the domain of linguistic trees, discusses the expressive requirements for a query language, presents a language that can express a wide range of queries over these trees, and shows that the language is first-order complete over trees.
Linguistic Tree Query
Large databases of linguistic annotations are used for testing linguistic hypotheses and for training language processing models. These linguistic annotations are often syntactic or prosodic in…
Testing the adequacy of query languages against annotated spoken dialog
TLDR
This paper presents an example of a richly annotated discourse segment which includes both DAMSL style discourse level annotation and ToBI intonational analysis and describes how this annotation could be realised in either the Emu, MATE or Annotation Graph formalisms.
Multi-level annotation in the Emu speech database management system
TLDR
This paper discusses the design of the Emu system, giving a detailed description of the annotation structures that it supports, and argues that these structures are sufficiently general to allow Emu to read potentially any time-aligned linguistic annotation.
A Formal Framework for Linguistic Annotation (revised version)
TLDR
A wide variety of existing annotation formats are surveyed and a common conceptual core, the annotation graph, is demonstrated, providing a formal framework for constructing, maintaining and searching linguistic annotations, while remaining consistent with many alternative data structures and file formats.
A formal framework for linguistic annotation
TLDR
A wide variety of existing annotation formats are surveyed and a common conceptual core, the annotation graph, is demonstrated, which provides a formal framework for constructing, maintaining and searching linguistic annotations, while remaining consistent with many alternative data structures and file formats.
ATLAS: A Flexible and Extensible Architecture for Linguistic Annotation
TLDR
A formal model for annotating linguistic artifacts is described, from which an application programming interface (API) to a suite of tools for manipulating these annotations is derived, and current efforts towards implementing key pieces of this architecture are reviewed.
Towards a Query Language for Annotation Graphs
TLDR
A query language is developed and optimization techniques for an underlying relational representation of annotation graphs are described, which is rather different from the customary graph models for semistructured data.
Building an interface between EMU and Praat: a modular approach to speech database analysis
TLDR
It is argued that both the variety of existing speech databases and the multitude of possible types of speech analysis require a modular approach that allows the integration of a number of stand-alone components, each adapted to a different aspect of creating, annotating, querying and analysing speech data.
Annotated text databases in the context of the Kaj Munk corpus: One database model, one query language, and several applications
TLDR
This PhD dissertation reflects the role of a computer scientist seeking to represent annotated versions of Kaj Munk’s works in a computer database system, and to support easy querying of these annotated texts.

References

Multi-level Annotation of Speech: An Overview of The Emu Speech Database Management System
TLDR
The design of the Emu system is discussed, giving a detailed description of the annotation structures that it supports, and it is argued that these structures are sufficiently general to potentially allow Emu to read any time-aligned linguistic annotation.
Compiling multi-tiered speech databases into the relational model: experiments with the emu system
TLDR
A method of translating an Emu database into the relational model, as used by most commercial database systems, is proposed using a Tcl script and results show a marked increase in speed for the relational system on most queries.
A formal framework for linguistic annotation
TLDR
A wide variety of existing annotation formats are surveyed and a common conceptual core, the annotation graph, is demonstrated, which provides a formal framework for constructing, maintaining and searching linguistic annotations, while remaining consistent with many alternative data structures and file formats.
EMU: an Enhanced Hierarchical Speech Data Management System
TLDR
A database for Cantonese and a database used in a kinematic study of vowel lengthening, both of which include facilities for automatically generating hierarchies, are described.
Annotation graphs as a framework for multidimensional linguistic data analysis
TLDR
This work motivates and illustrates the approach using discourse-level annotations of text and speech data drawn from the CALLHOME, COCONUT, MUC-7, DAMSL and TRAINS annotation schemes to show how annotation graphs can represent hybrid multi-level structures which derive from a diverse set of file formats.
Building a Large Annotated Corpus of English: The Penn Treebank
TLDR
As a result of this grant, the researchers have now published on CDROM a corpus of over 4 million words of running text annotated with part-of-speech (POS) tags, which includes a fully hand-parsed version of the classic Brown corpus.
Computational Phonology: A Constraint-Based Approach
TLDR
The goal of this book is to explore the use of constraints in modern non-linear phonology and then, drawing on insights from constraint-based grammar and constraint logic programming, to formalise and implement a constraint-based phonology.
Editors
  • Brain Research Bulletin
  • 1986
This specification defines the Document Object Model Level 2 HTML, a platform- and language-neutral interface that allows programs and scripts to dynamically access and update the content and structure…
… and D. Suciu. XML-QL: A query language for XML
  • 1998
… and D. Schach. XML query language (XQL)
  • 1998