Using Acoustic Structure in a Hand-Held Audio Playback Device

@article{Schmandt1996UsingAS,
  title={Using Acoustic Structure in a Hand-Held Audio Playback Device},
  author={C. Schmandt and D. Roy},
  journal={IBM Syst. J.},
  year={1996},
  volume={35},
  pages={453-472}
}
This paper discusses issues in navigation and presentation of voice documents, and their application to a particular hand-held audio playback device, called NewsComm. It discusses situations amenable to auditory information retrieval, techniques for deriving document structure based on acoustical cues, and techniques for interactive presentation of digital audio. NewsComm provides a portable user interface to digitized radio news and interview programs, and it allows occasional connectivity to… Expand
The audio notebook: paper and pen interaction with structured speech
TLDR
A new device-the Audio Notebook-for taking notes and interacting with a speech recording, a combination of a digital audio recorder and paper notebook, all in one device. Expand
After direct manipulation---direct sonification
TLDR
It was found that with interactive multiple- stream audio, the ten users could accurately complete the browsing tasks significantly faster than those who had single-stream audio support. Expand
Listening to Rooms and Objects
TLDR
An experiment with both blind and blindfolded participants to perform localisation tasks is designed, on the human side, to investigate the basis for the addition of virtual interaction sounds in virtual immersive environments. Expand
Audio Source Separation Using Perceptual Principles for Content-Based Coding and Information Management
TLDR
The basic requirements of any hypermedia system are to chunk the data into self-contained conceptual units that can contain anchor points for the source or destination of links; • Links: a means to define and manage relationships between the nodes; and • Interface: a presentation engine that is independent of the node-link structure. Expand
The Technical Assessment of Software Usability with Reference to Screen Readers for the Graphical User Interface (GUI)
TLDR
A method of assessing software usability, using test instruments developed Iteratively through the process of conducting usability assessment of existing screen reader software for Windows 95 is described, which is particularly suited to end-users responsible for purchasing decisions or training. Expand
A Complete Bibliography of Publications in the IBM Systems Journal
A.I [?]. ABC [?]. ABLE [?]. Abstract [?, ?]. Abstracts [?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?,Expand

References

SHOWING 1-10 OF 35 REFERENCES
NewsComm: a hand-held interface for interactive access to structured audio
The NewsComm system delivers personally selected audio information to mobile users through a hand-held audio playback device. The system provides a one-toone connection from individual users toExpand
Speaker segmentation for browsing recorded audio
TLDR
A system which automatically divides a multi-speaker recording into speaker segments and displays this information graphically allows a user to easily access the parts of a recording where given people are talking. Expand
SpeechSkimmer: interactively skimming recorded speech
TLDR
This paper presents a multi-level approach to auditory skimming, along with user interface techniques for interacting with the audio and providing feedback, and a prototype user interface for skimming speech. Expand
Interactively skimming recorded speech
TLDR
This research makes it easier and more efficient to listen to recorded speech by using the SpeechSkimmer system, a user interface for interactively skimming speech recordings and exploits properties of spontaneous speech to automatically select and present salient audio segments in a time-efficient manner. Expand
Capturing, structuring, and representing ubiquitous audio
TLDR
Applications for capturing and structuring audio from office discussions and telephone calls, and mechanisms for later retrieval of these stored interactions, and the evolution of a family of representations across a range of applications are described. Expand
Working with audio: integrating personal tape recorders and desktop computers
TLDR
Early work aimed at creating computer tools that support the ways users may want to work with audio data are described, using existing devices such as portable tape recorders. Expand
AudioStreamer: exploiting simultaneity for listening
TLDR
The main contributions of AudioStreamer are the use of spatial separation and simultaneous listening for audio document retrieval and modeling listener interest to enhance the effectiveness of simultaneous listening. Expand
Hyperspeech: navigating in speech-only hypermedia
  • B. Arons
  • Computer Science
  • HYPERTEXT '91
  • 1991
TLDR
The hyperspeech system described in this paper, a speech-only hypermedia application, explores issues of navigation and system architecture in an audio environment without a visual display, and uses speech recognition to maneuver in a database of digitally recorded speech segments. Expand
The use of emphasis to automatically summarize a spoken discourse
  • Francine R. Chen, M. Withgott
  • Computer Science
  • [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing
  • 1992
The authors describe a method for exploiting prosodic information in natural, conventional speed for the purpose of automatically creating an audio summary. The method is based on identifyingExpand
Segregation of speakers for speech recognition and speaker identification
  • H. Gish, M. Siu, J. R. Rohlicek
  • Computer Science
  • [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing
  • 1991
TLDR
A method for segregating speech from speakers engaged in dialogs employs a distance measure between speech segments used in conjunction with a clustering algorithm to perform the segregation. Expand
...
1
2
3
4
...