Author pages are created from data sourced from our academic publisher partnerships and public sources.
Share This Author
The ICSI Meeting Corpus
- Adam L. Janin, D. Baron, Chuck Wooters
- Computer ScienceIEEE International Conference on Acoustics…
- 6 April 2003
A corpus of data from natural meetings that occurred at the International Computer Science Institute in Berkeley, California over the last three years is collected, which supports work in automatic speech recognition, noise robustness, dialog modeling, prosody, rich transcription, information retrieval, and more.
Acoustic Beamforming for Speaker Diarization of Meetings
- Xavier Anguera Miró, Chuck Wooters, J. Hernando
- PhysicsIEEE Transactions on Audio, Speech, and Language…
- 1 September 2007
The use of classic acoustic beamforming techniques is proposed together with several novel algorithms to create a complete frontend for speaker diarization in the meeting room domain and shows improvements in a speech recognition task.
The ICSI RT07s Speaker Diarization System
This paper used the most recent available version of the beam-forming toolkit, implemented a new speech/non-speech detector that does not require models trained on meeting data and performed the development on a much larger set of recordings.
Stochastic pronunciation modelling from hand-labelled phonetic corpora
A robust speaker clustering algorithm
- J. Ajmera, Chuck Wooters
- Computer ScienceIEEE Workshop on Automatic Speech Recognition and…
- 30 November 2003
The algorithm automatically performs both speaker segmentation and clustering without any prior knowledge of the identities or the number of speakers and has the following advantages: no threshold adjustment requirements; no need for training/development data; and robustness to different data conditions.
Building a Large Lexical Databank Which Provides Deep Semantics
The database will show the semantic and syntactic combinatorial possibilities (based on frame membership) of the lexical items it includes, as these are documented through grammatical and semantic annotations of sentences extracted from a large corpus of contemporary written English.
Speech Recognition for Illiterate Access to Information and Technology
- M. Plauché, Udhyakumar Nallasamy, J. Pal, Chuck Wooters, D. Ramachandran
- Computer ScienceInternational Conference on Information and…
- 1 May 2006
This paper presents an inexpensive approach for gathering the linguistic resources needed to power a simple spoken dialog system and addresses the unique social and economic challenges of the developing world by relying on modifiable and highly transparent software and hardware.
TOWARDS ROBUST SPEAKER SEGMENTATION: THE ICSI-SRI FALL 2004 DIARIZATION SYSTEM
The ICSI-SRI system is an agglomerative clustering system that uses a BIC-like measure to determine when to stop merging clusters and to decide which pairs of clusters to merge, providing robustness and portability.
Robust speaker diarization for meetings: ICSI RT06s evaluation system
Four of the main improvements to the ICSI speaker diarization system submitted for the NIST Rich Transcription evaluation (RT06s) conducted on the meetings environment are introduced: a new training-free speech/non-speech detection algorithm, a new algorithm for system initialization, and a frame purification algorithm to increase clusters differentiability.
Speaker Diarization For Multiple-Distant-Microphone Meetings Using Several Sources of Information
The correlation between signals coming from multiple microphones is analyzed and an improved method for carrying out speaker diarization for meetings with multiple distant microphones is proposed, improving the Diarization Error Rate (DER) by 15% to 20% relative to previous systems.