The ICSI Meeting Corpus
- Adam L. Janin, D. Baron, Chuck Wooters
- Computer Science · IEEE International Conference on Acoustics…
- 6 April 2003
A corpus of data from natural meetings that occurred at the International Computer Science Institute in Berkeley, California over the last three years is presented; it supports work in automatic speech recognition, noise robustness, dialog modeling, prosody, rich transcription, information retrieval, and more.
Acoustic Beamforming for Speaker Diarization of Meetings
- Xavier Anguera Miró, Chuck Wooters, J. Hernando
- Physics · IEEE Transactions on Audio, Speech, and Language…
- 1 September 2007
Classic acoustic beamforming techniques are combined with several novel algorithms to create a complete frontend for speaker diarization in the meeting room domain; the approach also shows improvements in a speech recognition task.
The ICSI RT07s Speaker Diarization System
This paper used the most recent available version of the beamforming toolkit, implemented a new speech/non-speech detector that does not require models trained on meeting data, and performed the development on a much larger set of recordings.
A robust speaker clustering algorithm
- J. Ajmera, Chuck Wooters
- Computer Science · IEEE Workshop on Automatic Speech Recognition and…
- 30 November 2003
The algorithm automatically performs both speaker segmentation and clustering without any prior knowledge of the identities or the number of speakers and has the following advantages: no threshold adjustment requirements; no need for training/development data; and robustness to different data conditions.
Stochastic pronunciation modelling from hand-labelled phonetic corpora
Building a Large Lexical Databank Which Provides Deep Semantics
- C. Fillmore, Chuck Wooters, Collin F. Baker
- Linguistics · Pacific Asia Conference on Language, Information…
The database will show the semantic and syntactic combinatorial possibilities (based on frame membership) of the lexical items it includes, as these are documented through grammatical and semantic annotations of sentences extracted from a large corpus of contemporary written English.
Robust Speaker Diarization for meetings
The main improvements to the ICSI speaker diarization system submitted for the NIST Rich Transcription evaluation (RT06s), conducted on the meetings environment, are introduced, including a new training-free speech/non-speech detection algorithm, a new algorithm for system initialization, and a frame purification algorithm to increase cluster differentiability.
Speech Recognition for Illiterate Access to Information and Technology
- M. Plauché, Udhyakumar Nallasamy, J. Pal, Chuck Wooters, D. Ramachandran
- Computer Science · International Conference on Information and…
- 1 May 2006
This paper presents an inexpensive approach for gathering the linguistic resources needed to power a simple spoken dialog system and addresses the unique social and economic challenges of the developing world by relying on modifiable and highly transparent software and hardware.
The ICSI Meeting Project: Resources and Research
A general description of the official ICSI Meeting Corpus, as currently available through the Linguistic Data Consortium, is included; some of the existing and planned annotations that augment the basic transcripts provided there are discussed, and several research efforts that make use of these materials are described.
Speaker Diarization For Multiple-Distant-Microphone Meetings Using Several Sources of Information
The correlation between signals coming from multiple microphones is analyzed and an improved method for carrying out speaker diarization for meetings with multiple distant microphones is proposed, improving the Diarization Error Rate (DER) by 15% to 20% relative to previous systems.