The ICSI Meeting Corpus
We have collected a corpus of data from natural meetings that occurred at the International Computer Science Institute in Berkeley, California over the last three years.
  • 655
  • 61
Acoustic Beamforming for Speaker Diarization of Meetings
We present an approach, based on popular acoustic beamforming techniques, to obtain a single enhanced signal and speaker-position information from a number of microphones.
  • 320
  • 44
The ICSI RT07s Speaker Diarization System
In this paper, we present the ICSI speaker diarization system. This system was used in the 2007 National Institute of Standards and Technology (NIST) Rich Transcription evaluation. The ICSI system ...
  • 227
  • 30
A robust speaker clustering algorithm
We present a novel speaker segmentation and clustering algorithm with the following advantages: no threshold adjustment requirements; no need for training/development data; and robustness to different data conditions.
  • 268
  • 23
Stochastic pronunciation modelling from hand-labelled phonetic corpora
In the early 1990s, the availability of the TIMIT read-speech phonetically transcribed corpus led to work at AT&T on the automatic inference of pronunciation variation.
  • 162
  • 22
Building a Large Lexical Databank Which Provides Deep Semantics
This paper reports on the design of a lexical database for English which is currently under construction ("FrameNet-2"), and describes the kinds of linguistic facts that the database is intended to make available.
  • 153
  • 20
Towards Robust Speaker Segmentation: The ICSI-SRI Fall 2004 Diarization System
We describe the ICSI-SRI entry in the Fall 2004 DARPA EARS Metadata Evaluation, an agglomerative clustering system that uses a BIC-like measure to determine when to stop merging clusters and to decide which pairs of clusters to merge.
  • 95
  • 9
Speech Recognition for Illiterate Access to Information and Technology
In rural Tamil Nadu and other predominantly illiterate communities throughout the world, computers and technology are currently inaccessible without the help of a literate mediator.
  • 110
  • 8
The ICSI Meeting Project: Resources and Research
This paper provides a progress report on ICSI's Meeting Project, including both the data collected and annotated as part of the project and the research lines such materials support.
  • 117
  • 6
Speaker Diarization For Multiple-Distant-Microphone Meetings Using Several Sources of Information
We analyze the correlation between signals coming from multiple microphones and propose an improved method for carrying out speaker diarization for meetings with multiple distant microphones.
  • 82
  • 6