Robert E. Yantorno

The Modified Bark Spectral Distortion (MBSD), an objective speech quality measure, was presented previously [1]. The MBSD measure takes the noise masking threshold into account so that only audible distortions are used in calculating the distortion measure. Preliminary simulation results have shown improvement of the MBSD over the…
The Modified Bark Spectral Distortion (MBSD), an objective speech quality measure, was presented previously [1][2]. The MBSD measure estimates speech distortion in the loudness domain, taking the noise masking threshold into account so that only audible distortions are included in calculating the distortion measure. Preliminary simulation…
A new system for interpreting infant cries from the infant's facial image and cry sounds is presented in this paper. The system is designed to analyze the facial image and sound of the crying infant to derive the reason why the infant is crying. The image and the sound represent the same cry event. The image processing module determines the state of certain…
Anisosmotic cell swelling triggers a regulatory volume decrease (RVD) in cell lines derived from human nonpigmented ciliary epithelium. Measurements of cell volume have indicated that the RVD reflects activation of K+ and/or Cl- channels. We have begun to characterize the putative channels by whole cell patch clamping. The results obtained by altering the…
Recently, a "usable speech" extraction system [1] was proposed to separate co-channel speech into "usable" frames that are minimally corrupted by interfering speech. Studies indicate [2] that a significant amount of co-channel speech can be considered "usable" for speaker identification (SID). Therefore, it is necessary to establish criteria for…
Recently, a novel method to process co-channel speech was proposed [1]. Previous methods include enhancing the target speech, suppressing the interfering speech, or both. The proposed method searches for usable speech frames, which are usually found in clusters under co-channel conditions. The term…
A novel approach to co-channel speaker separation is presented here. The technique uses the statistical properties of combinations of high Target-to-Interferer Ratio (TIR) speech segments, which were extracted from a 0 dB overall TIR co-channel utterance. The problem is broken down into making three simpler decisions. First, closed-set speaker…
The accuracy of speech processing techniques degrades when operating in a co-channel environment. Co-channel speech occurs when more than one person is talking at the same time. The idea of usable speech segmentation is to identify and extract those portions of co-channel speech that are minimally degraded and therefore still useful for speech processing application…
Speech that is corrupted by nonstationary interference, but contains segments that are still usable for applications such as speaker identification or speech recognition, is referred to as "usable" speech. A common example of nonstationary interference occurs when there is more than one person talking at the same time, which is referred to as co-channel…
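
The notion of "usable" frames in the abstracts above can be illustrated with a frame-level Target-to-Interferer Ratio (TIR). The following is a minimal sketch, not the authors' method: it assumes the target and interferer signals are available separately (as in a simulation), uses non-overlapping frames, and applies a hypothetical threshold. The function names `frame_tir_db` and `usable_frames`, and the default of 20 dB, are illustrative assumptions that do not come from the abstracts.

```python
import math


def frame_tir_db(target, interferer, frame_len):
    """Return the TIR in dB for each non-overlapping frame.

    Assumes both signals are equal-rate sample sequences; any trailing
    samples that do not fill a whole frame are ignored.
    """
    n_frames = min(len(target), len(interferer)) // frame_len
    tirs = []
    for i in range(n_frames):
        start = i * frame_len
        t_energy = sum(x * x for x in target[start:start + frame_len])
        i_energy = sum(x * x for x in interferer[start:start + frame_len])
        if i_energy == 0.0:
            # No interference in this frame (a frame where both signals
            # are silent also lands here; a real system would handle
            # silence separately).
            tirs.append(math.inf)
        elif t_energy == 0.0:
            tirs.append(-math.inf)
        else:
            tirs.append(10.0 * math.log10(t_energy / i_energy))
    return tirs


def usable_frames(tirs, threshold_db=20.0):
    """Flag frames whose TIR meets a hypothetical usability threshold."""
    return [t >= threshold_db for t in tirs]


if __name__ == "__main__":
    # Toy signals: the target dominates frame 0, the interferer frame 1.
    target = [1.0] * 4 + [0.1] * 4
    interferer = [0.1] * 4 + [1.0] * 4
    tirs = frame_tir_db(target, interferer, frame_len=4)
    print(tirs)
    print(usable_frames(tirs, threshold_db=10.0))
```

In practice the separate signals are not available from a co-channel recording, which is why the abstracts speak of establishing criteria for detecting usable frames; the sketch only shows how a TIR-based label could be computed when ground truth exists.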