Learn More
There has recently been an increasing interest in the generation of a sound field that is audible in one spatial region and inaudible in an adjacent region. The method proposed here ensures the control of the amplitude and phase of multiple acoustic sources in order to maximize the acoustic energy difference between two adjacent regions while also ensuring(More)
In this paper, we propose a method of restoring principal to ambient energy ratio (PAR) at the decoder in the principal component analysis (PCA)-based parametric audio coding. The conventional approach applies the post-scaling at the decoder using the energy information extracted from the input signal at the encoder. However, this approach has a problem(More)
The use of distinguishable complex vibrations that have multiple spectral components can improve the transfer of information by vibrotactile interfaces. We investigated the qualitative characteristics of dual-frequency vibrations as the simplest complex vibrations compared to single-frequency vibrations. Two psychophysical experiments were conducted to(More)
This paper proposes a pitch-synchronous deep neural network (DNN)-based statistical parametric speech synthesis (SPSS) system. The pitch-synchronous frames defined by the locations of glottal closure instants (GCIs) are used to extract speech parameters, which significantly reduce coupling effects between vocal tract and excitation signals. As a result, the(More)
This paper presents a panoramic video transmission system using spatially divided tiles based on the spatial relationships descriptions of MPEG-DASH. The proposed server system provides tiling and encoding functionalities for ROI-based retrieving. Moreover, it guarantees the temporal and spatial synchronization when rendering multiple tiles by a(More)