Toni Hirvonen

Learn More
The localization of virtual sources generated with different two-dimensional (2-D) multichannel reproduction systems has been studied by means of auditory model simulations and listening tests. The reproduction was implemented with typical five- and eight-channel loudspeaker setups. The microphone systems used were first- and second-order Ambisonics as well(More)
Compressed sensing (CS) samples signals at a much lower rate than the Nyquist rate if they are sparse in some basis. In this paper, the CS methodology is applied to sinusoidally modeled audio signals. As this model is sparse by definition in the frequency domain (being equal to the sum of a small number of sinusoids), we investigate whether CS can be used(More)
In this paper, the compressed sensing (CS) methodology is applied to the harmonic part of sinusoidally-modeled audio signals. As this part of the model is sparse by definition in the frequency domain, we investigate how CS can be used to encode this signal at low bitrates, instead of encoding the sinusoidal parameters (amplitude, frequency, phase) as(More)
This paper performs a comparative analysis of the perceived quality of (unidirectional, non-interactive) VoIP calls under various wireless network conditions (e.g., handover, high traffic demand). It employs the PESQ tool, E-model and auditory tests to evaluate the impact of these network conditions on the perceived quality of VoIP calls. It also reveals(More)
This paper focuses on a comparative statistical analysis of the performance of VoIP calls under various situations, namely, during a handover and under different background traffic conditions at a wireless access point (AP). Using empirical-based measurements, it demonstrates that these network conditions exhibit distinct statistical behaviour, in terms of(More)
Audio signals are represented via the sinusoidal model as a summation of a small number of sinusoids. This approach introduces sparsity to the audio signals in the frequency domain, which is exploited in this paper by applying Compressed Sensing (CS) to this sparse representation. CS allows sampling of signals at a much lower rate than the Nyquist rate if(More)
Sinusoidal modeling of audio requires the model parameters to be selected by analyzing the original signal spectrum. This paper proposes two improvements in sinusoidal selection by considering how psychoacoustic masking curves can be calculated using a top-down strategy in certain situations. First, a non-iterative component selection method to be used in(More)
A recently proposed auditory model is examined using simulated binaural masking level difference (BMLD) and dichotic pitch (Huggins and binaural edge pitch) stimuli. The model is based on calculating the instantaneous interaural level difference, i.e. the difference between the left and right ear neural signals. The model output produces pronounced maximum(More)