Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization
TLDR
This paper proposes a method for performing blind source separation (BSS) and blind dereverberation (BD) at the same time for speech mixtures.
  • 130
  • 12
Audio-visual speech recognition using deep learning
TLDR
This study introduces a connectionist-hidden Markov model (HMM) system for noise-robust AVSR.
  • 266
  • 9
Singer Identification Based on Accompaniment Sound Reduction and Reliable Frame Selection
TLDR
This paper describes a method for automatic singer identification from polyphonic musical audio signals including sounds of various instruments.
  • 63
  • 7
Design and Implementation of Robot Audition System 'HARK' — Open Source Software for Listening to Three Simultaneous Speakers
TLDR
This paper presents the design and implementation of the HARK robot audition software system, which consists of sound source localization modules, sound source separation modules, and automatic speech recognition modules for separated speech signals, and which works on any robot with any microphone configuration.
  • 185
  • 6
A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval
TLDR
This paper describes a method of modeling the characteristics of a singing voice from polyphonic musical audio signals including sounds of various musical instruments.
  • 76
  • 6
Active Audition for Humanoid
TLDR
In this paper, we present an active audition system for the humanoid robot “SIG the humanoid”.
  • 218
  • 5
Lipreading using convolutional neural network
TLDR
In this paper, we propose applying a convolutional neural network (CNN) as a visual feature extraction mechanism for VSR.
  • 79
  • 5
Instrument Identification in Polyphonic Music: Feature Weighting to Minimize Influence of Sound Overlaps
TLDR
We provide a new solution to the problem of feature variations caused by the overlapping of sounds in instrument identification in polyphonic music.
  • 87
  • 5
Flexible Guidance Generation Using User Model in Spoken Dialogue Systems
TLDR
We address user modeling for generating cooperative responses tailored to each user in spoken dialogue systems.
  • 39
  • 5
An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model
TLDR
This paper presents a hybrid music recommender system that ranks musical pieces while efficiently maintaining collaborative and content-based data, i.e., rating scores given by users and acoustic features of audio signals.
  • 137
  • 4