Yasuhiro Kodama

Learn More
For many practical applications of speech recognition systems, it is quite desirable to have an estimate of confidence for each hypothesized word. Unlike previous works on confidence measures , this paper studies features for confidence measures that are extracted from outputs of more than one LVCSR models. More specifically, this paper experimentally(More)
This paper studies speech-driven Web retrieval models which accepts spoken search topics (queries) in the NTCIR-3 Web retrieval task. The major focus of this paper is on improving speech recognition accuracy of spoken queries and then improving retrieval accuracy in speech-driven Web retrieval. We experimentally evaluate the techniques of combining outputs(More)
For many practical applications of speech recognition systems, it is quite desirable to have an estimate of confidence for each hypothesized word. Unlike previous works on confidence measures, we have proposed features for confidence measures that are extracted from outputs of more than one LVCSR models. For further analysis of the proposed confidence(More)
This paper proposes to apply machine learning techniques to the task of combining outputs of multiple LVCSR models. The proposed technique has advantages over that by voting schemes such as ROVER, especially when the majority of participating models are not reliable. In this machine learning framework, as features of machine learning, information such as(More)
The design and implementation of a control program for biped walking robots using the genetic algorithms (GA) are presented. The most difficult problem with biped walking robots is that they have too many possible gaits. Generally it is impossible to find the optimal gait for a given route. In order to control biped walking robots, we have employed GA to(More)
SUMMARY This paper proposes to apply machine learning techniques to the task of combining outputs of multiple LVCSR models, where, as features of machine learning, information such as the models which output the hypothesized word, its part-of-speech, and its syllable length are useful for improving the word recognition rate. Experimental results show that(More)
It is important that the total bandwidth of the multiple streams should not exceed the network bandwidth in order to achieve a stable network flow with high performance in high bandwidth-delay product networks. Software pacing of TCP/IP for each stream sometimes exceeds the specified bandwidth, especially at the beginning of the stream or when buffer was(More)
  • 1