Learn More
Many recent studies show that Augmented Reality (AR) and Automatic Speech Recognition (ASR) technologies can be used to help people with disabilities. Many of these studies have been performed only in their specialized field. Audio-Visual Speech Recognition (AVSR) is one of the advances in ASR technology that combines audio, video, and facial expressions to(More)
This paper highlights the differences in spectral features between British, Australian and American English accents and applies the cross-entropy information measure for comparative quantification of the impacts of the variations of accents, speaker groups and recordings on the probability models of spectral features of phonetic units of speech. Comparison(More)
Image matching plays an important role in many aspects of computer vision. Our proposed method is based on Scale Invariant Feature Transform (SIFT) which is one of the popular image matching methods. The main ideas behind our method are removing the excess keypoints, adding oriented patterns to descriptor, and decreasing the size of the descriptors. By(More)
In this paper, a low order recursive linear prediction method and recursive least square as an adaptive filter (LP-RLS) are introduced to predict the speech and the excitation signals. In real-time packet-based communication systems, one major problem is misrouted or delayed packets which results in degraded perceived voice quality. If packets are not(More)
Medical images include information about human body which are used for different purposes such as surgical and diagnostic plans. Compression of medical images is used in some applications such as profiling patient's data and transmission systems. Regard to importance of medical images information, lossless or near-lossless compression is preferred. Lossless(More)
In real-time packet-based communication systems, one major problem is misrouted or delayed packets which results in degraded perceived voice quality. If packets are not available on time, the packet is known as lost packet. The easiest task of a network terminal receiver is to replace silence for the duration of lost speech segments. In a high quality(More)
In this paper, a Kalman filter technique which is operated in time is introduced for noise reduction on CT set of projections to reconstruct medical images. The experiments were done on medical image of kidneys and the simulated projections are captured by CT scanner. Evaluation results indicated that as the number of projections increase in the collected(More)
This paper investigates the use of cross-entropy information measure for quantification and comparison of the impact of the variations of accents, speaker groups and recordings on the probability models of spectral features of phonetic units of speech. Cross-entropy measure can be used in applications such as accent identification, improved speech(More)