Nitendra Rajput

In this paper we present a study comparing speech and dialed input voice user interfaces for farmers in Gujarat, India. We ran a controlled, between-subjects experiment with 45 participants. We found that the task completion rates were significantly higher with dialed input, particularly for subjects under age 30 and those with less than an eighth grade...
In developing regions, literacy levels and Internet penetration are considerably low, but phone penetration is high and growing rapidly. In such a setting, the World Wide Telecom Web (WWTW), commonly known as the Spoken Web, provides a compelling alternative to the World Wide Web, since it relies on audio interaction over the phone. WWTW consists of...
The World Wide Web (WWW) enabled quick and easy information dissemination and brought about fundamental changes to various aspects of our lives. However, a very large number of people, mostly in developing regions, are still untouched by this revolution. Compared to PCs, the primary access mechanism to the WWW, mobile phones have made a phenomenal penetration...
This paper describes a morphing-based, audio-driven facial animation system. Based on an incoming audio stream, a face image is animated with full lip synchronization and synthesized expressions. A novel scheme to implement a language-independent system for audio-driven facial animation given a speech recognition system for just one language, in our case,...
83% of the world's population does not have access to the Internet. There is therefore a need for a simple and affordable interaction technology that can enable easy content creation and dissemination for this population. In this paper, we present the design, development and usage pattern of a VoiKiosk system that provides a voice-based kiosk solution for people...
Various sectors in developing countries are typically dominated by a large number of small and micro-businesses that operate in an informal, unorganized manner. Many of these are micro-businesses run by a single person and cannot afford to buy and maintain their own IT infrastructure. For others, easy availability of cheap labour provides a...
We consider the problem of combining visual cues with audio signals for the purpose of improved automatic machine recognition of speech. Although significant progress has been made in machine transcription of large vocabulary continuous speech (LVCSR) over the last few years, the technology to date is most effective only under controlled conditions such as...
One of the several initiatives to bridge the digital divide in developing countries has been the deployment of information kiosks or knowledge centers in villages in rural parts of the country. These kiosks provide services ranging from email, chat and browsing to distance education programs, agricultural services and eGovernance services. A kiosk typically...
Spoken Web proposes to build an alternate web for the underprivileged population that is as yet untouched by the enormous benefits of the Internet and the World Wide Web. Just over 22% of the world's population today has access to the Internet [Stats 2008]. The remaining 78% either cannot afford computers or high-end mobiles, or are textually illiterate or simply reside in...