Corpus ID: 215776530

Motion-Based Handwriting Recognition and Word Reconstruction

@article{Chen2021MotionBasedHR,
  title={Motion-Based Handwriting Recognition and Word Reconstruction},
  author={Junshen Chen and Wanze Xie and Yutong He},
  journal={ArXiv},
  year={2021},
  volume={abs/2101.06025}
}
In this project, we leverage a trained single-letter classifier to predict the written word from a continuously written word sequence, by designing a word reconstruction pipeline consisting of a dynamicprogramming algorithm and an auto-correction model. We conduct experiments to optimize models in this pipeline, then employ domain adaptation to explore using this pipeline on unseen data distributions. 

Figures and Tables from this paper

References

SHOWING 1-10 OF 27 REFERENCES
Fully Convolutional Networks for Handwriting Recognition
TLDR
This work introduces an attention based mechanism which can automatically target variants of handwriting, such as slant, stroke width, or noise, and uses both local and global context and mitigates the need for heavy preprocessing steps such as symbol alignment correction. Expand
Motion-Based Handwriting Recognition
TLDR
This project proposes a solution by using a pen equipped with motion sensor to predict the English letters written by the user using machine learning techniques to classify the characters the user is writing. Expand
A Scalable Handwritten Text Recognition System
TLDR
This paper addresses three problems in building large scale multilingual OCR systems: data, efficiency, and integration by using online handwriting data collected for a large scale production online handwriting recognition system, and proposes a line recognition model based on neural networks without recurrent connections. Expand
State-of-the-Art Speech Recognition with Sequence-to-Sequence Models
  • C. Chiu, T. Sainath, +11 authors M. Bacchiani
  • Computer Science, Engineering
  • 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2018
TLDR
A variety of structural and optimization improvements to the Listen, Attend, and Spell model are explored, which significantly improve performance and a multi-head attention architecture is introduced, which offers improvements over the commonly-used single- head attention. Expand
Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Networks
TLDR
A new method is proposed that performs pixel-level classification with a convolutional neural network that separates handwritten and machine-printed components that are mixed and overlapped in documents. Expand
Streaming Automatic Speech Recognition with the Transformer Model
TLDR
This work proposes a transformer based end-to-end ASR system for streaming ASR, where an output must be generated shortly after each spoken word, and applies time-restricted self-attention for the encoder and triggered attention for theEncoder-decoder attention mechanism. Expand
Airwriting: Hands-Free Mobile Text Input by Spotting and Continuous Recognition of 3d-Space Handwriting with Inertial Sensors
TLDR
An input method which enables complex hands-free interaction through 3d handwriting recognition through Hidden Markov Models and a statistical language model is used to enhance recognition performance and restrict the search space. Expand
Pentelligence: Combining Pen Tip Motion and Writing Sounds for Handwritten Digit Recognition
TLDR
Pentelligence is presented, a pen for handwritten digit recognition that operates on regular paper and does not require a separate tracking device, and senses the pen tip's motions and sound emissions when stroking. Expand
Gradient-based learning applied to document recognition
TLDR
This paper reviews various methods applied to handwritten character recognition and compares them on a standard handwritten digit recognition task, and Convolutional neural networks are shown to outperform all other techniques. Expand
Optical character recognition (OCR)
TLDR
The long history of research in this area, commercial success, and the continuing need and ability to handle less restricted forms of text make OCR the most important application area in machine perception to date. Expand
...
1
2
3
...