• Corpus ID: 212628526

Toward Adaptive Guidance: Modeling the Variety of User Behaviors in Continuous-Skill-Improving Experiences of Machine Operation Tasks

@article{Chen2020TowardAG,
  title={Toward Adaptive Guidance: Modeling the Variety of User Behaviors in Continuous-Skill-Improving Experiences of Machine Operation Tasks},
  author={Long-fei Chen and Yuichi Nakamura and Kazuaki Kondo},
  journal={ArXiv},
  year={2020},
  volume={abs/2003.03025}
}
An adaptive guidance system that supports equipment operators requires a comprehensive model of task and user behavior that considers different skill and knowledge levels as well as diverse situations. In this study, we investigated the relationships between user behaviors and skill levels under operational conditions. We captured sixty samples of two sewing tasks performed by five operators using a head-mounted RGB-D camera and a static gaze tracker. We examined the operators’ gaze and head… 
1 Citation

Piano Skills Assessment

A first-of-its-kind dataset for multimodal skill assessment, focused on assessing a piano player’s skill level, is collected; questions are answered, work in automated evaluation of piano playing skills is initiated, and baselines for future work are provided.

References


Manipulation-Skill Assessment from Videos with Spatial Attention Network

A novel RNN-based spatial attention model is proposed that considers the accumulated attention state from previous frames as well as high-level information about the progress of an ongoing task in automatic skill assessment.

Automated capture and delivery of assistive task guidance with an eyewear computer: the GlaciAR system

An assistive mixed reality system that aims to augment users in tasks by combining automated and unsupervised information collection with minimally invasive video guides, called GlaciAR (Glass-enabled Contextual Interactions for Augmented Reality).

Hotspots Integrating of Expert and Beginner Experiences of Machine Operations through Egocentric Vision

A framework for integrating beginners' and experts' experiences into a unified operation model based on the experiences of a wide range of users was developed and showed good potential in modeling the common and different behaviors among experts and beginners.

The coordination of eye, head, and hand movements in a natural task

The temporal coordination of eye and head in gaze changes is usually the consequence of a synergistic linkage rather than an obligatory one, which suggests that observers maintain coordination by setting up a temporary, task-specific synergy between the eye and hand.

Vision, eye movements, and natural behavior

  • M. Land
  • Psychology
    Visual Neuroscience
  • 2009
The main conclusions are that complex action sequences consist of a succession of individual object-related actions, each of which typically involves a turn toward the object (if needed), followed by fixation and finally manipulation monitored by vision.

You-Do, I-Learn: Discovering Task Relevant Objects and their Modes of Interaction from Multi-User Egocentric Video

This work presents a fully unsupervised approach for the discovery of i) task relevant objects and ii) how these objects have been used and introduces the term Mode of Interaction (MOI) to refer to the different ways in which TROs are used.

Relative Hidden Markov Models for Video-Based Evaluation of Motion Skills in Surgical Training

  • Qiang Zhang, Baoxin Li
  • Computer Science
    IEEE Transactions on Pattern Analysis and Machine Intelligence
  • 2015
Experimental results suggest that the proposed approach provides a promising solution to video-based motion skill evaluation and the potential of generalizing the method to other applications of temporal analysis is illustrated.

What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment

  • Paritosh Parmar, B. Morris
  • Computer Science
    2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
  • 2019
This paper proposes to learn spatio-temporal features that explain three related tasks: fine-grained action recognition, commentary generation, and estimating the AQA score. It shows that the MTL approach outperforms the STL approach using two different kinds of architectures: C3D-AVG and MSCADC.

Multi-User Egocentric Online System for Unsupervised Assistance on Object Usage

The paper proposes a method for selecting a suitable video guide to be displayed to a novice user indicating how to use an object, purely triggered by the user’s gaze.