We present a technique to jointly learn high-level abstractions of sequential features (such as pitch and MFCCs) and combine them with time-aggregated features (such as mean length of pauses and recognizer confidence scores) to optimize the automated scoring of non-native spoken responses. We use a bidirectional long short-term memory (BLSTM) network, a…
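The abstract is cut off before it describes the architecture in detail, so the following is only a minimal sketch of the general idea, not the authors' model. It assumes that the final forward and backward LSTM states are concatenated with the time-aggregated feature vector and passed to a single linear output unit. All names (`make_lstm_params`, `lstm_last_state`, `score_response`), the dimensions, and the random, untrained weights are hypothetical, so the resulting score carries no meaning beyond showing the data flow.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def make_lstm_params(input_dim, hidden_dim, rng):
    """Random (untrained) weights for the four LSTM gates: i, f, g, o."""
    def matrix(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]
    return {
        gate: {"W": matrix(hidden_dim, input_dim + hidden_dim),
               "b": [0.0] * hidden_dim}
        for gate in ("i", "f", "g", "o")
    }


def affine(W, b, v):
    """Compute W @ v + b with plain lists."""
    return [sum(w * x for w, x in zip(row, v)) + bias for row, bias in zip(W, b)]


def lstm_last_state(frames, params, hidden_dim):
    """Run one LSTM direction over the frames; return the final hidden state."""
    h = [0.0] * hidden_dim
    c = [0.0] * hidden_dim
    for x in frames:
        z = list(x) + h  # current frame concatenated with previous hidden state
        i = [sigmoid(a) for a in affine(params["i"]["W"], params["i"]["b"], z)]
        f = [sigmoid(a) for a in affine(params["f"]["W"], params["f"]["b"], z)]
        g = [math.tanh(a) for a in affine(params["g"]["W"], params["g"]["b"], z)]
        o = [sigmoid(a) for a in affine(params["o"]["W"], params["o"]["b"], z)]
        c = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
        h = [oj * math.tanh(cj) for oj, cj in zip(o, c)]
    return h


def score_response(frames, aggregated, fwd, bwd, out_w, out_b, hidden_dim):
    """Assumed combination: [forward state; backward state; aggregated] -> linear score."""
    h_fwd = lstm_last_state(frames, fwd, hidden_dim)
    h_bwd = lstm_last_state(frames[::-1], bwd, hidden_dim)  # reversed time order
    joint = h_fwd + h_bwd + list(aggregated)
    return sum(w * x for w, x in zip(out_w, joint)) + out_b


# Toy usage with made-up dimensions and synthetic inputs (all hypothetical).
rng = random.Random(0)
hidden_dim, frame_dim, agg_dim = 4, 3, 2
fwd = make_lstm_params(frame_dim, hidden_dim, rng)
bwd = make_lstm_params(frame_dim, hidden_dim, rng)
out_w = [rng.uniform(-0.1, 0.1) for _ in range(2 * hidden_dim + agg_dim)]
frames = [[rng.random() for _ in range(frame_dim)] for _ in range(5)]  # e.g. per-frame pitch/MFCC values
aggregated = [0.42, 0.87]  # e.g. mean pause length, recognizer confidence
score = score_response(frames, aggregated, fwd, bwd, out_w, 0.0, hidden_dim)
```

In practice the weights would be trained end to end against human-assigned scores, typically with a deep learning framework rather than hand-written list arithmetic. The point of the sketch is only that the frame-level sequence and the response-level aggregates enter the model through different paths before being merged.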