Learning Spatio-temporal Features with Partial Expression Sequences for on-the-Fly Prediction


Spatio-temporal feature encoding is essential for capturing facial expression dynamics in video sequences. At test time, most spatio-temporal encoding methods assume that a temporally segmented sequence is fed to a learned model, which could require the prediction to wait until the full sequence is available to an auxiliary task that performs the temporal…

