ChaLearn Looking at People RGB-D Isolated and Continuous Datasets for Gesture Recognition
- Jun Wan, S. Li, Yibing Zhao, Shuai Zhou, I. Ramadass Subramanian, Sergio Escalera
- Computer Science, IEEE Conference on Computer Vision and Pattern…
- 1 June 2016
Two large multi-modal video datasets for RGB and RGB-D gesture recognition are presented, along with a baseline method based on the bag-of-visual-words model, designed for gesture classification from segmented data.
ChaLearn Looking at People Challenge 2014: Dataset and Results
- Sergio Escalera, Xavier Baró, I. Ramadass Subramanian
- Computer Science, ECCV Workshops
- 6 September 2014
In this edition of the ChaLearn challenge, two large novel datasets were made publicly available and the Microsoft Codalab platform was used to manage the competition.
ChaLearn Looking at People 2015: Apparent Age and Cultural Event Recognition Datasets and Results
- Sergio Escalera, J. Fabian, I. Ramadass Subramanian
- Computer Science, IEEE International Conference on Computer Vision…
- 7 December 2015
A crowd-sourcing application was developed to collect and label data about the apparent age of people (as opposed to their real age); for cultural event recognition, one hundred categories had to be recognized.
ChaLearn LAP 2016: First Round Challenge on First Impressions - Dataset and Results
- V. Ponce-López, Baiyu Chen, Sergio Escalera
- Computer Science, ECCV Workshops
- 8 October 2016
This paper summarizes the data and results of the first round of the ChaLearn Looking at People 2016 First Impressions challenge, in which teams automatically evaluated five “apparent” personality traits, as labeled by human judgment, from videos of subjects speaking in front of a camera.
Bi-Directional ConvLSTM U-Net with Densley Connected Convolutions
- Reza Azad, Maryam Asadi-Aghbolaghi, M. Fathy, Sergio Escalera
- Computer Science, IEEE/CVF International Conference on Computer…
- 31 August 2019
This paper proposes an extension of U-Net, Bi-directional ConvLSTM U-Net with Densely connected convolutions (BCDU-Net), for medical image segmentation, which takes full advantage of U-Net, bi-directional ConvLSTM (BConvLSTM), and the mechanism of dense convolutions.
On the Decoding Process in Ternary Error-Correcting Output Codes
- Sergio Escalera, O. Pujol, P. Radeva
- Computer Science, IEEE Transactions on Pattern Analysis and Machine…
- 2010
A taxonomy is presented that embeds all binary and ternary ECOC decoding strategies into four groups and shows that the zero symbol introduces two kinds of biases that require redefinition of the decoding design.
Multi-modal gesture recognition challenge 2013: dataset and results
- Sergio Escalera, Jordi Gonzàlez, H. Escalante
- Computer Science, International Conference on Multimodal…
- 9 December 2013
A challenge on multi-modal gesture recognition with 54 international teams is presented, providing audio, skeletal model, user mask, RGB, and depth images; outstanding results were obtained by the first-ranked participants.
A Dataset and Benchmark for Large-Scale Multi-Modal Face Anti-Spoofing
- Shifeng Zhang, Xiaobo Wang, S. Li
- Computer Science, Computer Vision and Pattern Recognition
- 2 December 2018
A large-scale multi-modal dataset, namely CASIA-SURF, is introduced, which is the largest publicly available dataset for face anti-spoofing in terms of both subjects and visual modalities. A new multi-modal fusion method is also presented, which performs feature re-weighting to select the more informative channel features while suppressing the less useful ones for each modality.
Deep Structure Inference Network for Facial Action Unit Recognition
- C. Corneanu, M. Madadi, Sergio Escalera
- Computer Science, European Conference on Computer Vision
- 15 March 2018
A deep neural architecture is proposed that combines learned local and global features in its initial stages and, in later stages, replicates a message passing algorithm between classes, similar to a graphical model inference approach, to improve state-of-the-art performance.
LSTA: Long Short-Term Attention for Egocentric Action Recognition
- Swathikiran Sudhakaran, Sergio Escalera, O. Lanz
- Computer Science, Computer Vision and Pattern Recognition
- 26 November 2018
This paper proposes LSTA as a mechanism to focus on features from spatially relevant parts while attention is tracked smoothly across the video sequence, achieving state-of-the-art performance on four standard benchmarks.
...