ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
- Zhenzhong Lan, Mingda Chen, Sebastian Goodman, Kevin Gimpel, Piyush Sharma, Radu Soricut
- Computer ScienceInternational Conference on Learning…
- 26 September 2019
This work presents two parameter-reduction techniques to lower memory consumption and increase the training speed of BERT, and uses a self-supervised loss that focuses on modeling inter-sentence coherence.
Joint segmentation and classification of human actions in video
- Minh Hoai Nguyen, Zhenzhong Lan, F. D. L. Torre
- Computer ScienceComputer Vision and Pattern Recognition
- 20 June 2011
This method is based on a discriminative temporal extension of the spatial bag-of-words model that has been very popular in object recognition and is performed robustly within a multi-class SVM framework whereas the inference over the segments is done efficiently with dynamic programming.
Self-Paced Learning with Diversity
- Lu Jiang, Deyu Meng, Shoou-I Yu, Zhenzhong Lan, S. Shan, Alexander Hauptmann
- Computer Science, EducationNIPS
- 8 December 2014
This work proposes an approach called self-paced learning with diversity (SPLD) which formalizes the preference for both easy and diverse samples into a general regularization term, independent of the learning objective, and thus can be easily generalized into various learning tasks.
Beyond Gaussian Pyramid: Multi-skip Feature Stacking for action recognition
- Zhenzhong Lan, Ming Lin, Xuanchong Li, Alexander Hauptmann, B. Raj
- Computer ScienceComputer Vision and Pattern Recognition
- 24 November 2014
This work proposes a novel feature enhancing technique called Multi-skIp Feature Stacking (MIFS), which stacks features extracted using a family of differential filters parameterized with multiple time skips and encodes shift-invariance into the frequency space and proves that MIFS enhances the learnability of differential-based features exponentially.
Hidden Two-Stream Convolutional Networks for Action Recognition
- Yi Zhu, Zhenzhong Lan, S. Newsam, Alexander Hauptmann
- Computer ScienceAsian Conference on Computer Vision
- 2 April 2017
This paper presents a novel CNN architecture that implicitly captures motion information between adjacent frames and directly predicts action classes without explicitly computing optical flow, and significantly outperforms the previous best real-time approaches.
Deep Local Video Feature for Action Recognition
- Zhenzhong Lan, Yi Zhu, A. Hauptmann, S. Newsam
- Computer ScienceIEEE Conference on Computer Vision and Pattern…
- 25 January 2017
Experimental results on the HMDB51 and UCF101 datasets show that a simple maximum pooling on the sparsely sampled local features leads to significant performance improvement.
Double Fusion for Multimedia Event Detection
- Zhenzhong Lan, Lei Bao, Shoou-I Yu, Wei Liu, Alexander Hauptmann
- Computer ScienceConference on Multimedia Modeling
- 4 January 2012
This paper introduces a fusion scheme named double fusion, which combines early fusion and late fusion together to incorporate their advantages and results are reported on TRECVID MED 2010 and 2011 data sets.
Extending Semi-supervised Learning Methods for Inductive Transfer Learning
- Yuan Shi, Zhenzhong Lan, W. Liu, Wei Bi
- Computer ScienceNinth IEEE International Conference on Data…
- 6 December 2009
A new transfer learning method is developed, namely COITL, by extending the co-training method in semi-supervised learning by pointing out that many semi- supervised learning methods can be extended for inductive transfer learning, if the step of labeling an unlabeled instance is replaced by re-weighting a diff-distribution instance.
Multimedia classification and event detection using double fusion
- Zhenzhong Lan, Lei Bao, Shoou-I Yu, Wei Liu, Alexander Hauptmann
- Computer ScienceMultimedia tools and applications
- 1 July 2014
This paper introduces a fusion scheme named double fusion, which simply combines early fusion and late fusion together to incorporate their advantages, and reports the best reported results to date.
Viral Video Style: A Closer Look at Viral Videos on YouTube
- Lu Jiang, Y. Miao, Yezhou Yang, Zhenzhong Lan, Alexander Hauptmann
- Computer ScienceInternational Conference on Multimedia Retrieval
- 1 April 2014
The proposed method is unique in that it is the first attempt to incorporate video metadata into the peak day prediction, and outperforms the state-of-the-art methods, with statistically significant differences.
...
...