AP17-OLR challenge: Data, plan, and baseline

@article{Tang2017AP17OLRCD,
  title={AP17-OLR challenge: Data, plan, and baseline},
  author={Zhiyuan Tang and Dong Wang and Yixiang Chen and Qing Chen},
  journal={2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)},
  year={2017},
  pages={749-753}
}
  • Zhiyuan Tang, Dong Wang, +1 author Q. Chen
  • Published 2017
  • Computer Science
  • 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
We present the data profile and the evaluation plan of the second oriental language recognition (OLR) challenge AP17-OLR. Compared to the event last year (AP16-OLR), the new challenge involves more languages and focuses more on short utterances. The data is offered by SpeechOcean and the NSFC M2ASR project. Two types of baselines are constructed to assist the participants, one is based on the i-vector model and the other is based on various neural networks. We report the baseline results… Expand
AP18-OLR Challenge: Three Tasks and Their Baselines
  • Zhiyuan Tang, D. Wang, Q. Chen
  • Computer Science, Engineering
  • 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
  • 2018
TLDR
The third oriental language recognition (OLR) challenge AP18-OLR is introduced in this paper, including the data profile, the tasks and the evaluation principles, and it is demonstrated that the three tasks are truly challenging. Expand
AP19-OLR Challenge: Three Tasks and Their Baselines
  • Zhiyuan Tang, D. Wang, Liming Song
  • Computer Science, Engineering
  • 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
  • 2019
TLDR
The fourth oriental language recognition (OLR) challenge AP19-OLR is introduced, including the data profile, the tasks and the evaluation principles, and it is demonstrated that the three tasks are worth some efforts to achieve better performance. Expand
AP20-OLR Challenge: Three Tasks and Their Baselines
  • Z. Li, Miao Zhao, +5 authors Cheng Yang
  • Computer Science
  • 2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
  • 2020
TLDR
The fifth oriental language recognition (OLR) challenge AP20-OLR is introduced, which intends to improve the performance of language recognition systems, along with APSIPA Annual Summit and Conference (APSIPA ASC). Expand
Ju l 2 01 9 AP 19-OLR Challenge : Three Tasks and Their Baselines
This paper introduces the fourth oriental language recognition (OLR) challenge AP19-OLR, including the data profile, the tasks and the evaluation principles. The OLR challenge has been heldExpand
A ug 2 01 9 AP 19-OLR Challenge : Three Tasks and Their Baselines
This paper introduces the fourth oriental language recognition (OLR) challenge AP19-OLR, including the data profile, the tasks and the evaluation principles. The OLR challenge has been heldExpand
OLR 2021 Challenge: Datasets, Rules and Baselines
  • Binling Wang, Wenxuan Hu, +7 authors Cheng Yang
  • Computer Science, Engineering
  • ArXiv
  • 2021
TLDR
The sixth Oriental Language Recognition 2021 Challenge, which intends to improve the performance of language recognition systems and speech recognition systems within multilingual scenarios, is introduced, with the data profile, four tasks, two baselines, and the evaluation principles introduced. Expand
DKU-Tencent Submission to Oriental Language Recognition AP18-OLR Challenge
  • Haiwei Wu, W. Cai, +4 authors Shen Huang
  • Computer Science
  • 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
  • 2019
TLDR
This paper proposes a deep residual network to obtain the utterance-level language posteriors in an end-to-end manner through i- vector and x-vector modelings for oriental language recognition AP18- OLR Challenge. Expand
The XMUSPEECH System for the AP19-OLR Challenge
TLDR
This paper presents the XMUSPEECH system, a system for the oriental language recognition (OLR) challenge, AP19-OLR, which leveraged the system pipeline from three aspects, including front-end training, back-end processing, and fusion strategy. Expand
Oriental Language Recognition (OLR) 2020: Summary and Analysis
  • Jing Li, Binling Wang, +4 authors Dong Wang
  • Computer Science, Engineering
  • Interspeech 2021
  • 2021
TLDR
This paper describes the three tasks, the database profile, and the final results, and outlines the novel approaches that improve the performance of language recognition systems most significantly, such as the utilization of auxiliary information. Expand
C L ] 2 3 Ju l 2 02 1 OLR 2021 CHALLENGE : DATASETS , RULES AND BASELINES
This paper introduces the sixth Oriental Language Recognition (OLR) 2021 Challenge, which intends to improve the performance of language recognition systems and speech recognition systems withinExpand
...
1
2
3
4
...

References

SHOWING 1-10 OF 21 REFERENCES
AP16-OL7: A multilingual database for oriental languages and a language recognition baseline
TLDR
It is demonstrated that AP16-OL7 is a reasonable data resource for multilingual research and a baseline system was constructed on the basis of the i-vector model to evaluate the baseline results. Expand
Stacked Long-Term TDNN for Spoken Language Recognition
TLDR
A stacked architecture that uses a time delay neural network (TDNN) to model long-term patterns for spoken language identification and provides complementary information to fuse with the new generation of bottleneckbased i-vector systems that model short-term dependencies. Expand
Automatic language identification using deep neural networks
TLDR
This work adapts DNNs to the problem of identifying the language of a given spoken utterance from short-term acoustic features and finds relative improvements up to 70%, in Cavg, over the baseline system. Expand
LID-senone Extraction via Deep Neural Networks for End-to-End Language Identification
TLDR
It is shown that LID-senones combine a compact representation of the original acoustic feature space with a powerful descriptive and discriminative capability and a novel incremental training method is proposed to extract the weak language information buried in the acoustic features of insufficient language resources. Expand
Automatic language identification using long short-term memory recurrent neural networks
TLDR
This work explores the use of Long Short-Term Memory (LSTM) recurrent neural networks (RNNs) for automatic language identification (LID) and shows LSTM RNNs achieve better performance than the best DNN system with an order of magnitude fewer parameters. Expand
Language Identification in Short Utterances Using Long Short-Term Memory (LSTM) Recurrent Neural Networks
TLDR
An open-source, end-to-end, LSTM RNN system running on limited computational resources (a single GPU) that outperforms a reference i-vector system on a subset of the NIST Language Recognition Evaluation (8 target languages, 3s task) by up to a 26%. Expand
Language Identification Using Time Delay Neural Network D-Vector on Short Utterances
TLDR
It was found that the TDNN system performance is close at any durations while i-vector shows good results only at long time, and compared LID TDNN d-vector results to i- vector baseline. Expand
THCHS-30 : A Free Chinese Speech Corpus
TLDR
This paper releases a free Chinese speech database THCHS-30 that can be used to build a full- edged Chinese speech recognition system, and reports the baseline system established with this database, including the performance under highly noisy conditions. Expand
Phonetic Temporal Neural Model for Language Identification
TLDR
A phonetic temporal neural model for LID is presented, which is an LSTM-RNN LID system that accepts phonetic features produced by a phone-discriminative DNN as the input, rather than raw acoustic features. Expand
The Kaldi Speech Recognition Toolkit
TLDR
The design of Kaldi is described, a free, open-source toolkit for speech recognition research that provides a speech recognition system based on finite-state automata together with detailed documentation and a comprehensive set of scripts for building complete recognition systems. Expand
...
1
2
3
...