AP16-OL7: A multilingual database for oriental languages and a language recognition baseline

@article{Wang2016AP16OL7AM,
  title={AP16-OL7: A multilingual database for oriental languages and a language recognition baseline},
  author={Dong Wang and Lantian Li and Difei Tang and Qing Chen},
  journal={2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)},
  year={2016},
  pages={1-5}
}
  • Dong Wang, Lantian Li, +1 author Q. Chen
  • Published 27 September 2016
  • Computer Science
  • 2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)
We present the AP16-OL7 database which was released as the training and test data for the oriental language recognition (OLR) challenge on APSIPA 2016. [...] Key Method Based on the database, a baseline system was constructed on the basis of the i-vector model. We report the baseline results evaluated in various metrics defined by the AP16-OLR evaluation plan and demonstrate that AP16-OL7 is a reasonable data resource for multilingual research.Expand
AP17-OLR challenge: Data, plan, and baseline
TLDR
The baseline results are evaluated with various metrics defined by the AP17-OLR evaluation plan and it is demonstrated that the combined database is a reasonable data resource for multilingual research. Expand
OLR 2021 Challenge: Datasets, Rules and Baselines
  • Binling Wang, Wenxuan Hu, +7 authors Cheng Yang
  • Computer Science, Engineering
  • ArXiv
  • 2021
TLDR
The sixth Oriental Language Recognition 2021 Challenge, which intends to improve the performance of language recognition systems and speech recognition systems within multilingual scenarios, is introduced, with the data profile, four tasks, two baselines, and the evaluation principles introduced. Expand
Oriental Language Recognition (OLR) 2020: Summary and Analysis
  • Jing Li, Binling Wang, +4 authors Dong Wang
  • Computer Science, Engineering
  • Interspeech 2021
  • 2021
TLDR
This paper describes the three tasks, the database profile, and the final results, and outlines the novel approaches that improve the performance of language recognition systems most significantly, such as the utilization of auxiliary information. Expand
AP20-OLR Challenge: Three Tasks and Their Baselines
  • Z. Li, Miao Zhao, +5 authors Cheng Yang
  • Computer Science
  • 2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
  • 2020
TLDR
The fifth oriental language recognition (OLR) challenge AP20-OLR is introduced, which intends to improve the performance of language recognition systems, along with APSIPA Annual Summit and Conference (APSIPA ASC). Expand
AP19-OLR Challenge: Three Tasks and Their Baselines
  • Zhiyuan Tang, D. Wang, Liming Song
  • Computer Science, Engineering
  • 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
  • 2019
TLDR
The fourth oriental language recognition (OLR) challenge AP19-OLR is introduced, including the data profile, the tasks and the evaluation principles, and it is demonstrated that the three tasks are worth some efforts to achieve better performance. Expand
DKU-Tencent Submission to Oriental Language Recognition AP18-OLR Challenge
  • Haiwei Wu, W. Cai, +4 authors Shen Huang
  • Computer Science
  • 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
  • 2019
TLDR
This paper proposes a deep residual network to obtain the utterance-level language posteriors in an end-to-end manner through i- vector and x-vector modelings for oriental language recognition AP18- OLR Challenge. Expand
C L ] 2 3 Ju l 2 02 1 OLR 2021 CHALLENGE : DATASETS , RULES AND BASELINES
This paper introduces the sixth Oriental Language Recognition (OLR) 2021 Challenge, which intends to improve the performance of language recognition systems and speech recognition systems withinExpand
A Study on Low-resource Language Identification
  • Zhaodi Qi, Yong Ma, M. Gu
  • Computer Science
  • 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
  • 2019
TLDR
Two approaches are investigated to deal with the LID task on low-resource languages: data augmentation, which enlarges the data set by incorporating various distortions into the original data; and the second approach is multi-lingual bottleneck feature extraction, which extracts multiple sets of bottleneck features (BNF) based on speech recognition systems of multiple languages. Expand
Multilingual Convolutional, Long Short-Term Memory, Deep Neural Networks for Low Resource Speech Recognition
TLDR
This paper uses CNNs and DNNs for multilingual speech recognition, for the prediction and correction (PAC) architecture, in order to calculate the state probability, and proposes a proposed model known as PAC-MCLDNN. Expand
Multi-task Learning in Prediction and Correction for Low Resource Speech Recognition
TLDR
The performance of Multitask learning (MTL) for the combined model of Convolutional, Long Short-Term Memory and Deep neural Networks (CLDNN) for low resource speech recognition tasks is investigated and extra grapheme modeling task is still efficient with one hour of training data making it highly beneficial for zero resource languages. Expand
...
1
2
3
...

References

SHOWING 1-10 OF 23 REFERENCES
BABEL: an Eastern European multi-language database
  • P. Roach, S. Arnfield, +11 authors K. Vicsi
  • Computer Science
  • Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96
  • 1996
TLDR
The project is producing a multi-language database of five of the most widely-differing Eastern European languages (Bulgarian, Estonian, Hungarian, Romanian and Polish). Expand
Improvements on hierarchical language identification based on automatic language clustering
Hierarchical language identification (HLID) is a novel framework for combining multiple features or primary systems in language identification. In this paper, several key components of HLID areExpand
Globalphone: a multilingual speech and text database developed at karlsruhe university
TLDR
The design, collection, and current status of the multilingual database GlobalPhone is described, an ongoing project since 1995 at Karlsruhe University, which is suitable for the development of large vocabulary speech recognition systems in many languages. Expand
Multilingual Speech Databases at LDC
TLDR
The Linguistic Data Consortium intends to provide the resources needed for research and development activities, especially in telephone-based, small-vocabulary recognition applications; language identification research; and large vocabulary continuous speech recognition research. Expand
Language Recognition via i-vectors and Dimensionality Reduction
TLDR
A new language identification system is presented based on the total variability approach previously developed in the field of speaker identification and developed results in excellent performance on the 2009 LRE evaluation set without the need for any post-processing or backend techniques. Expand
Speechdat-e: five eastern european speech databases for voice-operated teleservices completed
TLDR
The Speechdat-E project five medium large telephone speech databases have been collected for Czech, Hungarian, Polish, Russian, and Slovak and experiences gained from the data recordings and from the validation of the databases are elaborated on. Expand
SPEECHDAT-CAR. A Large Speech Database for Automotive Environments
TLDR
An overview of the SpeechDat-Car project is given with a focus on the production phases (recording platforms, speaker recruitment, annotation and distribution). Expand
AREAL LINGUISTICS AND MAINLAND SOUTHEAST ASIA
AbstractMainland Southeast Asia provides a dramatic demonstration of the areal phenomenon in linguistics: When languages are spoken historically in the same location they often show significantExpand
Front-End Factor Analysis for Speaker Verification
TLDR
An extension of the previous work which proposes a new speaker representation for speaker verification, a new low-dimensional speaker- and channel-dependent space is defined using a simple factor analysis, named the total variability space because it models both speaker and channel variabilities. Expand
The languages of Japan
TLDR
The Ainu Language: 1. Sound structure 2. Grammatical structure 3. Words formation 4. Dialects 5. Genetic affiliation 6. Phonology 7. Lexicon 8. phonology 9. Word formation 12. Epilogue References Index. Expand
...
1
2
3
...