Published on Fri Nov 29 2019

Kurdish (Sorani) Speech to Text: Presenting an Experimental Dataset

Akam Qader, Hossein Hassani
Abstract

We present an experimental dataset, Basic Dataset for Sorani Kurdish Automatic Speech Recognition (BD-4SK-ASR), which we used in the first attempt to develop an automatic speech recognition system for Sorani Kurdish. The objective of the project was to develop a system that could automatically recognize simple sentences based on the vocabulary used in grades one to three of the primary schools in the Kurdistan Region of Iraq. We used CMUSphinx as our experimental environment and developed the dataset to train the system. The dataset is publicly available for non-commercial use under the CC BY-NC-SA 4.0 license.

Thu Aug 13 2020
Machine Learning
MASRI-HEADSET: A Maltese Corpus for Speech Recognition
Sun Feb 16 2020
NLP
Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Ainu is an unwritten language spoken by the Ainu people. It is recognized as critically endangered by UNESCO. A considerable number of voice recordings of Ainu folklore have been produced to preserve their culture.
Tue Feb 09 2021
NLP
BembaSpeech: A Speech Recognition Corpus for the Bemba Language
We present a preprocessed, ready-to-use automatic speech recognition corpus. It consists of over 24 hours of read speech in the Bemba language, a low-resourced language spoken by over 30% of the population in Zambia.
Tue Sep 22 2020
NLP
A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
The Kazakh speech corpus (KSC) contains around 332 hours of transcribed audio. It contains 153,000 utterances spoken by participants from different regions and age groups.
Mon Sep 07 2020
NLP
KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition
KoSpeech is a modular and extensible open-source toolkit for Korean automatic speech recognition (ASR), based on the deep learning library PyTorch.
Fri Jul 30 2021
NLP
USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments
The Uzbek speech corpus (USC) comprises 958 different speakers with a total of 105 hours of transcribed audio recordings. To ensure high quality, the USC has been manually checked by native speakers. The experimental results are promising for the applicability of the USC to ASR.