Open-Source Speech Corpus and Initial Speech Recognition Experiments

A.A. Rakhmanov

doi:10.31149/ijhcs.v4i8.3578

A.A. Rakhmanov chief specialist of the territorial administration of the Samarkand region of the Ministry for the Development of Information Technologies and Communications of the Republic of Uzbekistan

DOI: https://doi.org/10.31149/ijhcs.v4i8.3578

Keywords: speech corpus, E2E, USC, ASR, USC, DNN-HMM, WER

Abstract

We present a speech corpus and report preliminary results of automatic speech recognition (ASR) using a hidden deep neural network Markov model (DNNHMM) and an end-to-end (E2E) architecture.

References

1. Speechocean’s Uzbek speech corpus. http://en.speechocean.com/datacenter/details/1847.html, accessed: 2021-05-21
2. Uzbek language. https://en.wikipedia.org/wiki/Uzbek language, accessed: 2021-05-20
3. Voxforge. http://www.voxforge.org/, accessed: 2021-05-11 Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. IEEE Signal Process. Mag. 29(6), 82–97 (2012)
4. Abdel-Hamid, O., Mohamed, A., Jiang, H., Deng, L., Penn, G., Yu, D.: Convolutional neural networks for speech recognition. IEEE ACM Trans. Audio Speech Lang. Process. 22(10), 1533–1545 (2014)

Open-Source Speech Corpus and Initial Speech Recognition Experiments

Abstract

References

Most read articles by the same author(s)

International Journal of Human Computing Studies