Improving the usage of subword-based units for Turkish speech recognition
Citation
G. Çetinkaya, E. Arısoy and M. Saraçlar, (5-7 Oct. 2020). Improving the Usage of Subword-Based Units for Turkish Speech Recognition, 2020 28th Signal Processing and Communications Applications Conference (SIU), pp. 1-4, doi: 10.1109/SIU49456.2020.9302043. Abstract
Subword units are often utilized to achieve better performance in speech recognition because of the high number of observed words in agglutinative languages. In this study, the proper use of subword units is explored in recognition by a reconsideration of details such as silence modeling and position-dependent phones. A modified lexicon by finite-state transducers is implemented to represent the subword units correctly. Also, we experiment with different types of word boundary markers and achieve the best performance by adding a marker both to the left and right side of a subword unit. In our experiments on a Turkish broadcast news dataset, the subword models do outperform word-based models and naive subword implementations. Results show that using proper subword units leads to a relative word error rate (WER) reductions, which is 2.4%, compared with the word level automatic speech recognition (ASR) system for Turkish.