DESIGNING AN INFORMATION SYSTEM WITH THE POSSIBILITY OF VOICE CONTROL

Keywords: voice control, phoneme, vocal apparatus, artificial neural networks, genetic algorithm, spectral readings, cepstral coefficients

Abstract

The work is devoted to the creation of an information system for recognizing voice commands based on artificial neural networks. With the development of computer systems, it is becoming more and more obvious that the use of speech recognition systems will be greatly expanded if it becomes possible to use human language when working directly with the computer, and in particular, it becomes possible to control the machine with a normal voice in real time, as well as input and output information in the form of ordinary human language. One of the promising ways of organizing human-machine interaction is the transmission of user instructions to the computer system in the form of language commands. Voice interface is a necessary component when it comes to creating comfortable living conditions for people with disabilities. In the paper, the approaches to the selection of informative features describing the speech signal are defined: the method of linear prediction and spectral analysis, the structure of a neural network with one feedback is considered, and it is established that the learning of the neural network is carried out by successive presentation of the training sample, with simultaneous adjustment weights according to a specific procedure until the tuning error across the set reaches an acceptably low level. The value of the obtained results lies in the improvement of a new method of speech recognition, which is better adapted to the user's speech, which requires a minimum of resources and the creation of an information system with the possibility of voice control using devices based on various operating systems. An informative cross-platform application with a voice interface was designed based on this approach.

References

1. Dong Yu, Li Deng. Automatic Speech Recognition: A Deep Learning Approach. – L.: Springer-Verlag London, 2015. 320 p.
2. Automatic Speech recognition: short introduction. URL: https://www.esat.kuleuven.be/psi/spraak/demo/Recog/asr_intro.html
3. Автоматичне розпізнавання, розуміння та синтез мовленнєвих сигналів в Україні / Т.К. Вінцюк, М.М. Сажок, Р.А. Селюх, Д.Я. Федорин, О.А. Юхименко, В.В. Робейко. Управляющие системы и машины. 2018. № 6. С. 7–24.
4. Глибовець М. М., Олецький О.В. Штучний інтелект. Київ : «Києво-Могилянська академія», 2002. 364 с.
5. Home Assistant. URL: https://home-assistant.io/
6. Introducing the Web Speech API. URL: https://www.sitepoint.com/introducing-web-speech-api/
7. JavaScript: Web API читання тексту та розпізнавання голосу. URL: https://archakov.im/post/javascriptweb-api-recognition-and-speech-text.html
8. Understand the Smart Home Skill API. URL: https://developer.amazon.com/docs/smarthome/understandthe-smart-home-skill-api.html#how-the-smart-home-skill-api-works
9. annyang! Tutorial. URL: https://github.com/TalAter/annyang
Published
2023-12-18
How to Cite
BezverkhyіO. I., Aleksandrenko, D. O., & Luts, V. Y. (2023). DESIGNING AN INFORMATION SYSTEM WITH THE POSSIBILITY OF VOICE CONTROL. Systems and Technologies, 66(2), 13-20. https://doi.org/10.32782/2521-6643-2023.2-66.2
Section
COMPUTER SCIENCES