Audio-to-Text Urdu Chatbot using Deep Learning Algorithms RNN and wav2vec2

Authors

  • Areeba Khalid The Islamia University of Bahawalpur, Bahawalpur, 63100, Pakistan.
  • Malik Daler Ali Awan The Islamia University of Bahawalpur, Bahawalpur, 63100, Pakistan.
  • Nadeem Iqbal Kajla MNS University of Agriculture, Multan, 60000, Pakistan.
  • Amnah Firdous The Government Sadiq College Women University Bahawalpur, 63100, Pakistan.
  • Hafiz Muhammad Sanaullah Badar MNS University of Agriculture, Multan, 60000, Pakistan.
  • Malik Muhammad Saad Missen The Islamia University of Bahawalpur, Bahawalpur, 63100, Pakistan.

Keywords:

Natural Language process (nlp), Urdu question answer dataset (Uquad), Wav2vec, Automated Speech Recognition(asr)

Abstract

Advancement in technology limited the distances via communication. People globally exchange thoughts in different languages using many ways like text, audio, pictures, and videos to express their ideas. Among many languages Urdu language has more than 100 million people around the world. It is necessities the development of smart applications to facilitate Urdu language users that can communicate via audio instead of only text. A conversation bot system enables individuals and computers to communicate using natural language. Numerous Chabot’s have been developed in English, German, Korean, Spanish, and Chinese languages. Because of the significant language barrier, those who do not speak English, German, Korea, or Spanish well cannot use these chatbots. In this research work we developed a smart chatbot system that can take voice as input for Urdu language using RNN a deep learning model. The proposed system is developed using two datasets UQuaD and custom dataset. A pretrained model is used to convert Urdu audio to text named as “wav2vec2-large-xls-r-300m-Urdu”. The proposed system on UQuaD and custom achieved an accuracy of 68.30% on the UQuaD dataset and 89.6% on the custom dataset.

Downloads

Published

2024-04-01

How to Cite

Areeba Khalid, Malik Daler Ali Awan, Nadeem Iqbal Kajla, Amnah Firdous, Hafiz Muhammad Sanaullah Badar, & Malik Muhammad Saad Missen. (2024). Audio-to-Text Urdu Chatbot using Deep Learning Algorithms RNN and wav2vec2. Journal of Computing & Biomedical Informatics. Retrieved from https://jcbi.org/index.php/Main/article/view/420