Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A Digital Personal Assistant using Bangla Voice Command Recognition and Face Detection
Department of Computer Science and Engineering, Port City International University, Chattogram, Bangladesh.
Department of Computer Science and Engineering, University of Chittagong, Chattogram, Bangladesh.
University of Chittagong, Bangladesh.ORCID iD: 0000-0002-7473-8185
Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Computer Science. (Pervasive Mobile Computing)ORCID iD: 0000-0003-0244-3561
Show others and affiliations
2019 (English)In: Proceedings of IEEE International Conference on Robotics, Automation, Artificial-Intelligence and Internet-of-Things 2019, 2019Conference paper, Published paper (Refereed)
Abstract [en]

Though speech recognition has been a common interest of researchers over the last couple of decades, but very few works have been done on Bangla voice recognition. In this research, we developed a digital personal assistant for handicapped people which recognizes continuous Bangla voice commands. We employed the cross-correlation technique which compares the energy of Bangla voice commands with prerecorded reference signals. After recognizing a Bangla command, it executes a task specified by that command. Mouse cursor can also be controlled using the facial movement of a user. We validated our model in three different environments (noisy, moderate and noiseless) so that the model can act naturally. We also compared our proposed model with a combined model of MFCC & DTW, and another model which combines crosscorrelation with LPC. Results indicate that the proposed model achieves a huge accuracy and smaller response time comparing to the other two techniques.

Place, publisher, year, edition, pages
2019.
Keywords [en]
Bangla voice recognition, Speech recognition, Face detection, Personal assistance, Handicapped people
National Category
Computer Sciences
Research subject
Pervasive Mobile Computing
Identifiers
URN: urn:nbn:se:ltu:diva-76791OAI: oai:DiVA.org:ltu-76791DiVA, id: diva2:1371708
Conference
IEEE International Conference on Robotics, Automation, Artificial-Intelligence and Internet-of-Things 2019
Available from: 2019-11-20 Created: 2019-11-20 Last updated: 2019-12-06

Open Access in DiVA

fulltext(327 kB)33 downloads
File information
File name FULLTEXT01.pdfFile size 327 kBChecksum SHA-512
6135d71f9082d232ef925dcde36dea1fb77ff15b7bdeb4f443cbfb071496ec95b0d6da494ba368aff7f6fc357b80695a37480a6ede7ba25b694ed6b409a5e380
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Hossain, Mohammad ShahadatAndersson, Karl
By organisation
Computer Science
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 33 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 143 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf