Applied Mathematics & Information Sciences

Mouth Segmentation Using Coordinate-Based Method for the Improvement of Visual Speech Recognition

P. Sujatha, Department of Computer Science and Engineering, Sudharsan Engineering College, Tamilnadu, IndiaFollow
M. Radhakrishnan, Department of Civil Engineering, Sethu Institute of Technology, Tamilnadu, IndiaFollow

Author Country (or Countries)

India

Abstract

Visual Speech Recognition (VSR) is a process of understanding speech by interpreting visual information of speakers lip movement. Efficient and accurate mouth detection is an essential step in the field of speech recognition using visual-only signals. This research paper proposes a novel approach using Coordinate Based Super-pixel Segmentation algorithm (CBSS) to improve the accuracy of mouth segmentation. The proposed CBSS algorithm is able to robustly segment the mouth region that belongs to a given mouth shape. For the extracted mouth region, Discrete Cosine Transform (DCT) is applied to segregate the crucial features. Then the visual lip features are trained using Support Vector Machine (SVM) to recognize the speech. Experiments are conducted on in-house database with normal hearing persons and hearing impaired persons and also on publically available CUAVE databases. The results from the studies indicate that the proposed CBSS algorithm drastically improves the mouth detection accuracy compared to the existing techniques. This leads to significant improvement in recognition rate for identifying the isolated words.

Digital Object Identifier (DOI)

http://dx.doi.org/10.18576/amis/120424

Recommended Citation

Sujatha, P. and Radhakrishnan, M. (2018) "Mouth Segmentation Using Coordinate-Based Method for the Improvement of Visual Speech Recognition," Applied Mathematics & Information Sciences: Vol. 12: Iss. 4, Article 24.
DOI: http://dx.doi.org/10.18576/amis/120424
Available at: https://digitalcommons.aaru.edu.jo/amis/vol12/iss4/24

Download

COinS

Applied Mathematics & Information Sciences

Mouth Segmentation Using Coordinate-Based Method for the Improvement of Visual Speech Recognition

Authors

Author Country (or Countries)

Abstract

Digital Object Identifier (DOI)

Recommended Citation

Share

Search