Nurul Wahidah Arshad scite author profile

This paper implements and compares the performance of a number of techniques proposed for improving the accuracy of Automatic Speech Recognition (ASR) systems. As ASR that uses only speech can be contaminated by environmental noise, in some applications it may be improve performance to employ Audio-Visual Speech Recognition (AVSR), in which recognition uses both audio information and mouth movements obtained from a video recording of the speaker's face region. In this paper, model validation techniques, namely the holdout method, leave-one-out cross validation and bootstrap validation, are implemented to validate the performance of an AVSR system as well as to provide a comparison of the performance of the validation techniques themselves. A new speech data corpus is used, namely the Loughborough University Audio-Visual (LUNA-V) dataset that contains 10 speakers with five sets of samples uttered by each speaker. The database is divided into training and testing sets and processed in manners suitable for the validation techniques under investigation. The performance is evaluated using a range of different signal-to-noise ratio values using a variety of noise types obtained from the NOISEX-92 dataset.

show abstract

Contrast modification for pre-enhancement process in multi-contrast rubeosis iridis images

Karim

Arshad

Wahab

2023

TELKOMNIKA

View full text Add to dashboard Cite

Existing researchers for rubeosis iridis disease focused on image enhancement as a collective group without considering the multi-contrast of the images. In this paper, the pre-enhancement process was proposed to improve the quality of iris images for rubeosis iridis disease by separating the image into three groups; low, medium and high contrast. Increment, decrement and maintenance of the images' original contrast were further operated by noise reduction and multi-contrast manipulation to attain the best contrast value in each category for increased compatibility prior subsequent enhancement. As a result, this study proved that there have three rules for the contrast modification method. Firstly, the histogram equalization (HE) filter and increasing the image contrast by 50% will achieve the optimum value for the low contrast category. Experimental revealed that HE filters successfully increase the luminance value before undergoing the contrast modification method. Secondly, reducing the 50% of the image contrast to achieve the optimum value for the high contrast category. Finally, the image contrast was maintained for the middle contrast category to optimise contrast. The mean square error (MSE) and peak signal-to-noise ratio (PSNR) of the outputs were then calculated, yielding an average of 18.25 and 28.87, respectively.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Nurul Wahidah Arshad

Unclean hand detection machine using vision sensor

EMG Activity of Leg Muscles with Knee Pain during Islamic Prayer (Salat)

Speech processing for makhraj recognition

A Comparison of Model Validation Techniques for Audio-Visual Speech Recognition

Contrast modification for pre-enhancement process in multi-contrast rubeosis iridis images

Contact Info

Product

Resources

About