Machine Learning Dysphonia Detection System

Information Technology (IT) and Software

Prototype

Reg. ID : 17519

603-89212020 Comments

Description

Dysphonia refers to a voice disorder that affects the vocal quality and production, making it crucial to develop reliable and accurate detection systems. Since dysphonia can have a significant impact on an individual's daily life and communication abilities, early detection and intervention are crucial for effective treatment and management of this condition. In this innovative system, we designed and evaluated the performance of Mel-Frequency Cepstral Coefficient (MFCC) with combination of various methods to identify dysphonic voices accurately. The motivation behind designing a new innovative Dysphonia Detection System stems from the pressing need for accurate and reliable dysphonia detection tools. The evaluation encompasses various aspects, such as classification accuracy, robustness against noise and variations in speech quality, computational efficiency, and generalization capabilities. The new innovative system is capable of detecting early voice disorder problems and determines the treatment that should be taken by the particular person.

Highlights

Voice disorder detection has gained considerable attention in the field of machine learning research. While previous studies have primarily focused on distinguishing between healthy and pathological voices, only a few have explored the classification of multiple voice disorders. In this innovative approach, a hierarchical Support Vector Machine (SVM) with additional features is proposed to accurately identify non-organic voice disorders. To overcome challenges and mitigate the risks of overfitting, the researchers employed k-fold cross-validation and spectral parameters in their methodology. This ensured robustness and reliability in the classification process. The use of k-fold cross-validation allowed for comprehensive model evaluation by partitioning the dataset into multiple subsets, training the model on a subset, and testing it on the remaining subsets. By repeating this process multiple times, the performance of the classifier could be effectively assessed. One of the key elements in the proposed method is the utilization of Mel-Frequency Cepstral Coefficient (MFCC) as a feature for voice analysis. MFCC is a representation of the short-term power spectrum of a sound, which has been widely used in speech and audio processing applications. It captures essential characteristics of the voice, such as timbre and pitch, by analyzing the spectral content of the sound. The experimental results of this innovative approach were impressive. The accuracy achieved for distinguishing between healthy voices and hypo functional dysphonia reached an impressive 98.98%. Furthermore, the classification accuracy for distinguishing between functional dysphonia, psychogenic dysphonia, dysodie, and dysphonie ranged from a minimum of 70.73% to higher values. The hierarchical SVM employed in this approach is a variant of the standard SVM classifier that allows for a more structured decision-making process. It hierarchically organizes the classes of voice disorders, enabling a step-by-step classification approach. This hierarchical structure helps reduce complexity and improve the accuracy of the classification task. By addressing the challenges associated with non-organic voice disorder classification and effectively mitigating the risks of overfitting, this innovation provides a promising solution for the accurate identification of voice disorders. The use of MFCC as a spectral parameter and the incorporation of hierarchical SVM further enhance the effectiveness of the classification system. The implications of this innovation are significant, as it can contribute to the early detection and diagnosis of non-organic voice disorders. Early identification of these disorders is crucial for timely intervention and treatment, leading to improved patient outcomes. Additionally, the proposed approach opens avenues for further research in the field of voice disorder detection and classification, stimulating the development of more advanced and accurate systems. Therefore, the classification of non-organic voice disorders using Mel-Frequency Cepstral Coefficient (MFCC) and hierarchical SVM demonstrates promising results. By leveraging spectral parameters and addressing challenges associated with classification, this innovation provides a reliable and effective method for detecting and distinguishing between various non-organic voice disorders.

Contact Person/Inventor

Name	Email	Contact Phone
Dr. Woon Hai Sang	manjit@uniten.edu.my	0122017523

MRDCS

Information and Communication Technology (ICT)

Award

Award Title	Award Achievement	Award Year Received
Malaysian Technology Expo	MTE 2023	2023

Video

Additional Document

Attachment	Size
file-1710922830.pdf (566.91 KB)	566.91 KB
file-1710922830.pdf (156.25 KB)	156.25 KB
file-1710922830.pdf (204.6 KB)	204.6 KB
file-1710922830.pdf (648.74 KB)	648.74 KB

Comment

LOG IN or REGISTER to post comments.

Star Rating

Selangor

Organisation Name

Universiti Tenaga Nasional

Putrajaya Campus

Jalan IKRAM-UNITEN

43000

Kajang
603-89212020
https://www.uniten.edu.my/

Machine Learning Dysphonia Detection System

Description

Highlights

Contact Person/Inventor

MRDCS

Award

Video

Additional Document

Tags

Comment

Star Rating

Contact Form

SmartCycle Biofuel

Miniaturise RoVision: A Revolutionised Travelling Aid for Visually Impaired Community (VIC)

SMILED : Smart Machine for Identifying Dental Lesions with Efficient and accurate Detection

HEADSPAN: A MOBILE APPS TO EMPLOYEE WELL-BEING IN AUDIT FIRMS

MSU Assessment Chart for Children with Special Needs

DiRetina: Smartphone Fundus Imaging for Intelligent Detection of Eye Diseases (Diabetic Retinopathy)

Whatsapp

Explore

Recent Posts

Malaysia Technology Expo 2024 spotlights IT and ICT, marks 23rd year with Hong Kong Pavilion debut

Malaysia's Mosti and MTE Unveil Groundbreaking Data-Sharing Initiative to Spur Innovation

MTE 2024 JADI PEMANGKIN INOVASI DAN TEKNOLOGI NEGARA

QR Page URL

	Ministry of Science Technology and Innovation Level 4, Block C5, Complex C, Federal Government Administrative Centre 62662 Putrajaya, Malaysia
	Tel MyGCC: (603) 8000 8000
	Main Line: (603) 8885 8038/8096
	mytechmart[at]mosti.gov.my

Machine Learning Dysphonia Detection System

Machine Learning Dysphonia Detection System

Description

Highlights

Contact Person/Inventor

MRDCS

Award

Video

Additional Document

Tags

Comment

Star Rating

Contact Form

Related Listing

Whatsapp