Voice-Based Gender Identification Using Machine Learning
Keywords:
Gender Classification, Machine Learning, Audio Signal Processing, Automatic Gender Identification (AGI), Feature Extraction, TensorFlow, Classification Accuracy, Voice-Based Systems, Speech ApplicationsAbstract
Automatic gender classification (AGC) based on voice signals plays a crucial role in biometric authentication, speech analytics, and human-computer interaction. This study proposes a hybrid machine learning framework that integrates Mel-Frequency Cepstral Coefficients (MFCC) for feature extraction, Principal Component Analysis (PCA) for dimensionality reduction, and a Convolutional Neural Network (CNN) for classification. The model was trained on a curated dataset of 3,497 Urdu-language voice samples collected from publicly available YouTube recordings and processed for gender classification tasks, encompassing speakers of varying genders and dialects. Addressing limitations in prior approaches, the proposed method combines traditional spectral features with deep learning techniques to enhance classification performance. The system achieved an accuracy of 98.4%, along with strong precision, recall, and F1-score metrics, outperforming baseline models such as Support Vector Machines (SVM) and k-Nearest Neighbours (KNN). These findings support the model’s applicability in real-world use cases, including virtual assistants, automated call routing, and emotion-aware computing systems.
Downloads
Published
How to Cite
Issue
Section
License
This is an open Access Article published by Research Center of Computing & Biomedical Informatics (RCBI), Lahore, Pakistan under CCBY 4.0 International License