Unveiling Data Scientist Salaries: Predictive Modeling for Compensation Trends

Authors

  • Muhammad Taha Faculty of Computer Science and Information Technology, The Superior University, Lahore 54600, Pakistan.
  • Tayyaba Farhat Faculty of Computer Science and Information Technology, The Superior University, Lahore 54600, Pakistan.
  • Arsham Azam Faculty of Computer Science and Information Technology, The Superior University, Lahore 54600, Pakistan.
  • Muhammad Ahmar Faculty of Computer Science and Information Technology, The Superior University, Lahore 54600, Pakistan.
  • Syed Jalal Abbas The Institute of Certified Public Accountants of Pakistan (ICPAP), CPA-Pakistan, Lahore 54600, Pakistan
  • Muhammad Umar Habib Suleman Dawood School of Business (SDSB), LUMS, Lahore 54600, Pakistan

Keywords:

Machine Learning, Data Science, Data Analytics, Classification Model, Predictive Analysis, Ensemble Learning

Abstract

The fast pace of artificial intelligence growth with big data has rendered data science as one of the most in-demand jobs in the world. Data scientists' remuneration structures, though, demonstrate significant heterogeneity by region, industry, and experience, thereby making career advancement difficult for both new and old entrants. Current studies tend to be based on small data samples or basic statistical techniques, hence neglecting the intricacies of determining the determinants of salaries. This study aims to utilize advanced machine learning techniques, including decision trees, ensemble techniques, and eXtreme Gradient Boosting (XGBoost), to build an inferential model of classifying and estimating data science salaries using important determinants such as experience, location, firm size, and job. The model suggested in this study achieves accuracy of 92.3% according to a Random Forest algorithm, which is higher compared to conventional regression-based techniques. Feature importance analysis reveals that experience accounts for 45.7% of salary variation, followed by firm size (22.8%) and location (18.6%). By being data-driven, this research gives practical suggestions to job seekers, organizations, and policymakers, hence allowing them to make informed workforce planning, salary negotiation, and talent acquisition decisions. The study contributes to the body of knowledge by improving the precision of salary classifications, determinants identification, and the usefulness of predictive analytics in labor market trend analysis.

Downloads

Published

2025-03-01

How to Cite

Muhammad Taha, Tayyaba Farhat, Arsham Azam, Muhammad Ahmar, Syed Jalal Abbas, & Muhammad Umar Habib. (2025). Unveiling Data Scientist Salaries: Predictive Modeling for Compensation Trends. Journal of Computing & Biomedical Informatics, 8(02). Retrieved from https://jcbi.org/index.php/Main/article/view/882