Machine learning based prediction of risk of hypertension among people of Tamil Nadu

Authors

DOI:

https://doi.org/10.18203/2394-6040.ijcmph20254454

Keywords:

Hypertension, Machine learning, Random forest, Risk prediction, STEPS survey

Abstract

Hypertension is the one of the leading causes of cardiovascular morbidity and mortality. Early detection of individuals at elevated risk is critical, yet conventional prediction models fail to capture nonlinear and high-dimensional interactions among epidemiological and behavioral determinants. Machine learning (ML) offers new opportunities for accurate, population-level risk stratification. We conducted a secondary analysis of the Tamil Nadu STEPS survey 2020, applying supervised ML algorithms—including boosting, k-nearest neighbours, decision tree, random forest, and support vector machine—to predict hypertension risk. Predictors included demographic, anthropometric, lifestyle, and behavioral variables. Models were implemented in JASP, and their performance was evaluated using accuracy, area under the receiver operating characteristic curve (AUC), precision, recall, and F1 score. Among the algorithms tested, the random forest classifier demonstrated the most balanced performance (accuracy 65.5%, AUC 0.708, precision 64.2%, recall 65.5%, F1 score 64.7%). Feature importance analysis identified age as the strongest predictor, followed by waist circumference, while diet and physical activity contributed minimally. The confusion matrix confirmed the model’s balanced sensitivity and specificity, reducing both false negatives and false positives. This study highlights the potential of machine learning, particularly random forest models, for hypertension risk prediction in Indian populations. By leveraging routinely collected survey data, ML can enable scalable, non-invasive screening and inform targeted public health interventions. Integration of richer clinical and genetic features and ensemble methods may further improve predictive accuracy.

Metrics

Metrics Loading ...

References

World Health Organisation. Hypertension-Fact Sheets. 2025. Available at: https://www.who.int/ news-room/fact-sheets/detail/hypertension. Accessed on 12 October 2025.

Silva GFS, Fagundes TP, Teixeira BC, Chiavegatto Filho ADP. Machine Learning for Hypertension Prediction: a Systematic Review. Curr Hypertens Rep. 2022;24(11):523-33. DOI: https://doi.org/10.1007/s11906-022-01212-6

AlKaabi LA, Ahmed LS, Al Attiyah MF, Abdel-Rahman ME. Predicting hypertension using machine learning: Findings from Qatar Biobank Study. Shimosawa T, editor. PLoS One. 2020;15(10):e0240370. DOI: https://doi.org/10.1371/journal.pone.0240370

Ji W, Zhang Y, Cheng Y, Wang Y, Zhou Y. Development and validation of prediction models for hypertension risks: A cross-sectional study based on 4,287,407 participants. Front Cardiovasc Med. 2022;9:928948. DOI: https://doi.org/10.3389/fcvm.2022.928948

Sifat IK, Kibria MdK. Optimizing hypertension prediction using ensemble learning approaches. Popovic T, editor. PLoS One. 2024;19(12):e0315865. DOI: https://doi.org/10.1371/journal.pone.0315865

Hwang SH, Lee H, Lee JH, Lee M, Koyanagi A, Smith L, et al. Machine Learning–Based Prediction for Incident Hypertension Based on Regular Health Checkup Data: Derivation and Validation in 2 Independent Nationwide Cohorts in South Korea and Japan. J Med Internet Res. 2024;26:e52794. DOI: https://doi.org/10.2196/52794

Islam SMS, Talukder A, Awal MA, Siddiqui MMU, Ahamad MM, Ahammed B, et al. Machine Learning Approaches for Predicting Hypertension and Its Associated Factors Using Population-Level Data From Three South Asian Countries. Front Cardiovasc Med. 2022;9:839379. DOI: https://doi.org/10.3389/fcvm.2022.839379

Chowdhury MZI, Naeem I, Quan H, Leung AA, Sikdar KC, O’Beirne M, et al. Prediction of hypertension using traditional regression and machine learning models: A systematic review and meta-analysis. Palazón-Bru A, editor. PLoS One. 2022;17(4):e0266334. DOI: https://doi.org/10.1371/journal.pone.0266334

Selvavinayagam TS, Viswanathan V, Ramalingam A, Kangusamy B, Joseph B, Subramaniam S, et al. Prevalence of Noncommunicable Disease (NCDs) risk factors in Tamil Nadu: Tamil Nadu STEPS Survey (TN STEPS), 2020. PLoS One. 2024;19(5):e0298340. DOI: https://doi.org/10.1371/journal.pone.0298340

Peng Y, Xu J, Ma L, Wang J. Prediction of Hypertension Risks with Feature Selection and XGboost. J Mech Med Biol. 2021;21(05):2140028. DOI: https://doi.org/10.1142/S0219519421400285

Downloads

Published

2025-12-31

How to Cite

Manivannan, T., C., J. P., & V., D. (2025). Machine learning based prediction of risk of hypertension among people of Tamil Nadu. International Journal Of Community Medicine And Public Health, 13(1), 402–405. https://doi.org/10.18203/2394-6040.ijcmph20254454

Issue

Section

Systematic Reviews