A systematic approach to the development and validation process of a research tool: an overview
DOI:
https://doi.org/10.18203/2394-6040.ijcmph20260329Keywords:
Tool development, Validation, Item generation, Factor analysis, Validity, ReliabilityAbstract
Research tool development and validation process is enigmatic and opaque due to the diverse range of techniques it requires. As a result, the main goal of this article is to provide an overview of the research tool development process as simply as possible in order to support the development of new, valid, and reliable research tools, as well as improvement of existing ones. We accomplish this work by presenting all of the required steps in the right sequence to develop a research tool. In light of our search, we suggest five phases of tool development and validation, each with its own sub-steps. The first step involves preparing the preliminary draft, which includes conceptualizing the construct, evaluating current practices, conducting focus group discussions, and creating the item pool. The second phase includes the tool draft's validation with internal and external review, as well as the tool's pre-testing. Phase three includes field testing of the finalized tool draft, sampling planning, and data collection. The fourth phase includes the analysis of research tool data with item analysis, exploratory factor analysis, internal consistency, and test-retest reliability. The fifth phase is all about scale interpretation using percentiles, standard scores, norms, and cut-off points. Additionally, we briefly discussed the key best practices indicated in each step. This review will help both scientists and practitioners to understand all the steps and methodologies of research tool development and validation.
Metrics
References
Andersen RD, Jylli L, Ambuel B. Cultural adaptation of patient and observational outcome measures: a methodological example using the COMFORT behavioral rating scale. Int J Nurs Stud. 2014;51(6):934-42. DOI: https://doi.org/10.1016/j.ijnurstu.2013.10.006
Anthony M, Yastik J, MacDonald DA, Marshall KA. Development and validation of a tool to measure incivility in clinical nursing education. J Prof Nurs. 2014;30(1):48-55. DOI: https://doi.org/10.1016/j.profnurs.2012.12.011
Barker C, Pistrang N, Elliott R. Research Methods in Clinical Psychology: An Introduction for Students and Practitioners. 2016. Available at: https://www.al-edu.com/wp-content/uploads/2014/05/Barker-et-al-Research-Methods-in-Clinical-Psychology.pdf. Accessed on 20 October 2025.
Beck CT, Gable RK. Postpartum Depression Screening Scale: Development and Psychometric Testing. Nurs Res. 2000;49(5):272-82. DOI: https://doi.org/10.1097/00006199-200009000-00006
Beck CT, Gable RK. Further validation of the Postpartum Depression Screening Scale. Nurs Res. 2001;50(3):155-64. DOI: https://doi.org/10.1097/00006199-200105000-00005
Choe K. Development and preliminary testing of the Schizophrenia Hope Scale, a brief scale to measure hope in people with schizophrenia. Int J Nurs Stud. 2014;51(6):927-33. DOI: https://doi.org/10.1016/j.ijnurstu.2013.10.018
DeVellis RF. Scale Development: Theory and Applications. Thousand Oaks, CA: Sage. 2017;4.
Ferguson E, Cox T. Exploratory factor analysis: a user’s guide. Int J Selection Asses. 1993;1:84-94. DOI: https://doi.org/10.1111/j.1468-2389.1993.tb00092.x
Ferketich S. Focus on psychometrics. Aspects of item analysis. Res Nurs Health. 1991;14(2):165-8. DOI: https://doi.org/10.1002/nur.4770140211
Furr RM. Scale Construction and Psychometrics for Social and Personality Psychology. New Delhi, IN: Sage Publications. 2011. DOI: https://doi.org/10.4135/9781446287866
Gaugler JE, Hobday JV, Savik K. The CARES(®) Observational Tool: a valid and reliable instrument to assess person-centered dementia care. Geriatr Nurs. 2013;34(3):194-8. DOI: https://doi.org/10.1016/j.gerinurse.2013.01.002
Grassley JS, Spencer BS, Bryson D. The development and psychometric testing of the Supportive Needs of Adolescents Breastfeeding Scale. J Adv Nurs. 2013;69(3):708-16. DOI: https://doi.org/10.1037/t31127-000
Hinkin TR. A Review of Scale Development Practices in the Study of Organizations. J Management. 1995;21(5):967-88. DOI: https://doi.org/10.1177/014920639502100509
Hilton A, Skrutkowski M. Translating instruments into other languages: development and testing processes. Cancer Nurs. 2002;25(1):1-7. DOI: https://doi.org/10.1097/00002820-200202000-00001
Karaçam Z, Kitiş Y. The Postpartum Depression Screening Scale: its reliability and validity for the Turkish population. Turk Psikiyatri Derg. 2008;19(2):187-96.
Konrath S, Meier BP, Bushman BJ. Development and Validation of the Single Item Trait Empathy Scale (SITES). J Res Personality. 2017;73:111-22. DOI: https://doi.org/10.1016/j.jrp.2017.11.009
Kumar A. Review of the Steps for Development of Quantitative Research Tools. J Adv Pract Nurs. 2015;1:103.
Kyriazos T, Stalikas A. Applied Psychometrics: The Steps of Scale Development and Standardization Process. Psychology. 2018;9:2531-60. DOI: https://doi.org/10.4236/psych.2018.911145
Kyriazos TA. Applied Psychometrics: The 3-Faced Construct Validation Method, a Routine for Evaluating a Factor Structure. Psychology. 2018;9:2044-72. DOI: https://doi.org/10.4236/psych.2018.98117
Kyriazos TA. Applied Psychometrics: Sample Size and Sample Power Considerations in Factor Analysis (EFA, CFA) and SEM in General. Psychology. 2018;9:2207-30. DOI: https://doi.org/10.4236/psych.2018.98126
Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol. 2010;63(7):737-45. DOI: https://doi.org/10.1016/j.jclinepi.2010.02.006
Morrison KM, Embretson S. Item Generation. Wiley. 2018. DOI: https://doi.org/10.1002/9781118489772.ch3
Navabi N, Ghaffari F, Shamsalinia A, Faghani S. Development and validation of evaluation tools of nursing students’ clinical pharmacology unit. Drug Healthc Patient Saf. 2016;8:101-9. DOI: https://doi.org/10.2147/DHPS.S110774
Netemeyer RG, Bearden WO, Sharma S. Scaling Procedures: Issues and Applications. Thousand Oaks, CA. Sage Publications. 2003. DOI: https://doi.org/10.4135/9781412985772
Oppenheim AN. Questionnaire Design, Interviewing and Attitude Measurement. Pinter, London. 1992. Available at: https://www.scirp.org/reference/ referencespapers?referenceid=435162. Accessed on 20 October 2025.
Peng Q, Wu W. Development and validation of oral chemotherapy self-management scale. BMC Cancer. 2020;20(1):890. DOI: https://doi.org/10.1186/s12885-020-07404-0
Pinto MD, Hickman R, Logsdon MC, Burant C. Psychometric evaluation of the revised attribution questionnaire (r-AQ) to measure mental illness stigma in adolescents. J Nurs Meas. 2012;20(1):47-58. DOI: https://doi.org/10.1891/1061-3749.20.1.47
Polit DF. Getting serious about test-retest reliability: a critique of retest research and some recommendations. Qual Life Res. 2014;23(6):1713-20. DOI: https://doi.org/10.1007/s11136-014-0632-9
Polit DF, Beck C, Owen S. Is the CVI an acceptable indicator of content validity? Appraisal and recommendations. Res Nurs Health. 2007;30(4):459-67. DOI: https://doi.org/10.1002/nur.20199
Polit DF, Beck CT. Nursing Research: Generating and Assessing Evidence for Nursing Practice. Wolters Kluwer. 2017;10:331-54.
Prinsen CAC, Mokkink LB, Bouter LM, Alonso J, Patrick DL, de Vet HCW et al. COSMIN guideline for systematic reviews of patient-reported outcome measures. Qual Life Res. 2018;27(5):1147-57. DOI: https://doi.org/10.1007/s11136-018-1798-3
Rattray J, Jones MC. Essential elements of questionnaire design and development. J Clin Nurs. 2007;16:234-43. DOI: https://doi.org/10.1111/j.1365-2702.2006.01573.x
Rowe H, Sperlich M, Cameron H, Seng J. A Quasi-experimental outcomes analysis of a psychoeducation intervention for pregnant women with abuse-related posttraumatic stress. J Obstet Gynecol Neonatal Nurs. 2014;43(3):282-93. DOI: https://doi.org/10.1111/1552-6909.12312
Ruiz RJ, Gennaro S, O'Connor C, Marti CN, Lulloff A, Keshinover T et al. Measuring coping in pregnant minority women. West J Nurs Res. 2015;37(2):257-75. DOI: https://doi.org/10.1177/0193945914527176
Schriesheim, CA, Eisenbach RJ. An exploratory and confirmatory factor analytic investigation of item wording effects on the obtained factor structures of survey questionnaire methods. J Manag. 1995;21:1177-93. DOI: https://doi.org/10.1177/014920639502100609