Innovative Online Diagnostic Assessment for Evaluating Science Learning Readiness in the Merdeka Curriculum Framework
DOI:
https://doi.org/10.51454/jet.v7i1.546Keywords:
Online Diagnostic Assessment, Merdeka Curriculum, 4DAbstract
The main objective of this study was to develop an Online Diagnostic Assessment (ODA) that can effectively measure junior high school students’ readiness to learn science within the framework of the Merdeka Curriculum. Unlike previous diagnostic tools that are mostly offline or limited in scope, this ODA provides an innovative, technology-based solution that identifies student readiness and misconceptions while offering timely feedback to support differentiated learning. Using the 4-D development model, the research includes the Define, Design, Develop, and Disseminate stages. At the Define stage, the instrument grids were identified according to the curriculum. In the Design stage, the assessment prototype was developed using Google Form. The Develop stage involved validation by experts and field tests in two junior high schools, resulting in a valid and reliable instrument, with an Aiken's V value above 0.80, Cronbach's Alpha 0.736, and McDonald's Omega 0.768. The results of the EFA analysis showed all items had Measures of Sampling Adequacy (MSA) of more than 0.5. The distribution of respondents' scores shows that the instrument is effective in differentiating students' abilities. Thus, the developed ODA is valid, reliable, and ready to be used to assess students' readiness in science learning.
Downloads
References
Aiken’s, L. R. (1980). Content Validity and Reliability of Single Items or Questionnaires. Educational and Psychological Measurement, 40(4), 955–959. https://doi.org/10.1177/001316448004000419
Ardi Waluyo, A., & dan Sulhadi, H. (2020). Analisis Faktor: Validitas Konstruk Instrumen Penilaian Keterampilan Berpikir Kritis. In Jurnal Pendidikan Fisika Tadulako Online (JPFT) (Vol. 8, Issue 1).
Asril, A., & Marais, I. (2011). Applying a Rasch Model Distractor Analysis Implications For Teaching And Learning. Sense Publishers.
Benjamin, A. (2024). Studying the student’s perceptions of engagement and problem-solving skills for academic achievement in chemistry at the higher secondary level. Education and Information Technologies, 29(7), 8347–8368.
bin Rani, N., Baharudin, H., & Isa Hamzah, M. (2021). Validation of Existing Arabic Language Reading Knowledge Instrument (SPSAMBA): Exploratory Factor Analysis (EFA). In PSYCHOLOGY AND EDUCATION (Vol. 58, Issue 1). www.psychologyandeducation.net
Draude, B., & Brace, S. (1999). Assessing the Impact of Technology on Teaching and Learning: Student Perspectives Introduction Method Survey Results and Major Findings Conclusion Contact. Eric. http://www.mtsu.eduk-itconf/proceed99/brace.html
Ghory, S., & Ghafory, H. (2021). The impact of modern technology in the teaching and learning process. International Journal of Innovative Research and Scientific Studies, 4(3), 168–173. https://doi.org/10.53894/ijirss.v4i3.73
Green, B. F., Crone, C. R., & Folk, V. G. (1989). A Method for Studying Differential Distractor Functioning. In Journal of Educational Measurement Summer (Vol. 26, Issue 2).
Kalyani, L. K. (2024). The role of technology in education: Enhancing learning outcomes and 21st century skills. International Journal of Scientific Research in Modern Science and Technology, 3(4), 5–10.
Leitão, G., Colonna, J., Monteiro, E., Oliveira, E., & Barreto, R. (2020). New Metrics for Learning Evaluation in Digital Education Platforms. http://arxiv.org/abs/2006.14711
Liu, X. S. (2014). A note on statistical power in multi-site randomized trials with multiple treatments at each site. British Journal of Mathematical and Statistical Psychology, 67(2), 231–247. https://doi.org/10.1111/bmsp.12016
Pande, S. S., Pande, S. R., Parate, V. R., Nikam, A. P., & Agrekar, S. H. (2013). Correlation between difficulty & discrimination indices of MCQs in formative exam in Physiology. In Asian Journal of Medical Education (Vol. 7, Issue 1).
Ramly, S. N. F., Ahmad, N. J., & Yakob, N. (2022). Development, validity, and reliability of chemistry scientific creativity test for pre-university students. International Journal of Science Education, 44(14), 1–16. https://doi.org/10.1080/09500693.2022.2116298
Rapaka, A., Dharmadhikari, S. C., Kasat, K., Mohan, C. R., Chouhan, K., & Gupta, M. (2025). Revolutionizing learning− A journey into educational games with immersive and AI technologies. Entertainment Computing, 52, 100809.
Retnawati, H. (2014). Teori Respon Butir dan Penerapanya. In Yogyakarta: Nuha Medika. Nuha Medika.
Rosenbaum, P. R. (1985). Comparing distributions of item responses for two groups. British Journal of Mathematical and Statistical Psychology, 38(2), 206–215. https://doi.org/10.1111/j.2044-8317.1985.tb00836.x
Salinas-Navarro, D. E., Vilalta-Perdomo, E., Michel-Villarreal, R., & Montesinos, L. (2024). Designing experiential learning activities with generative artificial intelligence tools for authentic assessment. Interactive Technology and Smart Education.
Sarip, M., Amintarti, S., Utami, N. H., Biologi, S. P., Lambung, U., Brigjen, M. J., Brigjend, J., Basri, H., Utara, K. B., Banjarmasin, K., & Selatan, K. (2022). Validitas Dan Keterbacaan Media Ajar E-Booklet Untuk Siswa SMA/MA Materi Keanekaragaman Hayati (Vol. 1, Issue 1).
Singh Rana, S. (2014). Test Item Analysis and Relationship Between Difficulty Level and Discrimination Index of Test Items in an Achievement Test in Biology Education.
Sudarsana, I. K., Nakayanti, A. R., Sapta, A., Haimah, Satria, E., Saddhono, K., Achmad Daengs, G. S., Putut, E., Helda, T., & Mursalin, M. (2019). Technology Application in Education and Learning Process. Journal of Physics: Conference Series, 1363(1). https://doi.org/10.1088/1742-6596/1363/1/012061
Suharsono, Y., & Istiqomah, I. (2014). Validitas Dan Reliabilitas Skala Self-Efficacy. Jurnal Ilmiah Psikologi Terapan, 02(01), 144–151.
Sumin, Sukmawati, F., Setiawati, F. A., & Asmawi, S. (2022). The Impact of Z-Score Transformation Scaling on the Validity, Reliability, and Measurement Error of Instrument SATS-36. Jurnal Pengukuran Psikologi Dan Pendidikan Indonesia, 11(2), 166–180. https://doi.org/10.15408/jp3i.v11i2.26591
Sutopo, Y dan Slamet, A. (2017). Statistika Inferensial. ANDI.
Walker, A. A., Jennings, J. K., & Engelhard, G. (2018). Using person response functions to investigate areas of person misfit related to item characteristics. Educational Assessment, 23(1), 47–68. https://doi.org/10.1080/10627197.2017.1415143
Wladis, C., & Samuels, J. (2016). Do online readiness surveys do what they claim? Validity, reliability, and subsequent student enrollment decisions. Computers and Education, 98, 39–56. https://doi.org/10.1016/j.compedu.2016.03.001
Zhang, L., & Xu, J. (2025). The paradox of self-efficacy and technological dependence: Unraveling generative AI’s impact on university students’ task completion. The Internet and Higher Education, 65, 100978.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Widowati Pusporini, Dadan Rosana, Didik Setyawarno, Eko Widodo, Maryati Maryati

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

