Un estudio de potencia y tasa de error tipo I del estadístico de razón de verosimilitud en la tri

López Pina, José Antonio; Hidalgo Montesinos, María Dolores

Un estudio de potencia y tasa de error tipo I del estadístico de razón de verosimilitud en la tri

Revista:

Psicothema

ISSN: 0214-9915

Año de publicación: 2001

Volumen: 13

Número: 2

Páginas: 330-334

Tipo: Artículo

DIALNET GOOGLE SCHOLAR Acceso abierto editor

Otras publicaciones en: Psicothema

Resumen

En este estudio tratamos de probar la potencia (y tasa de error tipo I) del estadístico de razón de verosimilitud utilizado en BILOG (v. 3.04) para probar el ajuste de un modelo especificado a un ítem concreto. El modelo seleccionado es el modelo de 1-p. Para ello, utilizamos un tamaño de test fijo (75 ítems), y cuatro tamaños muestrales (100, 250, 500 y 1000). Además, utilizamos tres distribuciones de habilidad: una centrada N(0,1), y las otras dos no centradas: N(-1,1) y N(1,1). Para estudiar la potencia del estadístico de razón de verosimilitud hemos procedido a manipular el parámetro de discriminación en 25 ítems de los 75 que forman el test, de tal suerte que utilizamos seis niveles de discriminación media: (.3, .6, 1.0, 1.2, 1.5 y 1.8). Los resultados apuntan que la potencia del estadístico de razón de verosimilitud es muy baja cuando el tamaño muestral fue de sólo 100 sujetos, aun cuando el parámetro de discriminación fuera .3 ó 1.8. La potencia aumentó con el tamaño muestral apreciablemente, aunque el estadístico fue incapaz de detectar ítems cuyo parámetro de discriminación fue 1.0, 1.2 ó 1.5. Finalmente, la tasa de error tipo I se mantuvo dentro de los niveles nominales, excepto cuando el tamaño muestral fue elevado, tal como era de esperar

Referencias bibliográficas

Andersen, E.B. (1973). A goodness of fit test for the Rasch model. Psychometrika, 38, 123-140.
Barbero, M.I. (1999). Desarrollos recientes de los modelos psicométricos de la teoría de la respuesta a los ítems. Psicothema, 11, 195-210.
Bishop, Y.M.M., Fienberg, S.E. y Holland, P.W. (1975). Discrete multivariate analysis: Theory and practice. Cambridge, MA: The MIT Press.
Bock, R.D. (1972). Estimating ítem parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika, 37, 29-51.
De Ayala, R.J. (1990). Assessing the fit of dichotomous ítem response theory models. Behavior Research Methods, Instruments, & Computers, 22, 80-81.
Hambleton, R.K. (1990). ítem response theory: Introduction and bibliography. Psicothema, 2, 97-107.
Hambleton, R.K. y Cook, L.L. (1983). Robustness of ítem response models and effects of test length and sample size on the precision of ability estimates. En D.J. Weiss (Ed.), New horizons in testing: Latent trait test theory and computerized adaptive testing, p. 31-49. New York: Vancouver.
Hambleton, R.K. y Murray, L. (1983). Some goodness of fit investigations for ítem response models. En R.K. Hambleton (Ed.). Applications of ítem response theory (pp. 71-94). Vancouver: Education Research Institute of British Columbia.
Hambleton, R.K. y Swaminathan, H. (1985). ítem response theory: Princi ples and applications. Boston: Kluwer-Nijhoff.
Hambleton, R.K., Swaminathan, H. y Rogers, H.J. (1991). Fundamentals of ítem response theory. Newbury Park, CA: Sage Publications.
López-Pina, J.A. e Hidalgo, M.D. (1996). Bondad de ajuste y teoría de respuesta a los ítems. En J. Muñiz (Coord.). Psicometría. Madrid: Universitas, p. 643-704.
McKinley, R.L. y Mills, C.N. (1985). A comparison of several goodnessof-fit statistics. Applied Psychological Measurement, 9, 49-57.
Mislevy, R.J. y Bock, R.D. (1990). PC-BILOG 3.04: ítem analysis and test scoring with binary logistic models. Moresville, IN: Scientific Software.
Seong, T. (1990). Sensitivity of marginal maximum likelihood estimation of ítem and ability parameters to the characteristics of the prior ability distributions. Applied Psychological Measurement, 14, 299-312.
Stone, C. (1992). Recovery of marginal maximum likelihood estimates in the two-parameter logistic response model: An evaluation of MULTILOG. Applied Psychological Measurement, 16, 1-16.
Traub, R.E. y Lam, Y.R. (1985). Latent structure and ítem sampling models for testing. Annual Review of Psychology, 36, 19-48.
van den Wollenberg, A.L. (1982a). A simple and effective method to test the dimensionality axiom of the Rasch model. Applied Psychological Measurement, 6, 83-91.
van den Wollenberg, A.L. (1982b). Two new test statistics for the Rasch model. Psychometrika, 47, 123-139.
Waller, M.I. (1981). A procedure for comparing logistic latent trait models. Journal of Educational Measurement, 18, 119-125.
Wright, B.D. y Mead, R.J. (1977). BICAL: Calibrating items and scales with the Rasch model (Research Memorandum No. 23). Chicago IL: University of Chicago, Statistical Laboratory, Department of Education.
Wright, B.D. y Panchapakesan, N.A. (1969). A procedure for sample-free ítem analysis. Educational and Psychological Measurement ,29, 23-37.
Yen, W.M. (1981). Using simulation results to choose a latent trait model. Applied Psychological Measurement, 5, 245-262.

Fuente de los datos: Dialnet