Analyzing sequential categorical data: Individual variation in markov chains

Psychometrika - Tập 55 - Trang 263-275 - 1990
William Gradner1
1Department of Psychology, University of Virginia, Charlottesville

Tóm tắt

Markov chains are probabilistic models for sequences of categorical events, with applications throughout scientific psychology. This paper provides a method for anlayzing data consisting of event sequences and covariate observations. It is assumed that each sequence is a Markov process characterized by a distinct transition probability matrix. The objective is to use the covariate data to explain differences between individuals in the transition probability matrices characterizing their sequential data. The elements of the transition probability matrices are written as functions of a vector of latent variables, with variation in the latent variables explained through a multivariate regression on the covariates. The regression is estimated using the EM algorithm, and requires the numerical calculation of a multivariate integral. An example using simulated cognitive developmental data is presented, which shows that the estimation of individual variation in the parameters of a probability model may have substantial theoretical importance, even when individual differences are not the focus of the investigator's concerns.

Tài liệu tham khảo

Anderson, T., & Goodman, L. (1957). Statistical inference about Markov chains.Annals of Mathematical Statistics, 28, 89–110. Atkinson, A. C. (1985).Plots, transformations, and regression: An introduction to graphical methods of diagnositc regression analysis. New York: Oxford University Press. Bakeman, R., & Gottman, J. (1986).Observing Interaction. Cambridge: Cambridge University Press. Bentler, P., & Lee, S. (1975). Some extensions of matrix calculus.General Systems, 20, 145–150. Bertler, P., & Tanaka, J. (1983). Problems with the EM algorithm for ML factor analysis.Psychometrika, 48, 247–252. Brainerd, C., Howe, M., & Desrochers, A. (1982). The general theory of two-stage learning: A mathematical review with illustrations from memory development.Psychological Bulletin, 91, 634–665. Budescu, D. (1987). A Markov model for generation of random binary sequences.Journal of Experimental Psychology: Human Perception and Performance, 13, 25–39. Chi, M., Feltovich, P., & Glaser, R. (1981). Categorization and representation of physics problems by novices.Cognitive Science, 5, 121–152. Çinlar, E. (1975).Introduction to stochastic processes. Englewood Cliffs, NJ: Prentice Hall. Dempster, A., Laird, N., & Rubin, D. (1977). Maximum likelihood from incomplete data via the EM algorithm with discussion.Journal of the Royal Statistical Society, Series B,39, 1–38. Edlefsen, E., & Jones, S. (1986).GAUSS programming language manual. Kent, WA: Aptech Systems. Gardner, W. (1989).Analysis of individual variation in Markov chains (Tech. Rep. No. 87-8). Charlottesville: University of Virginia, Department of Psychology. Gardner, W., & Hartmann, D. (1984). Markov analysis of social interaction.Behavioral Assessment, 6, 229–236. Gottman, J. (1979). Detecting cyclicity in social interaction.Psychological Bulletin, 36, 338–348. Gottman, J., & Roy, A. (1989).Sequential analysis: A guide for behavioral researchers. New York: Cambridge University Press. Greeno, J. (1974). Representation of learning as a discrete transition in a finite state space. In D. Krantz, R. Atkinson, R. Luce, & P. Suppes (Eds.),Contemporary developments in mathematical psychology: Vol. 1: Learning, memory, and thinking. San Francisco: W. H. Freeman. Hanushek, E., & Jackson, J. (1977).Statistical methods for social scientists. New York: Acadmeic Press. Heckman, J., & Singer, B. (1982). Population heterogeneity in demographic models. In K. Land & A. Rogers (Eds.),Multidimensional mathematical demography. New York: Academic Press. Louis, T. (1982). Finding the observed information matrix when using the EM algorithm.Journal of the Royal Statistical Society, Series B,44, 226–233. Schotenberg, R. (1985). Latent variables in the analysis of limited dependent variables. In N. Tuma (Ed.),Sociological methodology 1985. San Francisco: Jossey-Bass. Townsend, J., & Ashby, G. (1983).The stochastic modeling of elementary psychological processes. New York: Cambridge University Press. White, H. (1982). Maximum likelihood estimation of misspecified models.econometrica, 50, 1–25. Wilkinson, A. (1982). Partial knowledge and self correction: Developmental studies of a quantitative concept.Development Psychology, 18, 876–893.