A sampling approach to estimate the log determinant used in spatial likelihood problems

Journal of Geographical Systems - Tập 11 - Trang 209-225 - 2009
R. Kelley Pace1, James P. LeSage2
1Department of Finance, LREC Endowed Chair of Real Estate, E.J. Ourso College of Business Administration, Louisiana State University, Baton Rouge, USA
2Department of Finance and Economics, Fields Endowed Chair in Urban and Regional Economics, McCoy College of Business Administration, Texas State University-San Marcos, San Marcos, USA

Tóm tắt

Likelihood-based methods for modeling multivariate Gaussian spatial data have desirable statistical characteristics, but the practicality of these methods for massive georeferenced data sets is often questioned. A sampling algorithm is proposed that exploits a relationship involving log-pivots arising from matrix decompositions used to compute the log determinant term that appears in the model likelihood. We demonstrate that the method can be used to successfully estimate log-determinants for large numbers of observations. Specifically, we produce an log-determinant estimate for a 3,954,400 by 3,954,400 matrix in less than two minutes on a desktop computer. The proposed method involves computations that are independent, making it amenable to out-of-core computation as well as to coarse-grained parallel or distributed processing. The proposed technique yields an estimated log-determinant and associated confidence interval.

Tài liệu tham khảo

Anselin L (1988) Spatial econometrics: methods and models. Kluwer Academic Publishers, Dordrecht Barry R, Pace RK (1997) Kriging with large data sets using sparse matrix techniques. Commun Stat Comput Simul 26:619–629 Barry R, Pace RK (1999) A Monte Carlo estimator of the log determinant of large sparse matrices. Linear Algebra Appl 289:41–54 Bavaud F (1998) Models for spatial weights: a systematic look. Geogr Anal 30:153–171 Caragea PC, Smith RL (2007) Asymptotic properties of computationally efficient alternative estimators for a class of multivariate normal models. J Multivar Anal 98:1417–1440 Cressie N (1993) Statistics for spatial data. Wiley, New York Cressie N, Aldworth J (1997) Spatial statistical analysis and its consequences for spatial sampling. In: Baafi EY, Schofield NA (eds) Geostatistics Wollongong ’96, vol 1. Kluwer, Dordrecht, pp 126–137 Daubert v. Merrill Dow Pharmaceuticals, Inc., 293 U.S. 579 (1993) Griffith D (2000) Eigenfunction properties and approximations of selected incidence matrices employed in spatial analysis. Linear Algebra Appl 321:95–112 Griffith D, Sone A (1995) Trade-offs associated with normalizing constant computational simplifications for estimating spatial statistical models. J Stat Comput Simul 51:165–183 Marcus M, Minc H (1992) A survey of matrix theory and matrix inequalities. Dover, New York Pace RK, Barry R (1997) Quick computation of regressions with a spatially autoregressive dependent variable. Geogr Anal 29:232–247 Pace RK, LeSage J (2004a) Spatial autoregressive local estimation. In: Mur J, Zoller H, Getis A (eds) Recent advances in spatial econometrics. Palgrave Publishers, pp 31–51 Pace RK, LeSage J (2004b) Chebyshev approximation of log-determinants of spatial weight matrices. Comput Stat Data Anal 45:179–196 Smirnov O, Anselin L (2001) Fast maximum likelihood estimation of very large spatial autoregressive models: a characteristic polynomial approach. Comput Stat Data Anal 35:301–319 Stein ML (1999) Interpolation of spatial data: some theory for Kriging. Springer, New York Strang G (1976) Linear algebra and its applications. Academic Press, New York