Spatial Spread Sampling Using Weakly Associated Vectors

Raphaël Jauslin1, Yves Tillé1
1Institute of statistics, University of Neuchatel, Neuchatel, Switzerland

Tóm tắt

Geographical data are generally autocorrelated. In this case, it is preferable to select spread units. In this paper, we propose a new method for selecting well-spread samples from a finite spatial population with equal or unequal inclusion probabilities. The proposed method is based on the definition of a spatial structure by using a stratification matrix. Our method exactly satisfies given inclusion probabilities and provides samples that are very well spread. A set of simulations shows that our method outperforms other existing methods such as the generalized random tessellation stratified or the local pivotal method. Analysis of the variance on a real dataset shows that our method is more accurate than these two. Furthermore, a variance estimator is proposed.

Tài liệu tham khảo

Bellhouse, D. R. (1977). Some optimal designs for sampling in two dimensions. Biometrika, 64(3):605–611. Benedetti, R., Piersimoni, F., and Postiglione, P. (2017). Spatially balanced sampling: A review and a reappraisal. International Statistical Review, 85(3):439–454. Brown, J. A., Robertson, B. L., and McDonald, T. (2015). Spatially balanced sampling: application to environmental surveys. Procedia Environmental Sciences, 27:6–9. Deville, J.-C. and Tillé, Y. (1998). Unequal probability sampling without replacement through a splitting method. Biometrika, 85:89–101. Deville, J.-C. and Tillé, Y. (2004). Efficient balanced sampling: The cube method. Biometrika, 91:893–912. Dickson, M. M. and Tillé, Y. (2016). Ordered spatial sampling by means of the traveling salesman problem. Computational Statistics, 31(4):1359–1372. Diggle, P. J., Menezes, R., and Su, T.-l. (2010). Geostatistical inference under preferential sampling. Journal of the Royal Statistical Society: Series C (Applied Statistics), 59(2):191–232. Eddelbuettel, D. and Sanderson, C. (2014). Rcpparmadillo: Accelerating R with high-performance c++ linear algebra. Computational Statistics & Data Analysis, 71:1054–1063. Grafström, A. and Lisic, J. (2019). BalancedSampling: Balanced and Spatially Balanced Sampling. R package version 1.5.5. Grafström, A. (2011). Spatially correlated Poisson sampling. Journal of Statistical Planning and Inference, 142:139–147. Grafström, A. and Lundström, N. L. P. (2013). Why well spread probability samples are balanced? Open Journal of Statistics, 3(1):36–41. Grafström, A., Lundström, N. L. P., and Schelin, L. (2012). Spatially balanced sampling through the pivotal method. Biometrics, 68(2):514–520. Grafström, A. and Schelin, L. (2014). How to select representative samples? Scandinavian Journal of Statistics, 41:277–290. Grafström, A. and Tillé, Y. (2013). Doubly balanced spatial sampling with spreading and restitution of auxiliary totals. Environmetrics, 14(2):120–131. Hájek, J. (1981). Sampling from a Finite Population. Marcel Dekker, New York. Hankin, D., Mohr, M., and Newman, K. (2019). Sampling Theory: For the Ecological and Natural Resource Sciences. Oxford University Press, New York. Kincaid, T. M., Olsen, A. R., and Weber, M. H. (2019). spsurvey: Spatial Survey Design and Analysis. R package version 4.1.0. Marker, D. A. and Stevens Jr., D. L. (2009). Sampling and inference in environmental surveys. In Sample surveys: design, methods and applications, volume 29 of Handbook of Statististics, pages 487–512. Elsevier/North-Holland, New York, Amsterdam. Moran, P. A. P. (1950). Notes on continuous stochastic phenomena. Biometrika, 37(1/2):17–23. Neyman, J. and Scott, E. L. (1958). Statistical approach to problems of cosmology. Journal of the Royal Statistical Society: Series B (Methodological), 20(1):1–29. Pebesma, E. J. and Bivand, R. S. (2005). Classes and methods for spatial data in R. R News, 5(2):9–13. Quenouille, M. H. (1949). Problems in plane sampling. The Annals of Mathematical Statistics, 20:355–375. Robertson, B., McDonald, T., Price, C., and Brown, J. (2018). Halton iterative partitioning: spatially balanced sampling via partitioning. Environmental and Ecological Statistics, 25:305–323. Sen, A. R. (1953). On the estimate of the variance in sampling with varying probabilities. Journal of the Indian Society of Agricultural Statistics, 5:119–127. Stevens Jr., D. L. and Olsen, A. R. (1999). Spatially restricted surveys over time for aquatic resources. Journal of Agricultural, Biological, and Environmental Statistics, 4:415–428. Stevens Jr., D. L. and Olsen, A. R. (2003). Variance estimation for spatially balanced samples of environmental resources. Environmetrics, 14(6):593–610. Stevens Jr., D. L. and Olsen, A. R. (2004). Spatially balanced sampling of natural resources. Journal of the American Statistical Association, 99(465):262–278. Theobald, D. M., Stevens Jr., D. L., White, D. E., Urquhart, N. S., Olsen, A. R., and Norman, J. B. (2007). Using GIS to generate spatially balanced random survey designs for natural resource applications. Environmental Management, 40(1):134–146. Tillé, Y. (2006). Sampling Algorithms. Springer, New York. Tillé, Y. (2020). Sampling and estimation from finite populations. Wiley, New York. Tillé, Y., Dickson, M. M., Espa, G., and Giuliani, D. (2018). Measuring the spatial balance of a sample: A new measure based on the Moran’s \({I}\) index. Spatial Statistics, 23:182–192. Tillé, Y. and Wilhelm, M. (2017). Probability sampling designs: Balancing and principles for choice of design. Statistical Science, 32(2):176–189. Vallée, A.-A., Ferland-Raymond, B., Rivest, L.-P., and Tillé, Y. (2015). Incorporating spatial and operational constraints in the sampling designs for forest inventories. Environmetrics, 26(8):557–570. Wang, J.-F., Stein, A., Gao, B.-B., and Ge, Y. (2012). A review of spatial sampling. Spatial Statistics, 2:1–14. Yates, F. and Grundy, P. M. (1953). Selection without replacement from within strata with probability proportional to size. Journal of the Royal Statistical Society, B15:235–261.