Methods for comparative assessment of the results of cluster analysis of hydrobiocenoses structure (by the example of zooplankton communities of the Linda River, Nizhny Novgorod region)

Inland Water Biology - Tập 9 - Trang 200-208 - 2016
B. N. Yakimov1, G. V. Shurganova1, V. V. Cherepennikov1, I. A. Kudrin1, M. Yu. Il’in1
1Lobachevsky State University of Nizhny Novgorod, Nizhny Novgorod, Russia

Tóm tắt

In this paper we present modern approaches to the classification of hydrobiological samples based on various metrics of species-structure similarity—Euclidean distance, Renkonen index, and the cosine of the angle between the species abundances vectors. We use the cophenetic correlation coefficient, Gower distance, and Shepard-like plot for the justification of clustering method. For the choice of the optimal number of clusters, we apply approaches based on silhouette widths and binary matrices representing partitions. An analysis of the spatial structure of zooplankton communities in the small Linda River shows that average agglomerative clustering is an optimal algorithm for objects of this type. A comparative analysis of the results of cluster analysis on the basis of different similarity metrics shows that the most adequate classification can be obtained using the cosine of the angle between the species abundances vectors and the Renkonen index, whereas the classification based on the Euclidean distances is less successful from the biological point of view. Approaches outlined in this paper allow researchers to make quantitative decisions about key elements of classification, greatly reducing the subjectivity of the cluster analysis results.

Tài liệu tham khảo

Guidelines for collecting and processing of materials in hydrobiological studies in freshwater bodies, in Zooplankton i ego produktsiya (Zooplankton and Its Production), Leningrad: Gos. Nauchno-Issled. Inst. Ozern. Rechn. Rybn. Khoz., 1982. Oldenderfer, M.S. and Bleshfild, R.K., Cluster analysis, in Faktornyi, diskriminantnyi i klasternyi analiz (Factor, Discriminant, and Cluster Analysis), Moscow: Finansy i Statistika, 1989, pp. 139–214. Pesenko, Yu.A., Printsipy i metody kolichestvennogo analiza v faunisticheskikh issledovaniyakh (Principles and Methods for Quantitative Analysis in Faunal Studies), Moscow: Nauka, 1982. Pidgaiko, M.L., Zooplankton vodoemov evropeiskoi chasti SSSR (Zooplankton of Water Bodies of the European Part of the Soviet Union), Moscow: Nauka, 1984. Cherepennikov, V.V., Shurganova, G.V., and Artel’nyi, E.V., The use of multidimensional vector analysis to evaluate the spatial distribution of zooplankton in the Cheboksary Reservoir, in Mezhd. konf. “Ekologicheskie problemy basseinov krupnykh rek,” Tezisy dokladov (Int. Conf. “Environmental Problems of the Large River Basins,” Abstracts of Papers), Tolyatti, 2003, p. 303. Cherepennikov, V.V., Shurganova, G.V., Gelashvili, D.B., and Artel’nyi, E.V., Study of the differences in the species structure of the main zooplanktocenoses in the Cheboksary Reservoir by multivariate analysis, Izv. Samar. Nauch. Tsentra, Ross. Akad. Nauk, 2004, vol. 6, no. 2 (12), pp. 328–333. Shurganova, G.V., Kudrin, I.A., Il’in, M.Yu., and Cherepennikov, V.V., Characteristics of the spatial and species structure of zooplankton and evaluation of the quality of water in Kudma and Linda rivers (Nizhny Novgorod oblast), Voda: Khim. Ekol., 2014, no. 1, pp. 28–35. Shurganova, G.V. and Cherepennikov, V.V., Methods for distinguishing and identification of aquatic communities, in Ekologicheskii monitoring. Metody biologicheskogo i fiziko-khimicheskogo monitoringa. Uch. Pos. (Environmental monitoring: Methods of biological and physicochemical monitoring: Tutorial), Nizhny Novgorod: Nizhegorod. Gos. Univ., 2011, part 7, pp. 121–155. Borcard, D., Gillet, F., and Legendre, P., Numerical Ecology, Ecology with R, New York: Springer-Verlag, 2011. R Core Team. R: A language and environment for statistical computing, 2015. http://www.R-projectorg/ Gower, J.C., Comparing classifications, in Numerical Taxonomy, Berlin: Springer-Verlag, 1983, pp. 137–155. Jain, A.K. and Dubes, R.C., Algorithms for Clustering Data, New Jersey: Prentice Hall, 1988. Jost, L., Chao, A., and Chazdon, R.L., Compositional similarity and ß (beta) diversity, in Biological Diversity. Frontiers in Measurement and Assessment, Oxford: Oxford Univ. Press, 2011, pp. 66–84. Legendre, P. and Legendre, L., Numerical Ecology, Oxford: Elsevier, 2012. Rousseeuw, P.J., Silhouettes: a graphical aid to the interpretation and validation of cluster analysis, J. Comp. Appl. Math., 1987, vol. 20, pp. 53–65. Shepard, R.N., The analysis of proximities: multidimensional scaling with an unknown distance function, Psychometrika, 1962, vol. 27, pp. 125–139. Sokal, R.R. and Rohlf, F.J., The comparison of dendrograms by objective methods, Taxon, 1962, vol. 11, pp. 33–40.