Dealing with Zeros and Missing Values in Compositional Data Sets Using Nonparametric Imputation

J. A. Martín-Fernández1, C. Barceló-Vidal1, V. Pawlowsky-Glahn1
1Dept. Informàtica i Matemàtica Aplicada, Universitat de Girona, Girona, Spain

Tóm tắt

The statistical analysis of compositional data based on logratios of parts is not suitable when zeros are present in a data set. Nevertheless, if there is interest in using this modeling approach, several strategies have been published in the specialized literature which can be used. In particular, substitution or imputation strategies are available for rounded zeros. In this paper, existing nonparametric imputation methods—both for the additive and the multiplicative approach—are revised and essential properties of the last method are given. For missing values a generalization of the multiplicative approach is proposed.

Tài liệu tham khảo