Clustering of Sun exposure measurements

A. Szymkowiak-Have1, J. Larsen1, L.K. Hansen1, P.A. Philipsen2, E. Thieden2, H.C. Wulf2
1Informatics and Mathematical Modelling, Technical University of Denmark, Kongens Lyngby, Denmark
2Department of Dermatology, Bispebjerg Hospital, University of Copenhagen, Copenhagen, Denmark

Tóm tắt

In a medically motivated Sun-exposure study, questionnaires concerning Sun-habits were collected from a number of subjects together with UV radiation measurements. This paper focuses on identifying clusters in the heterogeneous set of data for the purpose of understanding possible relations between Sun-habits exposure and eventually assessing the risk of skin cancer. A general probabilistic framework originally developed for text and Web mining is demonstrated to be useful for clustering of behavioral data. The framework combines principal component subspace projection with probabilistic clustering based on the generalizable Gaussian mixture model.

Từ khóa

#Sun #Matrix decomposition #Skin cancer #Data mining #Training data #Mathematical model #Electronic mail #Hospitals #Sampling methods #Biomedical imaging

Tài liệu tham khảo

hansen, 2000, Modeling text with generalizable gaussian mixtures, Proc IEEE ICASSP, vi, 3494 10.1109/ICNN.1996.548861 szymkowiak, 2001, Impuating missing values in diary records of sun-exposure study, Proc IEEE Workshop Neural Networks Signal Processing, 489 10.1016/S0167-9473(01)00076-7 10.1109/NNSP.2002.1030096 10.1017/CBO9780511812651 larsen, 2002, Probabilistic Hierarchical Clustering with Labeled and Unlabeled Data, International Journal of Knowledge-based Intelligent Engineering Systems, 6, 56 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9 szymkowiak, 2001, Hierarchical Clustering for datamining, Proc 5th Int Conf on Knowledge-Based Intelligent Information Engineering Systems and Allied Technologies, 261 10.1007/BF02532251