Bayesian network data imputation with application to survival tree analysis

Computational Statistics and Data Analysis - Tập 93 - Trang 373-387 - 2016
Paola M.V. Rancoita1,2,3, Marco Zaffalon2, Emanuele Zucca4, Francesco Bertoni3,4, Cassio P. de Campos5
1University Centre for Statistics in the Biomedical Sciences, Vita-Salute San Raffaele University, Milan, Italy
2Dalle Molle Institute for Artificial Intelligence, Manno, Switzerland
3Institute of Oncology Research, Bellinzona, Switzerland
4Oncology Institute of Southern Switzerland, Bellinzona, Switzerland
5Queen’s University Belfast, School of Electronics, Electrical Engineering and Computer Science, Belfast, UK

Tài liệu tham khảo

Breiman, 1984 Buntine, 1991, Theory refinement on Bayesian networks, 52 Ciampi, 1986, Stratification by stepwise regression, correspondence analysis and recursive partition: a comparison of three methods of analysis for survival data with covariates, Comput. Statist. Data Anal., 4, 185, 10.1016/0167-9473(86)90033-2 Cooper, 1992, A Bayesian method for the induction of probabilistic networks from data, Mach. Learn., 9, 309, 10.1007/BF00994110 Cox, 1972, Regression models and life-tables, J. R. Stat. Soc. Ser. B, 34, 187 Cussens, J., 2011. Bayesian network learning with cutting planes. In: Proceedings of the Twenty-Seventh Annual Conference on Uncertainty in Artificial Intelligence, UAI-11, pp. 153–160. Davis, 1989, Exponential survival trees, Stat. Med., 8, 947, 10.1002/sim.4780080806 de Campos, 2010, Properties of Bayesian Dirichlet scores to learn Bayesian network structures, 431 de Campos, 2011, Efficient structure learning of Bayesian networks using constraints, J. Mach. Learn. Res., 12, 663 Di Zio, 2004, Bayesian networks for imputation, J. Roy. Statist. Soc. Ser. A, 167, 309, 10.1046/j.1467-985X.2003.00736.x Fana, 2009, Multivariate exponential survival trees and their application to tooth prognosis, Comput. Statist. Data Anal., 53, 1110, 10.1016/j.csda.2008.10.019 Friedman, 1998, The Bayesian structural EM algorithm, 129 Graf, 1999, Assessment and comparison of prognostic classification schemes for survival data, Stat. Med., 18, 2529, 10.1002/(SICI)1097-0258(19990915/30)18:17/18<2529::AID-SIM274>3.0.CO;2-5 Heckerman, 1995, Learning Bayesian networks: the combination of knowledge and statistical data, Mach. Learn., 20, 197, 10.1007/BF00994016 Hothorn, 2006, Unbiased recursive partitioning: a conditional inference framework, J. Comput. Graph. Statist., 15, 651, 10.1198/106186006X133933 Hothorn, 2004, Bagging survival trees, Stat. Med., 23, 77, 10.1002/sim.1593 Jaakkola, 2010, Learning Bayesian network structure using LP relaxations, 358 Keleş, 2002, Residual-based tree-structured survival analysis, Stat. Med., 21, 313, 10.1002/sim.981 Koller, 2009 LeBlanc, 1992, Relative risk trees for censored survival data, Biometrics, 48, 411, 10.2307/2532300 LeBlanc, 1993, Survival trees by goodness of split, J. Amer. Statist. Assoc., 88, 457, 10.1080/01621459.1993.10476296 Little, 1987 Meila, M., Jordan, M., 1998. Estimating dependency structure as a hidden variable. In: Conference on Advances in Neural Information Processing Systems, pp. 584–590. Nilsson, 1998, An efficient algorithm for finding the M most probable configurations in Bayesian networks, Stat. Comput., 9, 159, 10.1023/A:1008990218483 Peto, 1972, Asymptotically efficient rank invariant test procedures, J. Roy. Statist. Soc. Ser. A, 135, 185, 10.2307/2344317 Ramoni, 1997, Learning Bayesian networks from incomplete databases, 401 Riggelsen, 2006, Learning Bayesian networks from incomplete data: An efficient method for generating approximate predictive distributions, 130 Riggelsen, C., Feelders, A., 2005. Learning Bayesian network models from incomplete data using importance sampling. In: Cowell, R. G., Ghahramani, Z. (Eds.), Proc. of AI & Statistics. Society for Artificial Intelligence and Statistics, pp. 301–308, (Available electronically at: http://www.gatsby.ucl.ac.uk/aistats/). Rinaldi, 2011, Genome wide DNA-profiling of marginal zone lymphomas identifies subtype-specific lesions with an impact on the clinical outcome, Blood, 117, 1595, 10.1182/blood-2010-01-264275 Romero, 2004, Multivariate imputation of qualitative missing data using Bayesian networks, 605 Scandurra, 2010, Genomic lesions associated with a different clinical outcome in diffuse large B-cell lymphoma treated with R-CHOP-21, Br. J. Haematol., 151, 221, 10.1111/j.1365-2141.2010.08326.x Schauerhuber, 2008, Benchmarking open-source tree learners in R/RWeka, 389 Segal, 1988, Regression trees for censored data, Biometrics, 44, 35, 10.2307/2531894 Silander, 2006, A simple approach for finding the globally optimal Bayesian network structure, 445 Simon, 2011, Regularization paths for Cox’s proportional hazards model via coordinate descent, J. Stat. Softw., 39, 10.18637/jss.v039.i05 Singh, 1998 Tanner, 1987, The calculation of posterior distributions by data augmentation, J. Amer. Statist. Assoc., 82, 528, 10.1080/01621459.1987.10478458 Zucca, 2003, Nongastric marginal zone B-cell lymphoma of mucosa-associated lymphoid tissue, Blood, 101, 2489, 10.1182/blood-2002-04-1279