Privacy-preserving genotype imputation in a trusted execution environment

Cell Systems - Volume 12 - Pages 983-993.e7 - 2021
Natnatee Dokmai1,2, Can Kockan1,3, Kaiyuan Zhu1,3, XiaoFeng Wang1, S. Cenk Sahinalp3, Hyunghoon Cho2
1Department of Computer Science, Indiana University, Bloomington, IN 47408, USA
2Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
3Cancer Data Science Laboratory, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA
