SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler Tập 1 Số 1 - 2012
Ruibang Luo, Binghang Liu, Yinlong Xie, Zhenyu Li, Weihua Huang, Jianying Yuan, Guangzhu He, Yanxiang Chen, Qi Pan, Yunjie Liu, Jingbo Tang, Gengxiong Wu, Hao Zhang, Yujian Shi, Yong Liu, Chang Yu, Bo Wang, Yao Lu, Changlei Han, David W. Cheung, Siu‐Ming Yiu, Shaoliang Peng, Xiaoqian Zhu, Guangming Liu, Xiangke Liao, Yingrui Li, Huanming Yang, Jun Wang, Tak‐Wah Lam
SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data Tập 7 Số 1 - 2018
Yuxin Chen, Yongsheng Chen, Chunmei Shi, Zhibo Huang, Yong Zhang, Shengkang Li, Yan Li, Jia Ye, Chang Ho Yu, Zhuo Li, Xiuqing Zhang, Wei Wang, Huanming Yang, Lin Fang, Qiang Chen
PRSice-2: Polygenic Risk Score software for biobank-scale data Tập 8 Số 7 - 2019
Shing Wan Choi, Paul F. O’Reilly
Abstract
Background
Polygenic risk score (PRS) analyses have become an integral part of biomedical research, exploited to gain insights into shared aetiology among traits, to control for genomic profile in experimental studies, and to strengthen causal inference, among a range of applications. Substantial efforts are now devoted to biobank projects to collect large genetic and phenotypic data, providing unprecedented opportunity for genetic discovery and applications. To process the large-scale data provided by such biobank resources, highly efficient and scalable methods and software are required.
Results
Here we introduce PRSice-2, an efficient and scalable software program for automating and simplifying PRS analyses on large-scale data. PRSice-2 handles both genotyped and imputed data, provides empirical association P-values free from inflation due to overfitting, supports different inheritance models, and can evaluate multiple continuous and binary target traits simultaneously. We demonstrate that PRSice-2 is dramatically faster and more memory-efficient than PRSice-1 and alternative PRS software, LDpred and lassosum, while having comparable predictive power.
Conclusion
PRSice-2's combination of efficiency and power will be increasingly important as data sizes grow and as the applications of PRS become more sophisticated, e.g., when incorporated into high-dimensional or gene set–based analyses. PRSice-2 is written in C++, with an R script for plotting, and is freely available for download from http://PRSice.info.
Genome sequence of pacific abalone (Haliotis discus hannai): the first draft genome in family Haliotidae Tập 6 Số 5 - 2017
Bo‐Hye Nam, Woori Kwak, Young‐Ok Kim, Dong‐Gyun Kim, Hee Jeong Kong, Woo‐Jin Kim, Jeong-Ha Kang, Jung Youn Park, Cheul Min An, Ji‐Yong Moon, Choul Ji Park, Jaewoong Yu, Joon Yoon, Minseok Seo, Kwondo Kim, Duk‐Kyung Kim, SaetByeol Lee, Samsun Sung, Chul Lee, Younhee Shin, Myunghee Jung, Byeong-Chul Kang, Ga‐Hee Shin, Sojeong Ka, Kelsey Caetano-Anollés, Seoae Cho, Heebal Kim