Information Sets and Excess Zeros in Random Effects Modeling of Longitudinal DataStatistics in Biosciences - Tập 2 - Trang 81-94 - 2010
Tze L. Lai, Kevin H. Sun, Samuel P. Wong
Marginal regression via generalized estimating equations is widely used in biostatistics to model longitudinal data from subjects whose outcomes and covariates are observed at several time points. In this paper we consider two issues that have been raised in the literature concerning the marginal regression approach. The first is that even though the past history may be predictive of outcome, the ...... hiện toàn bộ
Improved Semiparametric Analysis of Polygenic Gene–Environment Interactions in Case–Control StudiesStatistics in Biosciences - Tập 13 - Trang 386-401 - 2020
Tianying Wang, Alex Asher
Standard logistic regression analysis of case–control data has low power to detect gene–environment interactions, but until recently it was the only method that could be used on complex polygenic data for which parametric distributional models are not feasible. Under the assumption of gene–environment independence in the underlying population, Stalder et al. (Biometrika, 104:801–812, 2017) develop...... hiện toàn bộ
Estimating Heterogeneous Treatment Effect on Multivariate Responses Using Random ForestsStatistics in Biosciences - Tập 15 - Trang 545-561 - 2021
Boyi Guo, Hannah D. Holscher, Loretta S. Auvil, Michael E. Welge, Colleen B. Bushell, Janet A. Novotny, David J. Baer, Nicholas A. Burd, Naiman A. Khan, Ruoqing Zhu
Estimating the individualized treatment effect has become one of the most popular topics in statistics and machine learning communities in recent years. Most existing methods focus on modeling the heterogeneous treatment effects for univariate outcomes. However, many biomedical studies are interested in studying multiple highly correlated endpoints at the same time. We propose a random forest mode...... hiện toàn bộ
Estimating Decision-Relevant Comparative Effects Using Instrumental VariablesStatistics in Biosciences - Tập 3 - Trang 6-27 - 2011
Anirban Basu
Instrumental variables methods (IV) are widely used in the health economics literature to adjust for hidden selection biases in observational studies when estimating treatment effects. Less attention has been paid in the applied literature to the proper use of IVs if treatment effects are heterogeneous across subjects. Such a heterogeneity in effects becomes an issue for IV estimators when individ...... hiện toàn bộ
Links Between the Sequence Kernel Association and the Kernel-Based Adaptive Cluster TestsStatistics in Biosciences - Tập 9 - Trang 246-258 - 2017
Weiming Zhang, Michael P. Epstein, Tasha E. Fingerlin, Debashis Ghosh
Two recently developed methods for the analysis of rare variants include the sequence kernel association test (SKAT) and the kernel-based adaptive cluster test (KBAC). While SKAT represents a type of variance component score test, and KBAC computes a weighted integral representing the difference in risk between variants, they appear to be developed using different initial principles. In this note,...... hiện toàn bộ
Statistical and Computational Methods for High-Throughput Sequencing Data Analysis of Alternative SplicingStatistics in Biosciences - Tập 5 - Trang 138-155 - 2012
Liang Chen
The burgeoning field of high-throughput sequencing significantly improves our ability to understand the complexity of transcriptomes. Alternative splicing, as one of the most important driving forces for transcriptome diversity, can now be studied at an unprecedent resolution. Efficient and powerful computational and statistical methods are in urgent need to facilitate the characterization and qua...... hiện toàn bộ
New Approaches to Principal Component Analysis for TreesStatistics in Biosciences - Tập 4 - Trang 132-156 - 2012
Burcu Aydın, Gábor Pataki, Haonan Wang, Alim Ladha, Elizabeth Bullitt, J. S. Marron
Object Oriented Data Analysis is a new area in statistics that studies populations of general data objects. In this article we consider populations of tree-structured objects as our focus of interest. We develop improved analysis tools for data lying in a binary tree space analogous to classical Principal Component Analysis methods in Euclidean space. Our extensions of PCA are analogs of one dimen...... hiện toàn bộ
Hot Deck Multiple Imputation for Handling Missing Accelerometer DataStatistics in Biosciences - Tập 11 - Trang 422-448 - 2018
Nicole M. Butera, Siying Li, Kelly R. Evenson, Chongzhi Di, David M. Buchner, Michael J. LaMonte, Andrea Z. LaCroix, Amy Herring
Missing data due to non-wear are common in accelerometer studies measuring physical activity and sedentary behavior. Accelerometer outputs are high-dimensional time-series data that are episodic and often highly skewed, presenting unique challenges for handling missing data. Common methods for missing accelerometry either are ad-hoc, require restrictive parametric assumptions, or do not appropriat...... hiện toàn bộ