
AbstractThe stepped wedge cluster randomized trial (SW‐CRT) is an increasingly popular design for evaluating health service delivery or policy interventions. An essential consideration of this design is the need to account for both within‐period and between‐period correlations in sample size calculations. Especially when embedded in health care delivery systems, many SW‐CRTs may have subclusters nested in clusters, within which outcomes are collected longitudinally. However, existing sample size methods that account for between‐period correlations have not allowed for multiple levels of clustering. We present computationally efficient sample size procedures that properly differentiate within‐period and between‐period intracluster correlation coefficients in SW‐CRTs in the presence of subclusters. We introduce an extended block exchangeable correlation matrix to characterize the complex dependencies of outcomes within clusters. For Gaussian outcomes, we derive a closed‐form sample size expression that depends on the correlation structure only through two eigenvalues of the extended block exchangeable correlation structure. For non‐Gaussian outcomes, we present a generic sample size algorithm based on linearization and elucidate simplifications under canonical link functions. For example, we show that the approximate sample size formula under a logistic linear mixed model depends on three eigenvalues of the extended block exchangeable correlation matrix. We provide an extension to accommodate unequal cluster sizes and validate the proposed methods via simulations. Finally, we illustrate our methods in two real SW‐CRTs with subclusters.
eigenvalues, extended block exchangeable correlation structure, power analysis, Applications of statistics to biology and medical sciences; meta analysis, generalized linear mixed models, Research Design, Sample Size, cluster randomized trial, Cluster Analysis, Algorithms
eigenvalues, extended block exchangeable correlation structure, power analysis, Applications of statistics to biology and medical sciences; meta analysis, generalized linear mixed models, Research Design, Sample Size, cluster randomized trial, Cluster Analysis, Algorithms
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 12 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
