You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.qmd
+2-2Lines changed: 2 additions & 2 deletions
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ format: gfm
7
7
The `comprehensivecohort` package provides functions for estimating the comprehensive cohort causal effects (CCCE) in comprehensive cohort studies. We develop a semiparametric sensitivity analysis framework for assessing the impact of unmeasured confounding in the observational arm. Our methods can also handle outcomes missing at random. Details about the study design, assumptions, methodology and implementation can be found in the vignettes and paper.
8
8
9
9
10
-
## Installment
10
+
## Installation
11
11
12
12
Users can install `comprehensivecohort` using the [`remotes`](https://cran.r-project.org/package=remotes){target="_blank"} R package:
13
13
@@ -21,7 +21,7 @@ Or install from CRAN:
21
21
22
22
## Example
23
23
24
-
The package includes a simulated dataset based on the TOIB study, a comprehensive cohort study aiming to determine whether to advice older adults with chronic knee pain to apply either topical or oral non-steroidal anti-inflammatory drugs (NSAIDs) for knee pain management. The dataset include 563 observations with outcome $Y$ as Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) pain score ($0-100$) at 12 months. Some outcome observations might be missing (denoted as NA in $Y$). Column $M$ is a outcome missingness indicator: $1$ if $Y$ is observed, $0$ if $Y$ is missing. Other variables in the dataset include $t$ the treatment indicator ($1$ for topical NSAIDs, $0$ for oral NSAIDs), and $R$ the randomization consent indicator ($1$ for RCT, $0$ for OBS). Rest of the columns are baseline covariates (age, baseline WOMAC pain score, expected pain one year later, chronic pain grade).
24
+
The package includes a simulated dataset based on the TOIB study, a comprehensive cohort study aiming to determine whether to advise older adults with chronic knee pain to apply either topical or oral non-steroidal anti-inflammatory drugs (NSAIDs) for knee pain management. The dataset include 563 observations with outcome $Y$ as Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) pain score ($0-100$) at 12 months. Some outcome observations might be missing (denoted as NA in $Y$). Column $M$ is a outcome missingness indicator: $1$ if $Y$ is observed, $0$ if $Y$ is missing. Other variables in the dataset include $t$ the treatment indicator ($1$ for topical NSAIDs, $0$ for oral NSAIDs), and $R$ the randomization consent indicator ($1$ for RCT, $0$ for OBS). Rest of the columns are baseline covariates (age, baseline WOMAC pain score, expected pain one year later, chronic pain grade).
Copy file name to clipboardExpand all lines: vignettes/Intro.qmd
+4-4Lines changed: 4 additions & 4 deletions
Original file line number
Diff line number
Diff line change
@@ -129,16 +129,16 @@ The dataset has 563 observations and 8 variables, which include:
129
129
130
130
## Application
131
131
132
-
The primary function is `est_psi()`, which returns point estimates, estimated variance and $95\%$ Walds confidence interval for $\psi_t(\widetilde{P}; \gamma_t)$, $\psi_{t, 0}(\widetilde{P}; \gamma_t)$ and $\psi_{t, 1}(\widetilde{P})$ under one or more pre-specified $\gamma_t$.
132
+
The primary function is `est_psi()`, which returns point estimates, estimated variance and $95\%$ Wald confidence interval for $\psi_t(\widetilde{P}; \gamma_t)$, $\psi_{t, 0}(\widetilde{P}; \gamma_t)$ and $\psi_{t, 1}(\widetilde{P})$ under one or more pre-specified $\gamma_t$.
133
133
134
134
Users need to input data and specify several parameters:
135
135
136
-
* Data: <spanstyle="color: blue;">Y</span>, <spanstyle="color: blue;">M</span>, <spanstyle="color: blue;">R</span>, <spanstyle="color: blue;">t</span>, <spanstyle="color: blue;">M</span> (baseline covariates in the form of data frame)
136
+
* Data: <spanstyle="color: blue;">Y</span>, <spanstyle="color: blue;">M</span>, <spanstyle="color: blue;">R</span>, <spanstyle="color: blue;">t</span>, <spanstyle="color: blue;">X</span> (baseline covariates as a data frame)
137
137
* Estimand of interest: <spanstyle="color: blue;">trt</span>=1 if estimating $E[Y(1)]$, $E[Y(1)|R=0]$ and $E[Y(1)|R=1]$; <spanstyle="color: blue;">trt</span>=0 if estimating $E[Y(0)]$, $E[Y(0)|R=0]$ and $E[Y(0)|R=1]$
138
138
* Sensitivity parameters: <spanstyle="color: blue;">gamma</span>, a vector of $\gamma_t$ value.
139
-
* Single index model settings [@redd2025sensiatrpackageconducting]: <spanstyle="color: blue;">kernel</span>, kernel smoothing, choices of Gaussian and Epanechnikov kernel; <spanstyle="color: blue;">single_index_method</span>, types of constraints in estimation, choices of setting first coefficient to 1, norm of coefficients to 1, and bandwidth to 1; <spanstyle="color: blue;">method</span>, optimization methods, default to `optim`; <spanstyle="color: blue;">use_mave</span>, whether to apply MAVE [@mave_xia_2002; @mave_wang_2008] or cumulative sliced inverse regression method [@slice_zhu_2010] to estimate initial value of the coeffficients, default to TRUE.
139
+
* Single index model settings [@redd2025sensiatrpackageconducting]: <spanstyle="color: blue;">kernel</span>, kernel smoothing, choices of Gaussian and Epanechnikov kernel; <spanstyle="color: blue;">single_index_method</span>, types of constraints in estimation, choices of setting first coefficient to 1, norm of coefficients to 1, and bandwidth to 1; <spanstyle="color: blue;">method</span>, optimization methods, default to `optim`; <spanstyle="color: blue;">use_mave</span>, whether to apply MAVE [@mave_xia_2002; @mave_wang_2008] or cumulative sliced inverse regression method [@slice_zhu_2010] to estimate initial value of the coefficients, default to TRUE.
140
140
* Truncation methods: <spanstyle="color: blue;">simple_trunc</span>=TRUE to apply quantile truncation of $\frac{1}{\pi_{t, r}(X)}$ and $\frac{1}{\eta_m(X, r, t)}$, <spanstyle="color: blue;">simple_trunc</span>=FALSE to apply tuning-free Huberization procedure [@tuning_wang_2021] to influence functions. If <spanstyle="color: blue;">simple_trunc</span>=TRUE, specify quantile truncation by setting <spanstyle="color: blue;">quant</span> from 0 to 1.
Here is an example of inferences for $\psi_1(\widetilde{P}; \gamma_1)$, $\psi_{1, 0}(\widetilde{P}; \gamma_1)$ and $\psi_{1, 1}(\widetilde{P})$ under $\gamma_1=0, 0.5$, using 5-fold sample splitting, influence function truncation procedure and specific single index model settings.
0 commit comments