Package 'truncAIPW' reference manual

Title:	Doubly Robust Estimation under Covariate-Induced Dependent Left Truncation
Description:	Doubly robust estimation for the mean of an arbitrarily transformed survival time under covariate-induced dependent left truncation and noninformative right censoring. The functions truncAIPW(), truncAIPW_cen1(), and truncAIPW_cen2() compute the doubly robust estimators under the scenario without censoring and the two censoring scenarios, respectively. The package also contains three simulated data sets 'simu', 'simu_c1', and 'simu_c2', which are used to illustrate the usage of the functions in this package. Reference: Wang, Y., Ying, A., Xu, R. (2022) "Doubly robust estimation under covariate-induced dependent left truncation" <arXiv:2208.06836>.
Authors:	Yuyao Wang [cre, aut], Andrew Ying [aut], Ronghui Xu [aut]
Maintainer:	Yuyao Wang <[email protected]>
License:	GPL-3
Version:	1.0.1
Built:	2025-03-28 03:19:19 UTC
Source:	https://github.com/cran/truncAIPW

Estimate the Conditional CDF of the Event Time given Covariates

Description

Estimate the conditional cumulative distribution function (CDF) of the event time given covariates evaluated at given time points. The options implemented in this function are: Cox proportional hazards regression using function coxph() from R package ‘survival’, and the hazard model with penalized splines using function survPen() from R package ‘survPen’.

Usage

F_est(
  dat.fit,
  dat.est = dat.fit,
  time.eval,
  model,
  time.name,
  Q.name,
  event.name,
  cov.names,
  trim = 0,
  formula.survPen = NA
)
F_est(
  dat.fit,
  dat.est = dat.fit,
  time.eval,
  model,
  time.name,
  Q.name,
  event.name,
  cov.names,
  trim = 0,
  formula.survPen = NA
)

Arguments

`dat.fit`	data frame that is used to fit the model for the full data conditional distribution of the event time given the covariates.
`dat.est`	data frame that contains the subjects for which the estimated conditional CDF is computed.
`time.eval`	vector of time points at which the conditional CDF is evaluated.
`model`	method used to estimate the conditional CDF. The options available are "Cox" and "spline", corresponding to Cox proportional hazards regression using function `coxph()` from R package ‘survival’, and the hazard model with penalized splines using function `survPen()` from R package ‘survPen’, respectively.
`time.name`	name of the event time variable.
`Q.name`	name of the left truncation time variable.
`event.name`	name of the event indicator.
`cov.names`	vector of the names of covariates.
`trim`	constant for bounding the estimated conditional CDF from 1.
`formula.survPen`	the formula when applying the hazard model with penalized splines implemented in `survPen::survPen`.

Value

F_est() returns a matrix of the estimated conditional CDF for subjects in 'data.est' evaluated at the time points in the vector 'time.eval'. Each row corresponds to a subject and each column corresponds to a time point. The column names of the matrix are the times in 'time.eval'.

Examples

data("simu")
u = c(1, 1.5, 2, 2.5, 3, 3.5, 4)
Fuz.mx = F_est(simu, simu[1:10,], u, "Cox", "time", "Q", "delta", c("Z1","Z2"))
data("simu")
u = c(1, 1.5, 2, 2.5, 3, 3.5, 4)
Fuz.mx = F_est(simu, simu[1:10,], u, "Cox", "time", "Q", "delta", c("Z1","Z2"))

Estimate the Conditional CDF for the Left Truncation Time given Covariates

Description

Estimate the conditional cumulative distribution function (CDF) of the left truncation time given covariates evaluated at given time points. The options implemented in this function are: Cox proportional hazards regression using function coxph() from R package ‘survival’, and the hazard model with penalized splines using function survPen() from R package ‘survPen’.

Usage

G_est(
  dat.fit,
  dat.est = dat.fit,
  time.eval,
  model,
  time.name,
  Q.name,
  event.name,
  cov.names,
  trim = 0,
  weights = rep(1, nrow(dat.fit)),
  formula.survPen = NA
)
G_est(
  dat.fit,
  dat.est = dat.fit,
  time.eval,
  model,
  time.name,
  Q.name,
  event.name,
  cov.names,
  trim = 0,
  weights = rep(1, nrow(dat.fit)),
  formula.survPen = NA
)

Arguments

`dat.fit`	data frame that is used to fit the model for the full data conditional distribution of the event time given the covariates.
`dat.est`	data frame that contains the subjects for which the estimated conditional CDF is computed.
`time.eval`	vector of time points at which the conditional CDF is evaluated.
`model`	method used to estimate the conditional CDF. The options available are "Cox" and "spline", corresponding to Cox proportional hazards regression using function `coxph()` from R package ‘survival’, and the hazard model with penalized splines using function `survPen()` from R package ‘survPen’, respectively.
`time.name`	name of the event time variable.
`Q.name`	name of the left truncation time variable.
`event.name`	name of the event indicator.
`cov.names`	vector of the names of covariates.
`trim`	constant for bounding the estimated conditional CDF from 0.
`weights`	vector of case weights.
`formula.survPen`	the formula when applying the hazard model with penalized splines implemented in `survPen::survPen`.

Value

G_est() returns a matrix of the estimated conditional CDF for subjects in 'data.est' evaluated at the time points in the vector 'time.eval'. Each row corresponds to a subject and each column corresponds to a time point. The column names of the matrix are the times in 'time.eval'.

Examples

data("simu")
v = c(0.5, 1, 1.5, 2, 2.5, 3)
Gvz.mx = G_est(simu, simu[1:10,], v, "Cox", "time", "Q", "delta", c("Z1","Z2"))
data("simu")
v = c(0.5, 1, 1.5, 2, 2.5, 3)
Gvz.mx = G_est(simu, simu[1:10,], v, "Cox", "time", "Q", "delta", c("Z1","Z2"))

A Simulated Data Set under Left Truncation but No Right Censoring

Description

A simulated data set under left truncation but no right censoring. The left truncation time and the event time are dependent via the two covariates ‘Z1’ and ‘Z2’. Under the data generating mechanism, the conditional distribution of the event time given covariates follows a Cox proportional hazards model in the full data, and the conditional distribution of the left truncation time given covariates follows a Cox proportional hazards model on the reversed time scale in the full data. The truncation rate is 29.5%; and the truth $P^*(T>3) = 0.5755753$ .

Usage

data(simu)
data(simu)

Format

A data frame with 500 subjects (rows) and 5 variables (columns).

time: the event time
Q: the left truncation time
delta: the event indicator
Z1: a continous covariate
Z2: a binary covariate

Source

Simulated

Examples

data(simu)
data(simu)

A Simulated Data Set under Left Truncation and Right Censoring where Censoring can be before Truncation

Description

A simulated data set under left truncation and right censoring where censoring can be before left truncation. The left truncation time and the event time are dependent via the two covariates ‘Z1’ and ‘Z2’. Under the data generating mechanism, the conditional distribution of the censored event time X given covariates follows a Cox proportional hazards model in the full data, and the conditional distribution of the left truncation time given covariates follows a Cox proportional hazards model on the reversed time scale in the full data. The truncation rate is 29.5%; the censoring rate is 16.5%; $P^*(C<Q) = 0.6057$ ; and the truth $P^*(T>3) = 0.623955$ .

Usage

data(simu_c1)
data(simu_c1)

Format

A data frame with 500 rows and 5 variables.

X: the censored event time X = min(T,C)
Q: the left truncation time
delta: the event indicator
Z1: a continous covariate
Z2: a binary covariate

Source

Simulated

Examples

data(simu_c1)
data(simu_c1)

A Simulated Data Set under Left Truncation and Right Censoring where Censoring is always after Truncation

Description

A simulated data set under left truncation and right censoring where censoring is always after left truncation. The left truncation time and the event time are dependent via two covariates ‘Z1’ and ‘Z2’. Under the data generating mechanism, the conditional distributions of the event time given covariates follows a Cox model in the full data, and the conditional distribution of the left truncation time given covariates follows a Cox model on the reversed time scale in the full data. The truncation rate is 29.5%; the censoring rate is 27.1%; and the truth $P^*(T>3) = 0.576547$ .

Usage

data(simu_c2)
data(simu_c2)

Format

A data frame with 500 rows and 5 variables.

X: the censored event time X = min(T,C)
Q: the left truncation time
delta: the event indicator
Z1: a continous covariate
Z2: a binary covariate

Source

Simulated

Examples

data(simu_c2)
data(simu_c2)

Doubly Robust Estimation under Covariate-induced Dependent Left Truncation and No Censoring

Description

Doubly robust estimation for the mean of an arbitrarily transformed survival time under covariate-induced dependent left truncation and no right censoring.

Usage

truncAIPW(dat, nu, Fuz.mx, Gvz.mx, T.name, Q.name, trim = 1e-07)
truncAIPW(dat, nu, Fuz.mx, Gvz.mx, T.name, Q.name, trim = 1e-07)

Arguments

`dat`	data frame that contains the data for constructing the estimating equation.
`nu`	transformation that defines the parameter of interest.
`Fuz.mx`	matrix for the estimated conditional CDF of the event time given covariates. Each row corresponds to a subject, and each column corresponds to a time point. The column names of the matrix are the time points. See `F_est` for an example of computing this conditional CDF matrix.
`Gvz.mx`	matrix for the estimated conditional CDF of the truncation time given covariates. Each row corresponds to a subject, and each column corresponds to a time point. The column names of the matrix are the time points. See `G_est` for an example of computing this conditional CDF matrix.
`T.name`	name of the event time variable.
`Q.name`	name of the left truncation time variable.
`trim`	constant that is used to bound from below for the denominators involved in the computation.

Value

truncAIPW() returns a list of estimators (‘dr’, ‘IPW.Q’, ‘Reg.T1’, ‘Reg.T2’), and the model-based standard errors for the ‘dr’ and ‘IPW.Q’ estimators.

`dr`	doubly robust estimator ‘dr’.
`IPW.Q`	inverse probability of truncation weighted estimator ‘IPW.Q’.
`Reg.T1`	regression based estimator ‘Reg.T1’.
`Reg.T2`	regression based estimator ‘Reg.T2’.
`SE_dr`	standard error of the ‘dr’ estimator based on the efficient influence function.
`SE_IPW.Q`	standard error of the ‘IPW.Q’ estimator computed from the robust sandwich variance estimator assuming the truncation weights are known.

References

Wang, Y., Ying, A., Xu, R. (2022) "Doubly robust estimation under covariate-induced dependent left truncation" <arXiv:2208.06836>.

Examples

data("simu")
nu <- function(t){ return(as.numeric(t>3)) }
u = c(min(simu$time)-1e-10, sort(simu$time), max(simu$time)+1e-10)
v = c(min(simu$Q)-1e-10, sort(simu$Q), max(simu$Q)+1e-10)
Fuz.mx = F_est(simu, simu, u, "Cox", "time", "Q", "delta", c("Z1","Z2"))
Gvz.mx = G_est(simu, simu, v, "Cox", "time", "Q", "delta", c("Z1","Z2"))

est = truncAIPW(simu, nu, Fuz.mx, Gvz.mx, "time", "Q", trim = 1e-7)
est
data("simu")
nu <- function(t){ return(as.numeric(t>3)) }
u = c(min(simu$time)-1e-10, sort(simu$time), max(simu$time)+1e-10)
v = c(min(simu$Q)-1e-10, sort(simu$Q), max(simu$Q)+1e-10)
Fuz.mx = F_est(simu, simu, u, "Cox", "time", "Q", "delta", c("Z1","Z2"))
Gvz.mx = G_est(simu, simu, v, "Cox", "time", "Q", "delta", c("Z1","Z2"))

est = truncAIPW(simu, nu, Fuz.mx, Gvz.mx, "time", "Q", trim = 1e-7)
est

Doubly Robust Estimation under Covariate-induced Dependent Left Truncation and Noninformative Right Censoring where Censoring can be before Left Truncation

Description

Doubly robust estimation of the mean of an arbitrarily transformed survival time under covariate-induced dependent left truncation and noninformative right censoring where censoring can be before left truncation. Inverse probability of censoring weighting is used to handle the right censoring.

Usage

truncAIPW_cen1(
  dat,
  nu,
  Fuz.mx,
  Gvz.mx,
  Sc,
  X.name,
  Q.name,
  status.name,
  trim = 1e-07
)
truncAIPW_cen1(
  dat,
  nu,
  Fuz.mx,
  Gvz.mx,
  Sc,
  X.name,
  Q.name,
  status.name,
  trim = 1e-07
)

Arguments

`dat`	data frame that contains the data for constructing the estimating equation.
`nu`	transformation that defines the parameter of interest.
`Fuz.mx`	matrix for the estimated conditional CDF of the event time given covariates. Each row corresponds to a subject, and each column corresponds to a time point. The column names of the matrix are the time points. See `F_est` for an example of computing this input matrix for the conditional CDF.
`Gvz.mx`	matrix for the estimated conditional CDF of the truncation time given covariates. Each row corresponds to a subject, and each column corresponds to a time point. The column names of the matrix are the time points. See `G_est` for an example of computing this input matrix for the conditional CDF.
`Sc`	a function for the censoring survival curve $S_c(\cdot)$ .
`X.name`	name of the censored event time variable X = min(T, C).
`Q.name`	name of the left truncation time variable.
`status.name`	name of the event time indicator.
`trim`	constant that is used to bound from below for the denominators involved in the computation.

Value

truncAIPW_cen1() returns a list of estimators (‘dr’, ‘IPW.Q’, ‘Reg.T1’, ‘Reg.T2’).

`dr`	doubly robust estimator ‘dr’.
`IPW.Q`	inverse probability of truncation weighted estimator ‘IPW.Q’.
`Reg.T1`	regression based estimator ‘Reg.T1’.
`Reg.T2`	regression based estimator ‘Reg.T2’.

References

Wang, Y., Ying, A., Xu, R. (2022) "Doubly robust estimation under covariate-induced dependent left truncation" <arXiv:2208.06836>.

Examples

library(survival)
data("simu_c1")
simu_c1$delta.1 = 1

nu <- function(t){ return(as.numeric(t>3)) }
u = c(min(simu_c1$X)-1e-10, sort(simu_c1$X), max(simu_c1$X)+1e-10)
v = c(min(simu_c1$Q)-1e-10, sort(simu_c1$Q), max(simu_c1$Q)+1e-10)

Fuz.mx = F_est(simu_c1, simu_c1, u, "Cox", "X", "Q", "delta.1", c("Z1","Z2"))
Gvz.mx = G_est(simu_c1, simu_c1, v, "Cox", "X", "Q", "delta.1", c("Z1","Z2"))

# KM curve for Sc
kmfit.C = survfit(Surv(Q, X, 1-delta)~1, data = simu_c1, type = "kaplan-meier")
Sc = stepfun(kmfit.C$time,  c(1, kmfit.C$surv))

est = truncAIPW_cen1(simu_c1, nu, Fuz.mx, Gvz.mx, Sc, "X", "Q", "delta", trim = 1e-7)
est
library(survival)
data("simu_c1")
simu_c1$delta.1 = 1

nu <- function(t){ return(as.numeric(t>3)) }
u = c(min(simu_c1$X)-1e-10, sort(simu_c1$X), max(simu_c1$X)+1e-10)
v = c(min(simu_c1$Q)-1e-10, sort(simu_c1$Q), max(simu_c1$Q)+1e-10)

Fuz.mx = F_est(simu_c1, simu_c1, u, "Cox", "X", "Q", "delta.1", c("Z1","Z2"))
Gvz.mx = G_est(simu_c1, simu_c1, v, "Cox", "X", "Q", "delta.1", c("Z1","Z2"))

# KM curve for Sc
kmfit.C = survfit(Surv(Q, X, 1-delta)~1, data = simu_c1, type = "kaplan-meier")
Sc = stepfun(kmfit.C$time,  c(1, kmfit.C$surv))

est = truncAIPW_cen1(simu_c1, nu, Fuz.mx, Gvz.mx, Sc, "X", "Q", "delta", trim = 1e-7)
est

Doubly Robust Estimation under Covariate-induced Dependent Left Truncation and Noninformative Right Censoring where Censoring is always after Left Truncation

Description

Doubly robust estimation of the mean of an arbitrarily transformed survival time under covariate-induced dependent left truncation and noninformative right censoring where censoring is always after left truncation. Inverse probability of censoring weighting is used to handle the right censoring.

Usage

truncAIPW_cen2(
  dat,
  nu,
  Fuz.mx,
  Gvz.mx,
  wd,
  X.name,
  Q.name,
  status.name,
  trim = 1e-07
)
truncAIPW_cen2(
  dat,
  nu,
  Fuz.mx,
  Gvz.mx,
  wd,
  X.name,
  Q.name,
  status.name,
  trim = 1e-07
)

Arguments

`dat`	data frame that contains the data for constructing the estimating equation.
`nu`	transformation that defines the parameter of interest.
`Fuz.mx`	matrix for the estimated conditional CDF of the event time given covariates. Each row corresponds to a subject, and each column corresponds to a time point. The column names of the matrix are the time points. See `F_est` for an example of computing this input matrix for the conditional CDF.
`Gvz.mx`	matrix for the estimated conditional CDF of the truncation time given covariates. Each row corresponds to a subject, and each column corresponds to a time point. The column names of the matrix are the time points. See `G_est` for an example of computing this input matrix for the conditional CDF.
`wd`	vector for the inverse probability of residual censoring weights $\Delta/\hat S_D(X-Q)$ .
`X.name`	name of the censored event time variable X = min(T, C).
`Q.name`	name of the left truncation time variable.
`status.name`	name of the event time indicator.
`trim`	constant that is used to bound from below for the denominators involved in the computation.

Value

truncAIPW_cen2() returns a list of estimators (‘dr’, ‘IPW.Q’, ‘Reg.T1’, ‘Reg.T2’).

`dr`	doubly robust estimator 'dr'.
`IPW.Q`	inverse probability of truncation weighted estimator 'IPW.Q'.
`Reg.T1`	regression based estimator 'Reg.T1'.
`Reg.T2`	regression based estimator 'Reg.T2'.

References

Wang, Y., Ying, A., Xu, R. (2022) "Doubly robust estimation under covariate-induced dependent left truncation" <arXiv:2208.06836>.

Examples

library(survival)
data("simu_c2")
nu <- function(t){ return(as.numeric(t>3)) }

u = c(min(simu_c2$X)-1e-10, sort(simu_c2$X), max(simu_c2$X)+1e-10)
v = c(min(simu_c2$Q)-1e-10, sort(simu_c2$Q), max(simu_c2$Q)+1e-10)

kmfit.D = survfit(Surv(X-Q, 1-delta)~1, data = simu_c2, type = "kaplan-meier")
Sd = stepfun(kmfit.D$time,  c(1, kmfit.D$surv))
wd = rep(0, nrow(simu_c2))
wd[which(simu_c2$delta == 1)] = 1/Sd(simu_c2$X - simu_c2$Q)[which(simu_c2$delta == 1)]

simu_c2$wd = wd
simu_c2.1 = simu_c2[simu_c2$delta==1,]
wd_1 = simu_c2.1$wd

Fuz.mx = F_est(simu_c2, simu_c2, u, "Cox", "X", "Q", "delta", c("Z1","Z2"))
Gvz.mx = G_est(simu_c2.1, simu_c2, v, "Cox", "X", "Q", "delta", c("Z1","Z2"), weights = wd_1)

est = truncAIPW_cen2(simu_c2, nu, Fuz.mx, Gvz.mx, wd, "X", "Q", "delta", trim = 1e-7)
est
library(survival)
data("simu_c2")
nu <- function(t){ return(as.numeric(t>3)) }

u = c(min(simu_c2$X)-1e-10, sort(simu_c2$X), max(simu_c2$X)+1e-10)
v = c(min(simu_c2$Q)-1e-10, sort(simu_c2$Q), max(simu_c2$Q)+1e-10)

kmfit.D = survfit(Surv(X-Q, 1-delta)~1, data = simu_c2, type = "kaplan-meier")
Sd = stepfun(kmfit.D$time,  c(1, kmfit.D$surv))
wd = rep(0, nrow(simu_c2))
wd[which(simu_c2$delta == 1)] = 1/Sd(simu_c2$X - simu_c2$Q)[which(simu_c2$delta == 1)]

simu_c2$wd = wd
simu_c2.1 = simu_c2[simu_c2$delta==1,]
wd_1 = simu_c2.1$wd

Fuz.mx = F_est(simu_c2, simu_c2, u, "Cox", "X", "Q", "delta", c("Z1","Z2"))
Gvz.mx = G_est(simu_c2.1, simu_c2, v, "Cox", "X", "Q", "delta", c("Z1","Z2"), weights = wd_1)

est = truncAIPW_cen2(simu_c2, nu, Fuz.mx, Gvz.mx, wd, "X", "Q", "delta", trim = 1e-7)
est

Package 'truncAIPW'

Help Index

Estimate the Conditional CDF of the Event Time given Covariates

Description

Usage

Arguments

Value

See Also

Examples

Estimate the Conditional CDF for the Left Truncation Time given Covariates

Description

Usage

Arguments

Value

See Also

Examples

A Simulated Data Set under Left Truncation but No Right Censoring

Description

Usage

Format

Source

Examples

A Simulated Data Set under Left Truncation and Right Censoring where Censoring can be before Truncation

Description

Usage

Format

Source

Examples

A Simulated Data Set under Left Truncation and Right Censoring where Censoring is always after Truncation

Description

Usage

Format

Source

Examples

Doubly Robust Estimation under Covariate-induced Dependent Left Truncation and No Censoring

Description

Usage

Arguments

Value

References

See Also

Examples

Doubly Robust Estimation under Covariate-induced Dependent Left Truncation and Noninformative Right Censoring where Censoring can be before Left Truncation

Description

Usage

Arguments

Value

References

See Also

Examples

Doubly Robust Estimation under Covariate-induced Dependent Left Truncation and Noninformative Right Censoring where Censoring is always after Left Truncation

Description

Usage

Arguments

Value

References

See Also

Examples