The mean process can be modeled flexibly, \(\mu_i = g(X_i)\), where \(g\) is some function that relates \(X_i\) to \(\mathbb{E}[Y_i].\)
A form of nonlinear regression approximates the function \(g\) using a finite basis expansion, \[g(X_i) = \alpha + \sum_{j=1}^J B_j(X_i)\beta_j,\] where \(B_j(X)\) are known basis functions and \(\beta_j\) are unknown parameters that determine the shape of \(g\).
Example: Gaussian radial basis functions: \[B_j(X_i) = \exp\left\{-\frac{|X_i - \nu_j|^2}{l^2}\right\},\] where \(\nu_j\) are centers of the basis functions and \(l\) is a common width parameter.
The number of of basis functions and the width parameter \(l\) controls the scale at which the model can vary as a function of \(X_i\).
Nonlinear regression
Example: The cubic B-spline basis function is the following piecewise cubic polynomial:
B-splines are a piecewise continuous function defined conditional on some set of knots.
Here we assume a uniform knot locations \(\nu_{j + k} = \nu_j + \delta k\).
B-splines have compact support, so the design matrix is sparse.
Nonlinear regression
Conditionally on the selected bases \(B\), the model is linear in the parameters. Hence we can write, \[Y_i = \mu_i + \epsilon_i = \mathbf{w}_i \boldsymbol{\beta} + \epsilon_i,\] with \(\mathbf{w}_i = (B_1(X_i),\ldots,B_J(X_i))\).
Model fitting can proceed as in linear regression models, since the resulting model is linear in \(\boldsymbol{\beta}\).
It is often useful to center the basis function model around the linear model, \(\mu_i = \alpha + X_i \beta + \mathbf{w}_i\boldsymbol{\beta}\).
Glaucoma is the leading cause of irreversible blindness world wide with over 60 million glaucoma patients as of 2012. Since impairment caused by glaucoma is irreversible, early detection of disease progression is crucial for effective treatment.
Patients with glaucoma are routinely followed up and administered visual fields, a functional assessment of their vision.
After each visual field test their current disease status is reported as a mean deviation (MD) value, measured in decibels (dB). A lower mean deviation indicates worse vision.
Central clinical challenges are i) identifying disease progression of MD, and ii) predicting future MD.
Glaucoma data
### Load and process data to obtain data for an example patientdat <-read.csv(file ="LongGlaucVF_20150216/VisualFields.csv")dat <- dat[order(dat$STUDY_ID, dat$SITE), ]dat$EYE_ID <-cumsum(!duplicated(dat[, c("STUDY_ID", "SITE")]))dat_pat <- dat[dat$EYE_ID =="4", ] # 4dat_pat$time <- (dat_pat$AGE - dat_pat$AGE[1]) /365dat_pat <- dat_pat[, c("time", "MD")]colnames(dat_pat) <-c("X", "Y")glimpse(dat_pat)