Cox partial likelihood re-derived as normal equation of GLM

Bypassing risk-set construction and event conditioning arguments

Published

February 16, 2025

The Cox proportional hazards model is a popular semiparametric regression model in survival analysis. Its partial likelihood offers a clever way to filter out the nonparametric baseline function while keeping focus on the regression coefficients (log-hazard ratios).

The partial likelihood was originally derived through careful construction of risk sets and conditioning arguments specific to the survival context. However, with a bit of handwaving, I’ll show that it can be re-derived, mechanically, as the “normal equation” of a generalized linear model (GLM).

Normal equations of GLMs

Consider a GLM for response Y against covariates Z: (1)E(YZ)=μ(β0+βTZ) through some mean function μ(). The normal equation is defined as (2)i=1n(1Zi){Yiμ(β0+βTZi)}=0. This is also the score equation for a GLM with a canonical link function. Solving (2) gives estimates of β0 and β.

  • Linear regression: μ(x)=x, with (2) leading to standard least squares.
  • Logistic regression: μ(x)=exp(x)/{1+exp(x)}, with (2) corresponding to score equation for MLE.

Cox model as a GLM

Model specification

Let T be the survival time of interest. The Cox model specifies (3)λ(tZ)=λ0(t)exp(βTZ), where λ(tZ) is the conditional hazard of T given Z. However, this seems a bit far from the mean model formulation of (1).

Reformulation as a mean model

Let’s find a mean model implied by (3). With censoring time C, we observe X=min(T,C) and δ=I(TC). Write N(t)=I(Tt,δ=1).

Consider dN(t)=N(t)N(t) as a binary response indicating an event observed at t. Then (3) implies E{dN(t)Xt,Z}=I(Xt)exp(βTZ)λ0(t)dt=I(Xt)exp{β0(t)+βTZ}. where exp{β0(t)}=λ0(t)dt.

Now, we have a formulation similar to (1) with μ(x)=I(Xt)exp(x), though the intercept β0(t) is time-varying.

Normal equation with time-varying intercept

Therefore, (2) becomes (4)i=1n(1Zi)[dNi(t)I(Xit)exp{β0(t)+βTZi}]=0.

Use the first-line equation in (4) to solve for the “intercept” exp{β^0(t)}=i=1ndNi(t)i=1nI(Xit)exp(βTZi). Plugging it back to the left-hand side of (4):

i=1nZidNi(t)exp{β^0(t)}i=1nZiI(Xit)exp(βTZi)=i=1nZidNi(t)i=1ndNi(t)i=1nI(Xit)exp(βTZi)i=1nZiI(Xit)exp(βTZi)=i=1n{Zij=1nI(Xjt)exp(βTZj)Zjj=1nI(Xjt)exp(βTZj)}dNi(t) Integrating this over t yields precisely the partial-likelihood score function.

Conclusion

The Cox model can be reframed as a GLM for a binary event-indicator against an exponentially linked linear predictor with a time-dependent intercept. In this view, the partial likelihood score function aligns exactly with the normal equation of the GLM. This connection offers a new perspective on the Cox model and its estimation.

Footnotes

  1. Whitehead (1980) presented a similar derivation (Whitehead, J. “Fitting Cox’s Regression Model to Survival Data Using GLIM.” Journal of the Royal Statistical Society Series C: Applied Statistics 29 (3): 268. https://doi.org/10.2307/2346901.)↩︎