Syllabus

Overview

This course provides a survey of modern statistical methodology for the analysis of censored time-to-event data arising from clinical, epidemiological, sociological, and engineering studies. We use intuitive explanations of statistical theory, such as counting-process martingale, to deepen understanding of real-world problems and train problem-solving skills. To do so, we combine methodological exposition with extensive case studies, mostly drawn from health sciences research (sample R/SAS code for case studies will be provided). The overall emphasis of this course is on the application side of study design and data analysis.

Course structure

The course is divided into three parts. The first part focuses on methods for univariate event times, e.g., Kaplan—Meier curve, log-rank test, and Cox proportional hazards model. Building on this foundation, the second part expands the scope to complex outcomes such as recurrent or multivariate events, (semi-)competing risks, joint analysis of survival and longitudinal data, multistate data, composite endpoints, and so forth. Special topics for censored data from cutting-edge research areas such as causal inference and machine learning are discussed in the third part.

Learning outcomes

After taking the course, students will be able to:

  • Understand the features of censored data and their implications in statistical inference

  • Choose proper non- and semi-parametric methods for analysis of various types of data

  • Understand and check the assumptions needed for estimation and inference

  • Implement the inference procedures to solve real-world problems using statistical packages such as R (or SAS)

  • Interpret and present the analytic results clearly and coherently to answer substantive questions

Prerequisites

Students are expected to have basic knowledge in statistical concepts such as random variables, expectation, variance, and maximum likelihood estimation, and to have taken first courses in statistical hypothesis testing (e.g., t-test, ANOVA, etc.) and (generalized) linear regression models. Prior experience with R or SAS is helpful but not required.

Time and Location

MW 2:30—3:45pm; 1220 HSLC (Health Sciences Learning Center)

Instructors

Main Instructor

Lu Mao, PhD

WARF 207A, 610 Walnut St, Madison, WI 53726

Email: lmao@biostat.wisc.edu

Phone: 608-263-5674

Office Time and Location: T&Th 3–4pm, or by appointment.

Zoom link provided in syllabus on Canvas.

Teaching Assistant

Po-Kuei Chen (pchen299@wisc.edu); Office hours:

  • Monday 4:30–5:30pm (Medical Science Center (MSC), room B248)

  • Wednesday 4:30–5:30pm (Zoom link provided in syllabus on Canvas)

  • By appointment.

Readings

  • [Required] Applied Survival Analysis: From Univariate to Complex Time-to-Event Outcomes (To be posted periodically by chapter on Canvas)

  • [For methodological reader] Kalbfleisch, J. D. & Prentice, R. L. (2002). The statistical analysis of failure time data (2nd Ed). John Wiley & Sons.

  • [For applied reader] Klein, J. P. & Moeschberger, M. L. (2003). Survival analysis: techniques for censored and truncated data (2nd Ed). Springer Science & Business Media.

  • [More theoretical] Fleming, T. R. & Harrington, D. P. (1991). Counting processes and survival analysis. John Wiley & Sons.

Course Schedule

Kickoff

1/24 Lecture Overview
Reading Syllabus

Part I: Univariate Events

1/29 Lecture Introduction
Reading Chapter 1
1/31 Lecture Mathematical Foundations
Reading Chapter 2
2/5 Lecture Nonparametric Estimation of the Survival Curve
Reading Chapter 3
2/7 Lecture Comparing Survival Rates between Groups
Reading Chapter 3
2/12 Lecture The Cox Proportional Hazards Model – Assumptions and Inference
Reading Chapter 4
2/14 Lecture The Cox Proportional Hazards Model – Residual Analysis
Reading Chapter 4
2/19 Lecture The Cox Proportional Hazards Model – Time-Varying Covariates
Reading Chapter 4
2/21 Lecture Other Non- and Semi-parametric Methods
Reading Chapter 5
2/26 Lecture Study Design and Sample Size Calculation
Reading Chapter 6
2/28 Lecture Left Truncation
Reading Chapter 7
3/4 Lecture Interval Censoring
Reading Chapter 7

Part II: Complex Outcomes

3/6 Lecture Multivariate Events – Conditional (Frailty) Models
Reading Chapter 8
3/11 Lecture Multivariate Event Times – Marginal Models
Reading Chapter 8
3/13 Lecture Recurrent Event
Reading Chapter 9
3/18 Lecture Competing and Semi-competing Risks
Reading Chapter 10
3/20 Lecture Joint Analysis of Longitudinal and Survival Data
Reading Chapter 11
4/1 Lecture Multistate Models – Introduction
Reading Chapter 12
4/3 Lecture* Multistate Models – Cox-Type Markov and Semi-Markov Models
Reading Chapter 12
4/8 Lecture Composite Endpoints – Nonparametric Estimation
Reading Chapter 13
4/10 Lecture Composite Endpoints – Semiparametric Regression
Reading Chapter 13

Part III: Special Topics

4/15 Lecture Causal Inference with Censored Data – IPTW and Standardization
Reading Chapter 14
4/17 Lecture Causal Inference with Censored Data – Marginal Structural Models
Reading Chapter 14
4/22 Lecture Machine Learning with Censored Data – Variable Selection
Reading Chapter 15
4/24 Lecture Machine Learning with Censored Data – Nonlinear Regression
Reading Chapter 15
4/29 Lecture Guest Lecture (Dave DeMets or other)
5/1 Lecture Recap

* Possibly online due to the instructor attending an FDA conference (https://www.advamed.org/events/fda-advamed-mdsi-conference).

Homework and Exams

One homework every other week; one mid-term, and one data analysis final project.

Grading

15% attendance; 35% homework; 20% mid-term; 30% final project.