Handling Missing Repeated Outcome Measurements in Clinical Research: Models, Myths, and Best Practices [Multiple imputation, MI]
- Mayta
- May 31
- 4 min read
Introduction
Repeated outcome measurements are common in longitudinal clinical studies—tracking symptoms, biomarkers, or functional scores across time. However, incomplete follow-up is equally common, creating analytical challenges that can distort treatment effects, underestimate variability, or introduce bias.
Handling these missing repeated measures requires more than plugging gaps—it calls for model-based strategies that respect time structure, subject-level correlations, and assumptions about missingness. This article explores and contrasts key approaches: complete case analysis, Last Observation Carried Forward (LOCF), linear mixed-effects modeling, and Multiple Imputation (MI), highlighting best practices and trade-offs.
Understanding Repeated Measures Data Structure
Wide vs Long Format
Wide format stores each repeated measure as a separate column (e.g., QoL0, QoL12, QoL24).
Long format stacks timepoints vertically with identifiers for subject and time (e.g., id, visit, QoL).
Long format is more suitable for modeling techniques that account for time trends and intra-individual correlation.
📌 Clinical Tip: Always reshape into long format before applying multilevel models or MI packages designed for longitudinal data.
Method 1: Complete Records Analysis
How It Works
Retains only individuals with no missing values at any measurement time point.
Simplifies analysis but leads to biased estimates when data are not Missing Completely at Random (MCAR).
Limitations
Information loss: High attrition of sample size, especially in longer follow-up studies.
Selection bias: Patients who complete all follow-ups may differ systematically (e.g., healthier, more adherent).
Example: In a 12-month follow-up study with 4 assessments, only 30% of patients may have complete data. Analyzing only these individuals may inflate treatment effects if the healthiest patients are overrepresented.
Method 2: Last Observation Carried Forward (LOCF)
How It Works
Fills in missing values by repeating the last available observation.
Assumes stability of the measured outcome over time beyond the last known value.
Pitfalls
Unrealistic assumption: Clinical trajectories (e.g., recovery, disease progression) rarely remain static.
Variance distortion: Artificially reduces variability, leading to overconfident confidence intervals.
Bias risk: Can either under- or overestimate effects depending on outcome trends.
⚠️ LOCF may be tempting due to simplicity, but it is neither conservative nor liberal—just misleading under most conditions.
Method 3: Linear Mixed-Effects Models (LMMs)
Why It Works
LMMs account for:
Intra-individual correlation (e.g., repeated scores within each patient),
Unequal time intervals,
Missingness under MAR assumptions—valid if missingness depends only on observed data.
Statistical Mechanics
Estimate fixed effects (population-level trends) and random effects (subject-level deviations).
Fit via Maximum Likelihood (ML) or Restricted Maximum Likelihood (REML).
Key Feature: LMMs use all available data—patients with partial follow-up contribute information without being dropped.
🔍 Secret Insight: A landmark study showed that using MI in addition to LMMs yielded no extra benefit for outcome estimation if the model was already properly specified.
Method 4: Multiple Imputation for Repeated Measures
Expanding the Toolkit
Multiple Imputation (MI) can also be applied to longitudinal data, but requires extra caution:
Format: Long format is preferred.
Structure: Time dependency and within-subject clustering must be addressed.
Challenge: MI under independent assumptions ignores the within-subject correlation.
Advanced Package: mimix in Stata
Based on mi impute mvn, the mimix package imputes time-series structured missing outcomes in long format.
Allows imputation across treatment arms, enabling sensitivity analysis under alternative missingness assumptions.
Scenarios Supported
Missing at Random (MAR) – Default; assumes dropout unrelated to unmeasured values after conditioning on observed data.
Jump to Reference (J2R) – Imputes missing values based on control arm trajectory post-dropout.
Copy Reference (CR) – Assumes treatment effect ceases and the subject follows the reference group thereafter.
Copy Increments in Reference (CIR) – Adds the average increment in the control group to the last observed value.
Last Mean Carried Forward (LMCF) – Assumes flat trajectory at the last group-level mean.
📊 Clinical Application: In an asthma trial, MAR-based MI projected optimistic FEV1 recovery in the treatment arm, while J2R and CIR suggested a more conservative trajectory—critical when interpreting treatment durability.
Comparing the Approaches: Strengths & Caveats
Method | Uses All Data | Respects Time | Handles Correlation | Assumption Robustness | Ideal For |
Complete Records | ❌ | ❌ | ❌ | MCAR only | Preliminary checks |
LOCF | ❌ | ❌ | ❌ | Implausible | Rare cases (e.g., plateau outcomes) |
LMM | ✅ | ✅ | ✅ | MAR | Primary analysis |
MI (mimix, etc.) | ✅ | ✅ | ✅ (if configured) | MAR + sensitivity | Sensitivity checks, policy impact |
Conclusion
Handling missing repeated outcome measurements isn’t just a technical necessity—it’s a design integrity safeguard. Choosing the right approach depends on:
Your assumptions about missingness (MCAR vs MAR vs MNAR),
The complexity of your longitudinal structure, and
The inferential weight of your missing data.
Linear mixed-effects models should be your primary go-to for analysis. MI methods—especially mimix—are powerful for sensitivity scenarios, allowing you to probe how results might shift under different assumptions about dropout behavior.
Key Takeaways
Complete case and LOCF are inadequate in most longitudinal clinical settings.
LMMs offer robust handling of partial data under MAR.
Multiple Imputation, when used correctly, can strengthen sensitivity analyses and clinical interpretations.
mimix enables scenario modeling (e.g., Jump to Reference) that reflects patient behavior post-protocol deviation.
コメント