← All posts

Handling Missing Repeated Outcome Measurements in Clinical Research: Models, Myths, and Best Practices [Multiple imputation, MI]

Clinical Epidemiology ResearchUniqcret doctor knowledgesData Analytics or Statistics

Introduction

Repeated outcome measurements are common in longitudinal clinical studies—tracking symptoms, biomarkers, or functional scores across time. However, incomplete follow-up is equally common, creating analytical challenges that can distort treatment effects, underestimate variability, or introduce bias.

Handling these missing repeated measures requires more than plugging gaps—it calls for model-based strategies that respect time structure, subject-level correlations, and assumptions about missingness. This article explores and contrasts key approaches: complete case analysis, Last Observation Carried Forward (LOCF), linear mixed-effects modeling, and Multiple Imputation (MI), highlighting best practices and trade-offs.


Understanding Repeated Measures Data Structure

Wide vs Long Format

Long format is more suitable for modeling techniques that account for time trends and intra-individual correlation.

📌 Clinical Tip: Always reshape into long format before applying multilevel models or MI packages designed for longitudinal data.


Method 1: Complete Records Analysis

How It Works

Limitations

Example: In a 12-month follow-up study with 4 assessments, only 30% of patients may have complete data. Analyzing only these individuals may inflate treatment effects if the healthiest patients are overrepresented.


Method 2: Last Observation Carried Forward (LOCF)

How It Works

Pitfalls

⚠️ LOCF may be tempting due to simplicity, but it is neither conservative nor liberal—just misleading under most conditions.


Method 3: Linear Mixed-Effects Models (LMMs)

Why It Works

LMMs account for:

Statistical Mechanics

Key Feature: LMMs use all available data—patients with partial follow-up contribute information without being dropped.

🔍 Secret Insight: A landmark study showed that using MI in addition to LMMs yielded no extra benefit for outcome estimation if the model was already properly specified.


Method 4: Multiple Imputation for Repeated Measures

Expanding the Toolkit

Multiple Imputation (MI) can also be applied to longitudinal data, but requires extra caution:

Advanced Package: mimix in Stata

Scenarios Supported

  1. Missing at Random (MAR) – Default; assumes dropout unrelated to unmeasured values after conditioning on observed data.
  2. Jump to Reference (J2R) – Imputes missing values based on control arm trajectory post-dropout.
  3. Copy Reference (CR) – Assumes treatment effect ceases and the subject follows the reference group thereafter.
  4. Copy Increments in Reference (CIR) – Adds the average increment in the control group to the last observed value.
  5. Last Mean Carried Forward (LMCF) – Assumes flat trajectory at the last group-level mean.

📊 Clinical Application: In an asthma trial, MAR-based MI projected optimistic FEV1 recovery in the treatment arm, while J2R and CIR suggested a more conservative trajectory—critical when interpreting treatment durability.


Comparing the Approaches: Strengths & Caveats

MethodUses All DataRespects TimeHandles CorrelationAssumption RobustnessIdeal For
Complete RecordsMCAR onlyPreliminary checks
LOCFImplausibleRare cases (e.g., plateau outcomes)
LMMMARPrimary analysis
MI (mimix, etc.)✅ (if configured)MAR + sensitivitySensitivity checks, policy impact

Conclusion

Handling missing repeated outcome measurements isn’t just a technical necessity—it’s a design integrity safeguard. Choosing the right approach depends on:

Linear mixed-effects models should be your primary go-to for analysis. MI methods—especially mimix—are powerful for sensitivity scenarios, allowing you to probe how results might shift under different assumptions about dropout behavior.


Key Takeaways

Comments

No comments yet. Be the first to share your thoughts.

Sign in to comment