← All posts

How to Think Causally in Clinical Research — The Counterfactual + DAG Blueprint

Clinical Epidemiology ResearchUniqcret doctor knowledgesMethodology and Research DesignEtiology [Methodology]

“Causation isn’t just about what happened—it’s about what would have happened instead.”

🎯 The Clinical Dilemma

Your patient, Mr. G, age 50, has stage IV lung cancer. He receives an herbal treatment and survives.

Question: Did the herb save him?

Or would he have survived anyway?

That’s the counterfactual question. And it’s the gold standard for causal inference in medicine.


🔄 1. Association ≠ Causation

Let’s separate noisy thinking from causal clarity.

ConceptMeaning
AssociationKnowing X helps predict Y
CausationChanging X will change Y

Example:

“Patients who take Vitamin D have better COVID outcomes.”That’s association.

“Vitamin D improves COVID outcomes.”That’s a causal claim—and it needs counterfactual proof.


🧬 2. The Counterfactual Logic

Key Concept:

A causal effect = Observed outcome minus Counterfactual outcome

But we never observe both for the same person. You only see one timeline: treated or untreated.

So, how do we estimate the missing outcome?


🧪 3. Approximating the Counterfactual: Your Toolbox

We create “what-if” worlds using:

ToolFunction
RCTsCreate two comparable worlds via randomization
MatchingPair treated patients with similar untreated ones
StratificationCompare within subgroups (e.g., ECOG = 2)
RegressionAdjust for confounders statistically
Propensity ScoresBalance groups based on treatment likelihood
DAGsShow where bias might sneak in (more below)


🔁 4. The DAG: Your Bias-Detecting Blueprint

What is a DAG?

A Directed Acyclic Graph is a visual map of your assumptions:

✍️ Example DAG: Does the Herbal Drug Help?

Age → Herbal Treatment → Survival ↘-------------------------↗

🛑 Don't Adjust for These:

RoleRuleExample
ConfounderAdjust ✅Age, ECOG
MediatorDon’t adjust if estimating total effect ❌Side effects
ColliderNever adjust ❌ER admission caused by both severity and treatment

🔍 Secret Insight: Adjusting for colliders introduces false associations.DAGs protect you from this silent sabotage.


🎲 5. Mr. G’s Real-World Counterfactual

You want to answer:

Would Mr. G have survived without the herbal treatment?

Since we can’t observe that:

If survival differs → credible evidence for causality.


🔄 6. Time Matters: Temporality

Causal claims demand:

If treatment happens after improvement, it can’t be the cause.


⚔️ 7. Confounding, Indication Bias & Comparability

Real-world mess:

How to beat this:


📘 Occurrence Equation for Causal Studies

Causal Structure Example

Causal Question Structure

\( Y = f(X \mid \text{confounders + bias + random error}) \)

Example

\( \text{Survival} = f(\text{Herbal Treatment} + \text{Age, ECOG} \mid \text{Confounders}) \)


🧠 Summary Table

ElementFunctionCaution
CounterfactualThe unseen alternativeEstimate, don’t assume
DAGBias mapBuild before analyzing
ConfounderAdjust ✅Blocks bias
MediatorAdjust ❌ if estimating the total effect 
ColliderNever adjust ❌Causes a spurious association
RCTGold standardRandomizes counterfactual

✅ Final Takeaways