top of page

Binreg in Stata: Odds Ratios, Risk Ratios, and Why Modified Poisson Is Preferred

  • Writer: Mayta
    Mayta
  • 3 hours ago
  • 3 min read

1. Introduction

Binary outcomes are common in clinical and epidemiological research. Examples include disease status (yes/no), mortality (dead/alive), or treatment response (success/failure). In Stata, several commands can be used to analyze binary outcomes, including logistic, binreg, and glm with different families and links.

Although these commands may appear similar, they estimate different effect measures, rely on different assumptions, and can behave very differently in practice. This article clarifies what binreg does, how it compares with alternative commands, and why researchers often choose other approaches instead.

2. What Is binreg?

binreg fits generalized linear models (GLMs) using the binomial family and allows the user to choose different link functions corresponding to different effect measures:

binreg option

Link function

Effect measure

or (default)

Logit

Odds Ratio (OR)

rr

Log

Risk Ratio (RR)

rd

Identity

Risk Difference (RD)

Importantly:

binreg always uses the binomial family.Only the link function changes.

3. binreg, or vs logistic

Mathematical equivalence

binreg y x1 x2, or
logistic y x1 x2

These two commands:

  • Fit the same model

  • Use the same likelihood

  • Produce identical estimates and standard errors

Therefore:

binreg, or is simply an alternative interface to logistic regression.

Practical implication

Because logistic is:

  • More widely recognized

  • More standard in reporting

  • Better supported in teaching materials

👉 Logistic is generally preferred over binreg, or.

4. binreg, rr: Log-Binomial Regression

What it tries to do

binreg y x1 x2, rr

This fits a log-binomial model:

log ( P ( Y = 1 ∣ X ) ) = Xβ

and reports:

RR = exp(β)

This is attractive because risk ratios are intuitive and collapsible, unlike odds ratios.

Why binreg and rr often fail

The problem is not conceptual but mathematical.

  • The log link implies

    μ = exp(Xβ)

  • But in a binomial model,

    0 ≤ μ ≤ 1

If the model attempts to estimate μ > 1, the binomial likelihood becomes invalid. As a result:

  • The optimization hits a boundary

  • The model may fail to converge

  • Estimates may be unstable or not produced

This problem is well known and occurs frequently when:

  • Outcomes are common

  • Effects are strong

  • Multiple covariates are included


5. Why Researchers Use Modified Poisson Instead

The alternative approach

glm y x1 x2, family(poisson) link(log) vce(robust) eform

This is known as modified Poisson regression.

Why this works

  • The log link still models the mean as:

    μ = exp(Xβ)

  • Poisson models do not restrict μ to be ≤ 1

  • Therefore, the model converges reliably

Although Poisson is traditionally used for count data:

  • With a binary outcome, the mean still equals the risk

  • The coefficient (\exp(\beta)) still estimates a risk ratio

The variance is misspecified, but this is corrected using robust (sandwich) standard errors.

The mean model is correct, and the inference is valid.

6. binreg, rd: Risk Difference Models

binreg y x1 x2, rd

This uses the identity link:

P ( Y = 1 ∣ X ) = Xβ

Advantages:

  • Estimates absolute risk differences

  • Collapsible and easy to interpret

Limitations:

  • Predicted probabilities can fall below 0 or above 1

  • Requires careful checking of predictions

  • Less stable in complex models

Thus, risk difference models are typically used only in simpler settings or when absolute effects are explicitly required.

7. Summary Comparison

Command

Family

Link

Effect

Stability

Recommended use

logistic

Binomial

Logit

OR

Very high

Default for binary outcomes

binreg, or

Binomial

Logit

OR

Very high

Equivalent to logistic

binreg, rr

Binomial

Log

RR

Low

Rarely recommended

glm, binomial log

Binomial

Log

RR

Low

Same issues as binreg, rr

Modified Poisson

Poisson

Log

RR

High

Preferred for RR

binreg, rd

Binomial

Identity

RD

Moderate–low

Niche use


8. Practical Guidance for Readers

  • If you want odds ratios → use logistic

  • If you want risk ratios:

    • Avoid binreg, rr in most real datasets

    • Use modified Poisson with robust SE

  • If you want risk differences → consider binreg, rd, but check predictions carefully

9. Key Take-Home Message

binreg is a flexible binomial regression framework, but its log-link (RR) implementation is often numerically unstable. For this reason, modified Poisson regression has become the preferred method for estimating risk ratios from binary outcomes in modern epidemiology.

Recent Posts

See All

Comments

Rated 0 out of 5 stars.
No ratings yet

Add a rating
Post: Blog2_Post

​Message for International and Thai Readers Understanding My Medical Context in Thailand

Message for International and Thai Readers Understanding My Broader Content Beyond Medicine

bottom of page