📊 Medical Statistics — From Population to Inference

An interactive guide built around clinical examples. Every concept is illustrated with data your patients will actually generate — blood pressure, HbA1c, drug trials.

For medical students & junior clinicians

The Population N = 5,000 patients

🏥 Clinical Context

You are a researcher studying systolic blood pressure (SBP) in adult hypertensive patients registered at hospitals across the country. The full population is 5,000 patients. Each dot below is one patient.

In practice, you can never measure everyone — too many patients, too little time and budget. This is why we sample. Choose a different variable to see how real clinical distributions look.

Clinical variable

The population has true, fixed values called parameters — denoted by Greek letters: μ (mu) = true mean, σ (sigma) = true standard deviation.
These are the "ground truth" we are always trying to estimate.

Population Parameters the ground truth

🏥 Clinical Context

We can see the true parameters here because we generated this simulated population. In a real study, μ is unknown — if we already knew the true mean BP of every hypertensive patient in the country, there would be nothing to study.

Parameter = a numerical summary of the entire population.
μ = true population mean | σ = true population SD | N = population size

Parameters are fixed constants — they do not change when you repeat a study. Only our estimates of them change.

Random Sampling your study cohort

🏥 Clinical Context

You design an RCT and recruit n patients from the population. Orange dots = enrolled patients. Blue dots = patients not in your study. Adjust n and see how coverage changes.

Sample size (n)50

Random sampling gives every patient an equal chance of enrolment — this prevents selection bias.
In clinical trials, this is achieved through randomisation. Without it, your sample may systematically differ from the population (e.g. only sicker patients presenting to your clinic).

Sample Statistics vs Population Parameters estimation

🏥 Clinical Context

You measure SBP in your recruited patients and compute the sample mean x̄. This is your estimate of the true population mean μ. Every published clinical trial does exactly this.

Sample size (n)50

Population — true parameters

Sample (n=50) — estimates

x̄ ≈ μ sample mean estimates true mean | s ≈ σ sample SD estimates true SD
Larger n → better estimates. This is why underpowered trials (tiny n) are unreliable — their x̄ can be far from μ by chance alone.

Sampling Error why studies disagree

🏥 Clinical Context

Two hospitals run identical studies on the same antihypertensive drug — same protocol, same population — yet report slightly different mean SBP reductions. This is not researcher error. It is sampling error: random variation between samples.

Sample size (n)30

Each line = one study's sample mean. Dashed orange = true population μ.
Standard Error (SE) = σ / √n — quantifies how much x̄ varies across studies.
Increase n: notice the dots cluster tighter around μ. Bigger trials = less sampling error.

Central Limit Theorem why we can use t-tests on non-normal data

🏥 Clinical Context

HbA1c values in a diabetic population are right-skewed — not normally distributed. Yet clinical trials still use t-tests on HbA1c. How? Because the CLT guarantees that the distribution of sample means is approximately normal, even when the raw data are not.

Sample size (n)5

Number of simulated studies1000

Original population (skewed)

Distribution of sample means x̄

CLT: as n increases, the distribution of x̄ approaches Normal(μ, σ²/n) — regardless of the original distribution's shape.
Rule of thumb: n ≥ 30 is usually sufficient. This underpins nearly every parametric test you will use in clinical research (t-test, ANOVA, regression).

Confidence Intervals what every paper reports

🏥 Clinical Context

The SPRINT trial reported that intensive BP control reduced SBP by 14.8 mmHg (95% CI: 14.3 – 15.3). You read this in every paper — but what does it actually mean?

Sample size (n)40

Confidence level

Formula: x̄ ± z × (σ/√n) where z = 1.96 for 95% CI

Correct interpretation: "If we repeated this study 100 times, ~95 of the resulting intervals would contain the true μ."
Common misconception: It does NOT mean "there is a 95% probability that μ lies in this interval." μ is fixed — it either is or isn't in the interval. The probability refers to the procedure, not this particular interval.

A narrow CI = more precise estimate = larger n or smaller σ. When you see wide CIs in a paper, the study was likely underpowered.

Hypothesis Testing & the p-value the most misunderstood concept in medicine

🏥 Clinical Context

You run an RCT: Drug A vs Placebo for SBP reduction. After 6 months, Drug A group has a lower mean SBP. But could this difference be due to chance alone? This is what hypothesis testing answers.

H₀ — Null Hypothesis

The drug has no effect. Any observed difference is due to chance alone.
"Drug A does not change SBP."

H₁ — Alternative Hypothesis

The drug has a real effect on the population.
"Drug A reduces mean SBP."

True drug effect (mmHg)5

Sample size (n per group)40

p-value = probability of observing a difference this large (or larger) if H₀ were true.

p < 0.05 → "statistically significant" — we reject H₀ (by convention).
p ≥ 0.05 → we fail to reject H₀ — insufficient evidence for an effect.

⚠️ p < 0.05 does NOT mean:
  • The drug is clinically important (a 1 mmHg reduction can be "significant" with n=10,000)
  • H₀ is true with 95% probability
  • The result will replicate

Always report effect size + CI + p-value together.

Parametric vs Non-parametric Tests when normal rules break down

🏥 Clinical Context

Many clinical variables are not normally distributed: length of hospital stay, pain scores (1–10), cytokine levels, tumour sizes. Using a t-test on these violates its assumptions. Non-parametric tests make no assumption about the distribution shape — they work on ranks instead of raw values.

Choose a distribution shape and sample size below. See how the two groups compare visually, and watch which test gives the right answer.

Data type

n per group20

True effect1.0

Group A (Control)

Group B (Treatment)

📋 Which test should you use?

Situation	Parametric test	Non-parametric alternative	When to use non-param

⚠️ Common mistake in clinical papers: Using a t-test on pain scores, Likert scales, or heavily skewed lab values without checking normality. Always plot your data first — a histogram or Q-Q plot reveals the shape.

Rule of thumb: If n ≥ 30 per group, CLT saves you even for skewed data (use t-test). If n < 30 and data are clearly non-normal → use non-parametric. Always use non-parametric for ordinal data regardless of n.

© Ana Radulović — All rights reserved.
This educational material, including all concepts, structure, content, and interactive design, is the intellectual property of Ana Radulović. Unauthorised use, reproduction, distribution, or publication of this material in any form — digital or printed — without explicit written permission from the author constitutes a violation of copyright law and may result in legal consequences under applicable national and international intellectual property legislation.