Sample Size Basics

Why "n matters": stability, precision, and CLT intuition

Back

Why sample size is important

The "n ≥ 30" idea is a rule of thumb, not a law — skewed/heavy-tail data may need more.

Real Dental Scenario

Mouthwash Gingivitis Study

The question: A researcher wants to study whether a new chlorhexidine mouthwash reduces gingivitis scores (Gingival Index, scale 0–3) compared to a placebo. The true population mean reduction is 0.4 points. How does sample size affect the reliability of the estimate?

n = 10

Estimate jumps wildly between studies. One trial might show 0.1, another 0.8.

UNSTABLE
n = 50

Estimate settles down. Most studies find values between 0.25 and 0.55.

STABILIZING
n = 200

Estimate is precise. Consistently near 0.4, with a tight confidence interval.

PRECISE

Animated Sampling Demo

n=10 running mean
--
CI: --
n=50 running mean
--
CI: --
n=200 running mean
--
CI: --

Each dot is a simulated study. The shaded bands show the 95% confidence interval. Notice how it narrows as n grows.

Mini CLT demo (sample means)

Pick a population shape and sample size n. We repeatedly sample and plot the distribution of sample means.

130100
Draws: 0
Mean of means: -
SD of means: -

Distribution of sample means

As n increases, the sample-mean distribution becomes tighter and more bell-shaped.

Dental context

For outcomes like DMFT or plaque scores, skewness is common. If you plan parametric tests, consider transformations or nonparametric alternatives, and plan n accordingly.