Core inference · Hypothesis tests

Z-Test Calculator

Test a mean or a proportion against a hypothesis using the normal distribution. One- and two-sample z-tests, with the z statistic, a one- or two-tailed p-value, a confidence interval, an effect size and a shaded curve — plus an honest word on when a z-test is the right tool and when you should reach for a t-test instead.

Sample mean x̄

Pop. SD σ known

Null value μ₀ the value you are testing against

Alternative hypothesis

Confidence level

Result

In plain English

A z-test asks whether an observed mean or proportion is far enough from a hypothesised value that chance alone is an unconvincing explanation. It measures the gap in standard errors — that is the z statistic — and converts it into a p-value using the bell curve. It is the large-sample, known-spread cousin of the t-test.

z statistic: How many standard errors the estimate sits from the null value: (estimate − null) ∕ standard error. Bigger magnitude means stronger evidence against the null.
p-value: The probability of a z at least this extreme if the null hypothesis were true. Small p means the data would be surprising under the null — it is not the probability the null is true.
one- vs two-tailed: Two-tailed tests for any difference; one-tailed tests only for a difference in a pre-specified direction. Choose the tail before seeing the data, never after.
confidence interval: The range of null values this data would not reject, at the chosen level. If it excludes the null, the two-tailed test is significant.
z-test vs t-test: Use a z-test when the population standard deviation is genuinely known (rare) or the sample is large, and for proportions. When σ is estimated from the sample — the usual case for means — the t-test is the honest choice.

Frequently asked

When should I use a z-test instead of a t-test?

Use a z-test when the population standard deviation σ is genuinely known, or for a test of a proportion, or when the sample is large enough that the distinction barely matters (often quoted as n ≥ 30). Use a t-test when you only have the sample's own standard deviation s to estimate σ — which is almost always the case for a mean. For large n the two give nearly identical answers, because the t-distribution converges on the normal.

What is the z critical value for a 95% test?

For a two-tailed test at the 5% level the critical values are ±1.96 (more precisely ±1.95996): a z beyond that range gives p < 0.05. For a one-tailed test at 5% the critical value is 1.645. At the 1% level they are ±2.576 (two-tailed) and 2.326 (one-tailed).

How is the z-test for proportions calculated?

For one proportion, z = (p̂ − p₀) ∕ √[p₀(1 − p₀) ∕ n], using the null value p₀ in the standard error. For two proportions, the standard error uses the pooled proportion p̄ = (x₁ + x₂) ∕ (n₁ + n₂): z = (p̂₁ − p̂₂) ∕ √[p̄(1 − p̄)(1∕n₁ + 1∕n₂)]. Both rely on a large-sample normal approximation, so they need a handful of successes and failures in each group to be trustworthy.

What is the difference between a one-tailed and a two-tailed z-test?

A two-tailed test asks whether the value differs from the null in either direction and splits the significance level between both tails; a one-tailed test looks for a difference in one pre-specified direction only, putting all of α in a single tail, which makes it easier to reach significance. Choose the tail before seeing the data — switching to a one-tailed test afterwards to nudge p under 0.05 is a form of p-hacking.

Z-Test Calculator

Result

In plain English

Frequently asked

Watch out for

t-Test →

The p-hacking sandbox

p-value & critical value →