Question 1

What is a chi-squared test used for in A/B testing?

Accepted Answer

A chi-squared test compares the conversion rates (or any success/failure proportions) between two groups to determine if the difference is statistically significant. In A/B testing, it tells you whether your variant's conversion rate is genuinely different from the control, or if the observed difference could be due to random chance.

Question 2

When should I use a chi-squared test vs. a t-test?

Accepted Answer

Use a chi-squared test when your metric is a proportion or rate — things like conversion rate, click-through rate, signup rate, or any yes/no outcome. Use a t-test when your metric is a continuous number — things like revenue per user, time on page, average order value, or session duration.

Question 3

What does the p-value mean in a chi-squared test?

Accepted Answer

The p-value is the probability of observing a difference as large as (or larger than) the one in your data, assuming there is no real difference between the groups. A small p-value (typically below 0.05) suggests the difference is unlikely to be due to chance alone, and you can reject the null hypothesis that the groups are the same.

Question 4

What is Cohen's h and why does it matter?

Accepted Answer

Cohen's h is a measure of effect size for comparing two proportions. While the p-value tells you whether a difference is statistically significant, Cohen's h tells you whether it is practically meaningful. A small h (around 0.2) indicates a small effect, 0.5 is medium, and 0.8 or above is large. A statistically significant result with a tiny effect size may not be worth acting on.

Question 5

What are the assumptions of the chi-squared test?

Accepted Answer

The chi-squared test assumes that observations are independent (each user is counted only once), that you are comparing categorical outcomes (success/failure), and that the expected count in each cell of the contingency table is at least 5. When expected counts are too low, the chi-squared approximation becomes unreliable and Fisher's exact test should be used instead.

Question 6

What is a Wilson score confidence interval?

Accepted Answer

The Wilson score interval is a method for calculating confidence intervals for proportions that performs better than the basic normal approximation, especially with small sample sizes or proportions near 0% or 100%. This calculator uses Wilson score intervals because they provide more accurate coverage than the simpler Wald interval.

	Success	Failure
Group 1	a	b
Group 2	c	d

How the Chi-Squared Test Works for A/B Testing

The Chi-Squared Formula

Worked Example

Understanding the Results

P-Value

Confidence Intervals

Effect Size (Cohen's h)

When the Chi-Squared Test Breaks Down

Frequently Asked Questions