Cohen's Kappa Calculator

Results

Cohen's kappa (κ), inter-rater agreement	0.40
Observed agreement (p_o)	70%
Expected agreement (p_e)	50%
Total observations (n)	50

What is Cohen's Kappa?

Cohen's Kappa ($\kappa$) is a statistic that measures the agreement between two raters who each classify items into mutually exclusive categories. Unlike a simple percentage of matches, kappa corrects for the agreement that would be expected purely by chance, making it a more honest measure of reliability. This calculator handles the common case of two raters and two categories (a 2x2 table).

Figure: two raters independently classify items into categories, with agreements and disagreements highlighted. Cohen's kappa measures agreement between two independent raters beyond chance.

How to use this calculator

Enter the four cell counts of your 2x2 contingency table: how many items both raters called "Yes" (a), how many Rater 1 called "Yes" but Rater 2 called "No" (b), how many Rater 1 called "No" but Rater 2 called "Yes" (c), and how many both called "No" (d). The calculator returns kappa along with the observed agreement and the chance-expected agreement.
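If your data is a pair of label lists rather than a finished table, the four counts can be tallied first. The following is a minimal sketch, not part of the calculator itself: the function name `tally_2x2`, the `positive` parameter, and the sample ratings are all hypothetical and chosen only for illustration.

```python
from typing import Sequence, Tuple


def tally_2x2(
    rater1: Sequence[str],
    rater2: Sequence[str],
    positive: str = "Yes",
) -> Tuple[int, int, int, int]:
    """Return the cell counts (a, b, c, d) for two equal-length label lists.

    a: both raters said `positive`
    b: Rater 1 said `positive`, Rater 2 did not
    c: Rater 2 said `positive`, Rater 1 did not
    d: neither rater said `positive`
    """
    if len(rater1) != len(rater2):
        raise ValueError("both raters must label the same items")
    a = b = c = d = 0
    for r1, r2 in zip(rater1, rater2):
        yes1, yes2 = r1 == positive, r2 == positive
        if yes1 and yes2:
            a += 1
        elif yes1:
            b += 1
        elif yes2:
            c += 1
        else:
            d += 1
    return a, b, c, d


# Hypothetical ratings for five items.
rater1 = ["Yes", "Yes", "No", "No", "Yes"]
rater2 = ["Yes", "No", "Yes", "No", "Yes"]
counts = tally_2x2(rater1, rater2)
print(counts)  # (2, 1, 1, 1)
```

Any label other than `positive` is treated as "No" here, which is an assumption that only suits two-category data.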

The formula explained

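Observed agreement is $p_o = (a + d) / n$, the proportion of items the raters agreed on, where $n = a + b + c + d$. Expected agreement $p_e$ is built from the marginal totals, assuming the raters classify independently: it is the chance both say Yes plus the chance both say No, $$p_e = \frac{a + b}{n}\cdot\frac{a + c}{n} + \frac{c + d}{n}\cdot\frac{b + d}{n}.$$ Kappa is then $$\kappa = \frac{p_o - p_e}{1 - p_e}.$$ A value of 1 means perfect agreement, 0 means agreement equal to chance, and negative values mean agreement worse than chance. Kappa is undefined when $p_e = 1$, which happens only when every item falls into a single agreement cell.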
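The definitions above translate fairly directly into code. The sketch below is one possible implementation, not the calculator's actual source: the names `cohen_kappa_2x2` and `KappaResult` are hypothetical, and raising an error when $p_e = 1$ is a design assumption. It is run on the counts used in the worked example on this page.

```python
from typing import NamedTuple


class KappaResult(NamedTuple):
    kappa: float
    p_o: float  # observed agreement
    p_e: float  # chance-expected agreement
    n: int      # total observations


def cohen_kappa_2x2(a: int, b: int, c: int, d: int) -> KappaResult:
    """Cohen's kappa for a 2x2 table with cells a, b, c, d as defined above."""
    if min(a, b, c, d) < 0:
        raise ValueError("cell counts must be non-negative")
    n = a + b + c + d
    if n == 0:
        raise ValueError("the table is empty")
    # Integer check for p_e == 1, which avoids a floating-point comparison.
    if (a + b) * (a + c) + (c + d) * (b + d) == n * n:
        raise ValueError("kappa is undefined when expected agreement is 1")
    p_o = (a + d) / n
    p_both_yes = ((a + b) / n) * ((a + c) / n)
    p_both_no = ((c + d) / n) * ((b + d) / n)
    p_e = p_both_yes + p_both_no
    return KappaResult((p_o - p_e) / (1 - p_e), p_o, p_e, n)


result = cohen_kappa_2x2(20, 5, 10, 15)
print(f"kappa={result.kappa:.2f} p_o={result.p_o:.2f} p_e={result.p_e:.2f} n={result.n}")
# kappa=0.40 p_o=0.70 p_e=0.50 n=50
```

Because the arithmetic is done in floating point, the raw `kappa` value can differ from 0.40 in the last digits, so it is formatted to two decimals before printing.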

Figure: a 2x2 contingency table showing the agreement cells and the formula components $p_o$ and $p_e$. The diagonal cells are the agreements used to compute observed agreement.

Worked example

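Suppose $a = 20$, $b = 5$, $c = 10$, $d = 15$, so $n = 50$. Observed agreement is $$p_o = \frac{20 + 15}{50} = 0.70.$$ Rater 1 said Yes $a + b = 25$ times and No $c + d = 25$ times; Rater 2 said Yes $a + c = 30$ times and No $b + d = 20$ times. These marginals give $$p_e = \frac{25}{50}\cdot\frac{30}{50} + \frac{25}{50}\cdot\frac{20}{50} = 0.30 + 0.20 = 0.50.$$ Therefore $$\kappa = \frac{0.70 - 0.50}{1 - 0.50} = \frac{0.20}{0.50} = 0.40,$$ which sits at the upper edge of the "fair" band (0.21–0.40) on the Landis & Koch scale.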

FAQ

How do I interpret the value? A common guide (Landis & Koch): <0 poor, 0–0.20 slight, 0.21–0.40 fair, 0.41–0.60 moderate, 0.61–0.80 substantial, 0.81–1.00 almost perfect.
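The bands above can be expressed as a small lookup. This is a sketch only: the function name `landis_koch_label` is hypothetical, and rounding kappa to two decimals before comparing is an assumption made here so that values between the listed bands (for example 0.205) land in one of them.

```python
def landis_koch_label(kappa: float) -> str:
    """Map a kappa value to the Landis & Koch descriptive band."""
    if not -1.0 <= kappa <= 1.0:
        raise ValueError("kappa must lie between -1 and 1")
    if kappa < 0:
        return "poor"
    # Assumption: the bands are stated to two decimals, so round first.
    k = round(kappa, 2)
    if k <= 0.20:
        return "slight"
    if k <= 0.40:
        return "fair"
    if k <= 0.60:
        return "moderate"
    if k <= 0.80:
        return "substantial"
    return "almost perfect"


print(landis_koch_label(0.40))  # fair
print(landis_koch_label(0.41))  # moderate
```

These labels are a convention rather than a statistical test, so the cut-offs are best treated as a rough guide.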

Why is my kappa low despite high agreement? When one category dominates, chance agreement ($p_e$) is high, so kappa can be low even with 90%+ raw agreement. This effect is known as the kappa paradox.
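A small numeric illustration may help here. The counts below are hypothetical, picked so that "Yes" dominates, and the helper `kappa_exact` is an illustrative name, not part of the calculator. It uses exact fractions so the result is free of rounding error.

```python
from fractions import Fraction


def kappa_exact(a: int, b: int, c: int, d: int):
    """Return (p_o, p_e, kappa) as exact fractions for a 2x2 table."""
    n = a + b + c + d
    p_o = Fraction(a + d, n)
    p_e = Fraction((a + b) * (a + c) + (c + d) * (b + d), n * n)
    return p_o, p_e, (p_o - p_e) / (1 - p_e)


# Hypothetical table: 100 items, both raters say "Yes" 93 times.
p_o, p_e, kappa = kappa_exact(88, 5, 5, 2)
print(f"p_o={float(p_o):.2f} p_e={float(p_e):.4f} kappa={float(kappa):.2f}")
# p_o=0.90 p_e=0.8698 kappa=0.23
```

Raw agreement is 90%, yet chance alone already accounts for about 87%, so kappa is only about 0.23.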

Can kappa be negative? Yes. Negative kappa means observed agreement is below what chance predicts, suggesting systematic disagreement.
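As a hedged illustration with hypothetical counts, take $a = 5$, $b = 20$, $c = 20$, $d = 5$, so $n = 50$ and each rater says Yes 25 times. Substituting into the same formulas used in the worked example gives:

```latex
p_o = \frac{5 + 5}{50} = 0.20, \qquad
p_e = \frac{25}{50}\cdot\frac{25}{50} + \frac{25}{50}\cdot\frac{25}{50} = 0.25 + 0.25 = 0.50,
\qquad
\kappa = \frac{0.20 - 0.50}{1 - 0.50} = \frac{-0.30}{0.50} = -0.60.
```

The raters agree on only 20% of items where chance would predict 50%, so kappa falls well below zero.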

Last updated: June 18, 2026

