X DLC-M04-001 · Sampling Calculator.xlsx — Excel
FileHomeInsertPage LayoutFormulasDataReviewView
B4 ƒx =CEILING(NORMSINV(1-(1-C)/2)^2*p*(1-p)/E^2 * N/(N+NORMSINV(1-(1-C)/2)^2*p*(1-p)/E^2-1), 1)
DLC-M04-001 · Live Calculator
Sampling Calculator
Purpose: Control set sizing · Elusion sampling
You enter Calculated live Formula: two-sided normal-approximation with finite-population correction.
ABCD
1Inputs
2ParameterValueUnitNotes
3Corpus size (N)docsTotal documents in the population you're sampling from.
4Expected prevalence / richness (p)decimalBest guess at the responsive rate. Use 0.5 for maximum conservatism.
5Confidence level (C)decimalTypically 0.95 (courts have accepted 0.90 in some matters).
6Margin of error (E)decimalHalf-width of confidence interval. 0.03 = ±3%.
7Outputs
8ResultValueUnitInterpretation
9Required sample size (n)docsRandomly draw exactly this many documents for your sample.
10Expected responsive in sampledocsn × p. Your reviewers should see roughly this many responsive.
11Sample as % of corpus%Rule of thumb — anything under 5% is generally proportionate.
12z-score (from C)1.96 at 95%, 1.645 at 90%, 2.576 at 99%.
13Estimated review cost (@ $2.50 / doc)USDAdjust in-house rate; contract-review typical range $1.50–$5/doc.
14Reference · Common Sample Sizes at 95% Confidence
15Corpus size±5% MoE, p=0.5±3% MoE, p=0.5±3% MoE, p=0.1
1610,000370964376
1750,0003811,045381
18100,0003831,056382
19500,0003841,065384
201,000,0003841,066384
2110,000,0003841,067384
22Notes on Use
23For control sets in TAR 1: use richness (p) close to expected responsive rate on the seed set. Lower p = smaller required n (up to a point) but higher variance on precision estimates.
24For elusion sampling in TAR 2: p is your expected elusion rate (usually low — 0.01 to 0.05). E is how tight you need the ceiling on missed responsives.
25For validation of a GenAI pass: sample the excluded set, not the included set. p = expected false-negative rate.
26Court-tested defaults: C=0.95, E=0.03 for control sets; C=0.95, E=0.02 for high-stakes elusion sampling.
Statistical caveats. The formula assumes simple random sampling from a finite population. If you're stratifying (e.g. sampling separately by custodian), size each stratum independently. The normal approximation degrades when np < 5 or n(1−p) < 5 — for extreme prevalence use exact binomial methods.
Calculator Reference Stratified Change Log Ready · 100%
W Sampling Calculator — Printed Reference Sheet
DLC-M04-001 · Printed Reference
Sampling — Quick Reference Card
Version

Why we sample

Sampling proves — to a stated statistical confidence — that a decision made on a large corpus (which items to review, which to exclude, which to produce) is not producing systematically wrong outcomes. It replaces the alternative of reviewing everything, which is often infeasible and always expensive.

The math, in one paragraph

To estimate the responsive rate of a corpus within ±E at confidence C, draw n = (z² × p × (1−p)) / E², adjusted for finite population when n is a meaningful fraction of N. z is the standard-normal quantile at C (1.96 at 95%). p is your prior estimate of responsive rate; use 0.5 when you have no prior. The formula is symmetric — it works for any binary classification: responsive/not, privileged/not, correctly-coded/not.

Three sampling protocols we recommend

ProtocolWhenNumbers
TAR 1 Control SetPredictive coding, static trainingC=0.95, E=0.03, p=your best richness estimate. Typical n = 400–1,100.
TAR 2 Elusion (round n)Continuous Active LearningC=0.95, E=0.02, p=0.01–0.05. Typical n = 500–2,400.
Production QCBefore shipping a productionC=0.95, E=0.05, p=0.5 on responsive-tag accuracy. Typical n = 385.
Privilege QCBefore shipping a priv logC=0.99, E=0.02, p=your priv rate. Typical n = 1,000–4,000.

What to tell the court

"We sampled [n] documents randomly drawn from a population of [N]. At [C]% confidence, the observed [responsive / elusion / privilege] rate of [x]% is within ±[E]% of the true population rate. The sample was reviewed by [role] on [dates] under the same protocol as the main population."

One rule Sample once, review once, report the results — good or bad. Do not re-sample until the number looks better. That's not sampling; that's guessing.