Pixelmatch Tuning Recipes

When to Use

Use this as the scenario-by-scenario reference for picking a threshold value.

Scenario	`threshold`	Notes
Pixel-perfect, same OS, same browser version	`0.1` (library default)	Catches real regressions; rare false positives
Same OS, anti-aliasing variability (Canvas/WebGL)	`0.1` with `includeAA: false` (default)	AA detector handles most of it
Cross-OS (mac vs Linux CI)	`0.2`–`0.3`	Combine with `maxDiffPixelRatio: 0.01` to absorb noise
Different font hinting / sub-pixel rendering across OSes	`0.3+`, or switch to looks-same / dssim	Pixelmatch's YIQ saturates here; perceptual SSIM is more stable
Animations, gradients, video frames	Use `maxDiffPixels` / `maxDiffPixelRatio` rather than raising threshold	Keeps regression sensitivity high while tolerating bounded noise

Start with Playwright's defaults (threshold: 0.2, no maxDiffPixels). When tests are stable, tighten:

Lower threshold to 0.15, then 0.1
Add a small maxDiffPixelRatio: 0.005 ceiling
For sub-pixel-precise components, override per-test with threshold: 0.05 and a comment

If tightening produces flakes, the env or stability controls (animations, fonts) are the real problem. Don't bandaid with looser thresholds.

Wrong: Setting threshold: 0.5+ as the default — effectively not testing
Wrong: Per-test thresholds without a // why comment — six months later the comment is the only justification
Wrong: Tightening on day one — start lax, stabilize, then tighten