Independent measurement to justify the spend to your auditors.
You are buying detection technology with real budget attached. We give you a measurement layer that lives between you and the vendor, so the procurement decision and the renewal decision can both be defended in writing.
The decisions an independent number has to stand behind.
Detection spend is decided, renewed, and audited in writing. At each of these moments, a vendor's self-report is not enough. Here is where we plug in.
Test the short list before you sign.
Every vendor shows you their own number. We run every detector on your short list against the same attacks and conditions, so the decision rests on one comparison you did not have to take on faith. You sign knowing which one actually holds up.
The lab number vs your traffic
0.95+
the number in the brochure
The accuracy a vendor quotes from a clean lab test.
in your real traffic
What that number tells you about live performance, until it is re-tested on the footage you actually receive: nothing.
A clean lab score says nothing about live performance until it is re-tested on the footage your environment actually receives.
Read the benchmark
Everything procurement asks for is in the box.
Per-group performance
Results broken out by demographic and platform, with the worst case shown.
Both kinds of mistake
Fakes let through and real users wrongly blocked, with the margin of error on each.
Bypass recipes
Every failure annotated with the recipe that surfaced it.
Methodology, documented
Public, versioned, and signed by the lead researcher.
Platform-realistic conditions
Scored under the re-encoding your deployment actually applies.
Audit-ready exhibits
Findings packaged to hand to a board or a regulator.
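To make "both kinds of mistake" and "per-group performance" concrete, here is a minimal sketch of how the two error rates, their margins of error, and the worst-case group could be computed. The record format, the function names, and the choice of a 95% Wilson score interval are assumptions for illustration, not a description of the published methodology, and the sample data is invented.

```python
import math
from collections import defaultdict


def wilson_interval(errors, total, z=1.96):
    """Wilson score interval for an error proportion (z=1.96 is roughly 95%)."""
    if total == 0:
        return (0.0, 1.0)
    p = errors / total
    denom = 1 + z ** 2 / total
    centre = (p + z ** 2 / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z ** 2 / (4 * total ** 2)) / denom
    return (max(0.0, centre - half), min(1.0, centre + half))


def error_rates_by_group(records):
    """records: iterable of (group, is_fake, flagged_as_fake) tuples (assumed format)."""
    counts = defaultdict(lambda: {"fakes": 0, "missed": 0, "reals": 0, "blocked": 0})
    for group, is_fake, flagged in records:
        c = counts[group]
        if is_fake:
            c["fakes"] += 1
            c["missed"] += int(not flagged)   # fake let through
        else:
            c["reals"] += 1
            c["blocked"] += int(flagged)      # real user wrongly blocked
    report = {}
    for group, c in counts.items():
        report[group] = {
            "miss_rate": c["missed"] / c["fakes"] if c["fakes"] else None,
            "miss_interval": wilson_interval(c["missed"], c["fakes"]),
            "block_rate": c["blocked"] / c["reals"] if c["reals"] else None,
            "block_interval": wilson_interval(c["blocked"], c["reals"]),
        }
    return report


def worst_group(report, key="miss_rate"):
    """Group with the highest error rate for the given key, skipping empty groups."""
    scored = {g: r[key] for g, r in report.items() if r[key] is not None}
    return max(scored, key=scored.get)


# Invented sample: two groups, ten fakes and ten real users each.
sample = (
    [("group_a", True, True)] * 9 + [("group_a", True, False)] * 1
    + [("group_a", False, False)] * 10
    + [("group_b", True, True)] * 6 + [("group_b", True, False)] * 4
    + [("group_b", False, False)] * 8 + [("group_b", False, True)] * 2
)
report = error_rates_by_group(sample)
for group, row in sorted(report.items()):
    print(group, row)
print("worst case by missed fakes:", worst_group(report))
```

With samples this small the intervals are wide, which is the point of reporting the margin of error next to each rate rather than the rate alone.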
From a free benchmark to a standing red team.
Most teams start with the public benchmark, then commission an evaluation of the detection they actually run. The white-glove tier is for ongoing, tailored coverage.
Public benchmark
Free
See how the market's detectors hold up.
- Our published results on leading open-source detectors.
- Per-group breakdowns and platform-condition drops.
- A starting point for narrowing a short list.
Evaluation
Per engagement
An independent grade for one detector.
- Your model run through our full benchmark and attack suite.
- Pass, conditional, or fail across groups and conditions.
- Where it fails and the recipe that broke it, so it can be fixed.
White-glove red team
Custom
Ongoing, tailored adversarial coverage.
- A sequestered attack set built for your exact threat.
- Co-delivered alongside your red-team or security partner.
- Repeat engagements as new generators emerge.
Margen is the measurement layer between detection vendors and the buyers who depend on them.
An independent assessment layer.
Third-party, with no detection product of our own to sell.
A reproducible methodology.
Every claim is backed by a dataset, a pipeline, and a margin of error.
Adversarially honest.
We test under the hardest conditions buyers face in production.
Why not an internal team or a generalist pentest?
Both have their place. Neither is an independent adversary with a deepfake-specific corpus and per-group reporting. That gap is what we fill.
| Capability | Margen | Internal red team | Generalist pentest |
|---|---|---|---|
| Independent of the vendor under test | | | |
| Deepfake-specific attack corpus | | | |
| Per-group fairness breakdown | | | |
| Platform-realistic conditions | | | |
| Pre-registered, reproducible method | | | |
| Hands back the breaking recipe | | | |
Built for the audit committee.
Every finding ships as a dated, signed exhibit that maps to your own controls, so the procurement decision and the renewal both hold up in writing.
One measurement layer, every side of it.
Whichever side you are on, the same arms race runs underneath. See how we serve the rest of the market, or go straight to scoping your own evaluation.
Get the independent number your auditors will accept.
Tell us what you are buying or renewing. We will evaluate it against your conditions before you sign.