◇ trust & security

how we handle your data.

a short, plain-language reference for security teams and language-tech leads doing procurement diligence. user-facing walkthrough is at /how-it-works; full legal text is at /privacy and /terms.

━━ how the cleaner runs

streams your file rather than loading it into memory all at once. comfortably handles multi-gigabyte translation memories on modest hardware.

XML parsing is XXE-safe at every entry point. external entity resolution is structurally disabled — there is no fallback path that re-enables it.

deterministic output — a clean run on the same input with the same options produces a byte-identical result. removed entries are preserved alongside kept ones in a separate audit file so you can verify nothing important was cut.

script-aware comparisons — length-outlier and short-phrase rules use NFC code-point counts so combining-mark scripts (Devanagari, Thai, Khmer) and no-space scripts (CJK, Tibetan) are treated fairly. a legitimate Hindi or Japanese translation is not penalised for looking longer or shorter than its English source.

━━ what the cleaner catches

eight classifiers, individually toggleable in the rules panel and bundled by the three named presets:

━━ engine hardening

we run repeated independent code reviews — each round audits the engine end-to-end, fixes findings, then re-audits with a fresh reviewer who has no memory of prior passes. the most recent campaign closed 9 HIGH + 23 MED issues across five rounds before reaching convergence (zero new findings on the final pass).

convergence is a quality bar: if a fresh reviewer with no context finds nothing new, the surface is stable. we re-run the campaign whenever significant engine logic ships.

━━ security posture

━━ where your data lives

EU jurisdiction. compute and object storage are in EU datacenters; transactional email is sent through an EU-region provider. no data is replicated outside the EU.

no third-party analytics or trackers. no Google Analytics, no Hotjar, no advertising pixels. the only cookie this site sets is a signed session id (HttpOnly, Secure, SameSite=Lax).

full subprocessor list (compute provider, object storage, email, payments) and per-field retention policy is enumerated in our privacy policy.

━━ asking us more

doing a security or procurement review and need details that aren't on this page (DPA, audit report, sub-processor change notifications, custom retention windows)? reach out via /contact — we typically respond within one business day.