Precise PII Detection at Scale

Self-hosted, air-gapped PII detection with enterprise-grade security.

60+
Entity Types
15+
Supported Countries
1M+
Token Context
Python Input
import requests

# Detect sensitive data instances
response = requests.post(
    "http://localhost:8000/text/detect",
    json={
        "text": ["Hallo Matthias"],
    }
)
Output JSON
response.json()
{
  "entities": [
    [
      {
        "entity_type": "NAME",
        "start": 6,
        "end": 14,
        "score": 0.995
      }
    ]
  ],
  "stats": { "total_tokens": 4, "tps": 5420 }
}

For the full range of options, including how to configure entity types via YAML, please visit the documentation

Designed for the Agentic AI Era

PII Eraser natively understands OpenAI-format chat conversations, delivering context-aware detection for LLM guardrail and agentic workflows.

Existing PII Guardrails

Fast
Low

Scans messages individually. Fast, but misses PII requiring context.

Hi, I need to update the beneficiary details for the 'Project Alpha' contract payouts.

I can help with that. Which specific banking detail do you need to amend?

The bank account number has changed for our UK entity.

Understood. Please provide the new 8-digit account number.

82910453

Process All Messages

Slow
High

Scans entire history every turn. High accuracy, but scales poorly.

Hi, I need to update the beneficiary details for the 'Project Alpha' contract payouts.

I can help with that. Which specific banking detail do you need to amend?

The bank account number has changed for our UK entity.

Understood. Please provide the new 8-digit account number.

<BANK_ACCT>

Smart Context

Fast
High

Automatically includes relevant context. The optimal balance.

Hi, I need to update the beneficiary details for the 'Project Alpha' contract payouts.

I can help with that. Which specific banking detail do you need to amend?

The bank account number has changed for our UK entity.

Understood. Please provide the new 8-digit account number.

<BANK_ACCT>
OpenAI Chat Support

Natively process OpenAI-format chats with context pooled between messages for improved accuracy.

Selective Role Processing

Choose to anonymize only user messages while keeping system prompts and assistant responses intact.

Incremental Processing

Optimize latency-sensitive apps by skipping previously processed messages while maintaining full conversational context.

Great Accuracy, Globally

Accurate identification of 60+ entity types across Western Europe, North America and Australia.

60+ Localized Entity Types

Most tools only detect universal identifiers like emails & phone numbers. PII Eraser features deep, country-level localizations, with support for German Steuer-IDs, French NIR numbers, Australian TFNs and dozens more.

Regular Model Updates

The world changes fast. Older models fail on terms like "COVID" and aren't familiar with MCP tool calls. We continuously update our models to recognize contemporary entities and the shifting GenAI landscape.

No Regex Maintenance

PII Eraser relies on large encoder transformer models, freeing your team from maintaining fragile regex-based solutions. We also offer model updates free of charge in case we do miss something.

Protect Confidential Business Information

Detect and remove customer names, deal values, project codenames, and internal identifiers that create risk even when they aren't covered by privacy regulations.

Animated graphic illustrating global coverage for over 60 entities, including: Name, Email Address, Phone Number, Address, Payment Card, ABN (AU), SSN (USA), SIN (CA), Company House Number (UK), Steuer ID (DE), SIREN (FR), BSN (NL), Codice Fiscale (IT), Firmenbuchnummer (AT), AHV (CH).
60+ Entities

System Architecture

Customer Infrastructure (VPC / On-Prem)

Self-hosted, air-gapped deployment. Your data never leaves your environment.

Internal Network /
API Gateway
Text, Chats, Documents
REST API
Text or Chat
PII Eraser Containers
Hardened Container
Chainguard & minimal dependencies
Fast CPU Inference
AVX-512 VNNI & AMX optimizations
Horizontal Scaling
Scale on CPU behind a load balancer
JSON RESPONSE
De-identified Text or Chat
Downstream
Systems
LLMs, RAG, Analytics
Optional: Config File Allow lists, operators & entity types

High Throughput

Sustain >5,000 tokens/sec on standard 8 vCPU instances via AVX-512 VNNI and AMX instruction sets.

Bank-Grade Deployment

Multiple reference implementations including AWS Fargate & ECS and Kubernetes. Deploy a hardened, distroless container built for zero-CVE compliance.

Seamless Migration

PII Eraser provides drop-in compatibility for Microsoft Presidio Analyzer.

Why PII Eraser?

See how a purpose-built, self-hosted solution compares to the alternatives.

PII EraserCloud APIOpen SourceLLM (Generative)
Data Sovereignty100% Local / Air-gappedCloud OnlyLocalCloud (Mostly)
Cost ModelFixed Price UnlimitedPer Character (Expensive)Free (Maintenance Heavy)Per Token (Very Expensive)
EU LocalizationNativeLimitedLimitedNative
Native Chat SupportOpenAI Completions FormatNoNoNo Structured Chat Input
Long Input Support1M+ Tokens, No Accuracy LossSupportedChunking Required; Accuracy DegradesAccuracy Degrades with Length
LatencyReal-timeMediumMediumHigh (> 1000ms)
Security & Supply ChainHardened, Minimal DependenciesProvider ManagedSelf-Managed Security & PatchingProvider Managed
HallucinationsZeroZeroZeroPossible (Probabilistic)

Ready to see it in action?

Contact us for a technical walkthrough or to deploy a trial instance on your own infrastructure.