Precise PII Detection at Scale

Runs air-gapped on your infrastructure with better accuracy than the hyperscaler alternatives.

Core Operations

PII Eraser offers four modes of operation to detect and anonymize PII, PCI and other sensitive data.

Detect (NER)

Identify selected entities, returning character offsets and confidence scores. Supports overlapping entities and nested PII.

Redact

Detect and replace entities with entity-specific placeholders like <NAME>. Ideal for preparing datasets for ML training and sanitizing LLM prompts.

Mask

Obfuscate characters while preserving format length (e.g., John -> ####). Useful for UX logs and where partial visibility is required.

Hash

Replace entities with consistent SHA-256 or SHA-512 hashes. Enables analytics on unique users without revealing identity.

Python Input
import requests

# Detect sensitive data instances
response = requests.post(
    "<base url>/text/detect",
    json={
        "text": ["Hallo Matthias"],
    }
)
Output JSON
response.json()
{
  "entities": [
    [
      {
        "entity_type": "NAME",
        "start": 6,
        "end": 14,
        "score": 0.995
      }
    ]
  ],
  "stats": { "total_tokens": 4, "tps": 5420 }
}

For the full range of options, including how to configure entity types via YAML, please visit the documentation

Designed for the Agentic AI Era

Built for LLM guardrail use, PII Eraser natively supports OpenAI-format chats to ensure seamless, context-aware privacy.

Process All Messages

Slow
High

Scans entire history every turn. High accuracy, but scales poorly.

Hi, I need to update the beneficiary details for the 'Project Alpha' contract payouts.

I can help with that. Which specific banking detail do you need to amend?

The bank account number has changed for our UK entity.

Understood. Please provide the new 8-digit account number.

<BANK_ACCT>

Process Last Message

Fast
Low

Scans only the latest prompt. Fast, but misses PII requiring context.

Hi, I need to update the beneficiary details for the 'Project Alpha' contract payouts.

I can help with that. Which specific banking detail do you need to amend?

The bank account number has changed for our UK entity.

Understood. Please provide the new 8-digit account number.

82910453

Smart Context

Fast
High

Automatically includes relevant context. The optimal balance.

Hi, I need to update the beneficiary details for the 'Project Alpha' contract payouts.

I can help with that. Which specific banking detail do you need to amend?

The bank account number has changed for our UK entity.

Understood. Please provide the new 8-digit account number.

<BANK_ACCT>
OpenAI Chat Support

Pools context between messages for higher accuracy without processing history needlessly.

Flexible Configuration

Configure scanning for just completions or the last user prompt with intelligent context handling.

RAG Robustness

Optimized to handle RAG artifacts like chunking errors, scraping noise, and mixed-language content.

Great Accuracy, Globally

Accurate identification of 150+ entity types across Western Europe, North America and Australia.

150+ Localized Entity Types

Most systems are built for the US and retrofitted for the world with US-centric entity names like "SSN". PII Eraser is built from the ground up for global coverage with a focus on Western Europe, as well as Canada, US and Australia.

Regular Model Updates

The world changes fast. Older models fail on terms like "COVID" and aren't familiar with MCP tool calls. We continuously update our models to recognize contemporary entities and the shifting GenAI landscape.

No Regex Maintenance

PII Eraser relies on large encoder transformer models, freeing your team from maintaining fragile regex-based solutions. We also offer model updates free of charge in case we do miss something.

150+ Entities
NameEmail AddressPhone NumberAddressPayment CardABN (AU)SSN (USA)SIN (CA)Company House Number (UK)Steuer ID (DE)SIREN (FR)BSN (NL)Codice Fiscale (IT)Firmenbuchnummer (AT)AHV (CH)

System Architecture

Customer Infrastructure (VPC / On-Prem)

Self-hosted, air-gapped deployment within your secure environment.

Your Application /
Data Pipeline
REST API POST
Text or Chat with PII
PII Eraser Container
NLU & Entity Detection
Transformer Models
CPU Optimized
ARM (Graviton) & x86
Air-gapped Operation
No external network calls
JSON RESPONSE
Sanitized Data
Your Application /
Storage
ML Model / RAG
Optional: config.yaml Custom policies & entities

High Throughput

Capable of processing >5000 tokens/sec on a single instance via ARM SVX or x86 AMX instructions, such as AWS Graviton 4.

Flexible Deployment

Multiple deployment options including AWS Fargate, ECS and Azure Container Instances. CPU-only operation makes security scans, startup and autoscaling easy.

Seamless Migration

PII Eraser features Microsoft Presidio Analyzer compatibility endpoints, allowing for easy drop-in replacement.

Why PII Eraser?

Compare us against cloud APIs, open source libraries, and LLM-based approaches.

Deployment MethodSelf-hosted ContainerCloud APILocal LibraryLLM (Generative)
Data Sovereignty100% Local / Air-gappedCloud OnlyLocalCloud (Mostly)
Cost ModelHourly (Unlimited Tokens)Per Character (Expensive)Free (Maintenance Heavy)Per Token (Very Expensive)
EU LocalizationNative (DACH, FR, IT, etc.)Limited (EN, ES, FR)Requires Custom RegexVaries (Training Bias)
Latency (p99)< 20ms~100ms~200ms (Python overhead)> 1000ms
Context Window1M+ TokensShort (Chunks required)ShortVariable
HallucinationsZero (Deterministic)ZeroZeroPossible

Get started in minutes with our free trial

Per-hour pricing on AWS and Azure Marketplaces. No sales calls and no lock-in.