git.99rst.org Git - sgasser-llm-shield.git/log

git.99rst.org / sgasser-llm-shield.git / log

summary | shortlog | log | commit | commitdiff | tree
first ⋅ prev ⋅ next

commit | commitdiff | tree

Stefan Gasser [Wed, 21 Jan 2026 06:14:05 +0000 (07:14 +0100)]

Add generic /api/mask endpoint for standalone text masking

Adds a new POST /api/mask endpoint that can be used by browser extensions,
CLIs, or any client that needs to mask PII and secrets independently of
the OpenAI/Anthropic proxy routes.

Features:
- Detects and masks both PII and secrets (configurable via detect param)
- Returns context mapping for client-side unmasking
- Supports multi-turn conversations via startFrom counters
- Auto-detects language or accepts explicit language parameter
- Logs requests to dashboard for visibility
- Rejects whitespace-only text input
- Consistent error handling for both PII and secrets detection

commit | commitdiff | tree

Stefan Gasser [Tue, 20 Jan 2026 22:40:11 +0000 (23:40 +0100)]

Replace generic provider/LLM terminology with OpenAI or Anthropic (#56)

- Use specific names instead of "provider" or "LLM" in user-facing text
- Keep "local LLM" for Ollama/vLLM references
- Update README, docs, and configuration files

commit | commitdiff | tree

Stefan Gasser [Tue, 20 Jan 2026 22:12:25 +0000 (23:12 +0100)]

Add scan_roles and whitelist documentation (#55)

commit | commitdiff | tree

Stefan Gasser [Tue, 20 Jan 2026 22:06:58 +0000 (23:06 +0100)]

Add Anthropic API support (#51)

* Add Anthropic provider

- Add /anthropic/v1/messages endpoint with full API compatibility
- Support OAuth tokens from Claude Code for subscription users
- Provider-agnostic text extraction for PII/secrets masking
- Support streaming and non-streaming responses
- Remove unused cloud provider health checks (only local services need them)

* Add Anthropic provider documentation

- Update README and introduction to mention Anthropic support
- Add Claude Code and Anthropic SDK to integrations
- Document Anthropic provider config with OAuth support
- Create separate API reference pages for OpenAI and Anthropic
- Update navigation structure

* Improve provider error messages in logs

- Add errorMessage getter to parse OpenAI/Anthropic error formats
- Log parsed error message instead of generic "Provider error"

* Update docs wording for multi-provider support

- Clarify OpenAI and Anthropic APIs with compatible providers
- Note Anthropic endpoint is mask mode only (route mode coming)

* Add route mode support for Anthropic endpoint

- Add callLocalAnthropic function for Ollama's Anthropic API
- Update Anthropic route to support route mode with local provider
- Update docs to reflect both mask and route mode support

* Add Anthropic brand color to dashboard provider badges

* Fix duplicate /v1 prefix in Anthropic proxy wildcard handler

The path variable already contains the full path after stripping the
/anthropic prefix (e.g., /v1/messages or /api/foo). Adding /v1 again
caused double prefixes for v1 paths and incorrect paths for non-v1
endpoints like /api/event_logging/batch.

* Add role field to Anthropic extractor for scan_roles filtering

* Remove OAuth token reading, use transparent header forwarding

- Delete oauth.ts - no longer read tokens from local storage
- Simplify client.ts to forward all auth headers transparently
- Simplify anthropic.ts wildcard handler
- Add Claude Code system prompt to default whitelist
- Whitelist merges user entries with default (not replaces)

* Simplify wildcard proxies to fully transparent passthrough

* Fix wildcard proxy host header forwarding

* Update documentation: simplify intro, remove OAuth docs, add Anthropic to architecture

* Improve documentation: simplify intro, update API links, remove redundant api_key exports

commit | commitdiff | tree

Stefan Gasser [Tue, 20 Jan 2026 21:23:49 +0000 (22:23 +0100)]

Fix OpenAI wildcard proxy host header and query string forwarding (#54)

commit | commitdiff | tree

Stefan Gasser [Tue, 20 Jan 2026 20:52:02 +0000 (21:52 +0100)]

Add whitelist config for masking exclusions (#53)

Adds masking.whitelist config option to exclude specific text patterns
from PII masking. Useful for preventing false positives on known text
like company names or product identifiers.

- Add whitelist property to MaskingSchema (default: empty array)
- Add filterWhitelistedEntities function to filter detected PII
- Patterns match if detected text is contained in whitelist entry
or whitelist entry is contained in detected text

commit | commitdiff | tree

Stefan Gasser [Tue, 20 Jan 2026 19:07:03 +0000 (20:07 +0100)]

Add scan_roles config for role-based PII/secrets filtering (#52)

Allows configuring which message roles to scan for PII and secrets.
By default all roles are scanned (existing behavior). When scan_roles
is set, only messages with matching roles are sent to Presidio.

Use case: Skip scanning large system prompts to reduce API calls and
avoid false positives on app-controlled content.

- Add role field to TextSpan type
- OpenAI extractor populates role from message
- PII detector filters by scan_roles before Presidio calls
- Secrets detector filters by scan_roles before detection

commit | commitdiff | tree

Stefan Gasser [Mon, 19 Jan 2026 17:30:22 +0000 (18:30 +0100)]

Refactor for multi-provider architecture (#49)

Reorganizes the codebase to support multiple LLM providers with a clean,
extensible architecture. This is a preparatory refactor that improves
code organization without adding new provider support.

Architecture changes:
- Move masking utilities to src/masking/ (conflict-resolver, placeholders, context)
- Add provider-specific directories: src/providers/openai/
- Create shared provider utilities: src/providers/errors.ts, src/routes/utils.ts
- Extract OpenAI-specific code to src/masking/extractors/openai.ts
- Add service layer: src/services/pii.ts, src/services/secrets.ts
- Move stream transformer to provider directory

New patterns:
- Provider-agnostic text extraction with TextExtractor interface
- Shared error handling with ProviderError class
- Centralized timeout constants in src/constants/timeouts.ts
- Unified logging helpers in src/routes/utils.ts

Removed:
- src/services/decision.ts (logic moved to service layer)
- src/providers/openai-client.ts (replaced by src/providers/openai/client.ts)

All 219 tests pass.

commit | commitdiff | tree

Stefan Gasser [Sat, 17 Jan 2026 20:07:39 +0000 (21:07 +0100)]

Reorganize for multi-provider support (#48)

* Reorganize for multi-provider support

Move LLM client to providers/ and rename proxy routes to openai:
- src/services/llm-client.ts → src/providers/openai-client.ts
- src/routes/proxy.ts → src/routes/openai.ts
- Rename proxyRoutes export to openaiRoutes

Prepares codebase for adding Anthropic provider support.

* Fix import sorting for biome check

commit | commitdiff | tree

Stefan Gasser [Sat, 17 Jan 2026 19:32:54 +0000 (20:32 +0100)]

Add per-part PII/secrets detection for multimodal messages (#47)

Each text block in multimodal messages is now analyzed separately
rather than concatenating all text together. This enables precise
masking while preserving message structure.

Changes:
- Per-part detection for both PII and secrets
- Symmetric pii/ and secrets/ module structure
- Shared utilities in utils/message-transform.ts
- Rename "redact" → "mask" for consistency
- Centralize MaskResult, Span interfaces

Breaking changes:
- Header: X-PasteGuard-Secrets-Redacted → X-PasteGuard-Secrets-Masked
- Config: secrets_detection.action "redact" → "mask"

commit | commitdiff | tree

Stefan Gasser [Sat, 17 Jan 2026 14:24:42 +0000 (15:24 +0100)]

Add Production Setup section to installation docs (#45)

commit | commitdiff | tree

Stefan Gasser [Sat, 17 Jan 2026 14:17:20 +0000 (15:17 +0100)]

Simplify Quick Start docker command (#44)

commit | commitdiff | tree

Stefan Gasser [Sat, 17 Jan 2026 13:10:36 +0000 (14:10 +0100)]

All-in-one Docker image with prebuilt language support (#42)

* All-in-one Docker image with dev mode support

Simplify deployment with a single container that includes both the proxy
and Presidio PII detection. No git clone required - just docker run.

Changes:
- Combined Dockerfile: Multi-stage build with Presidio + Bun in one container
- supervisord: Process manager for running both services
- Prebuilt images: ghcr.io/sgasser/pasteguard:en (~2.7GB) and :eu (~12GB)
- Release workflow: GitHub Actions to build en/eu images on version tags
- Dev mode: docker compose up presidio -d for local development with hot-reload
- Updated docs: curl-based installation, no cloning needed for production
- Language models: Changed from _md to _lg for better PII detection accuracy

Image tags:
- en (default/latest): English only
- eu: European languages (en, de, es, fr, it, nl, pl, pt, ro)

* Move Docker files into docker/ directory

Reorganize Docker-related files for cleaner project structure:
- Dockerfile → docker/Dockerfile
- supervisord.conf → docker/supervisord.conf
- presidio/ → docker/presidio/

Keep docker-compose.yml in root for convenience (docker compose up works).
Update all path references in workflow, configs, and docs.

* Fix CI: update Dockerfile path

* Include config.example.yaml in Docker image for zero-config startup

* Simplify docs: zero-config quickstart

* Add missing ENV detection entities to config example

Add ENV_PASSWORD, ENV_SECRET, and CONNECTION_STRING to the
secrets_detection entities section to match what's documented
and implemented.

* Auto-configure languages per Docker image

- Add PASTEGUARD_LANGUAGES env var to Dockerfile (set from LANGUAGES build arg)
- Update config.example.yaml to use env var with fallback to 'en'
- Support comma-separated string for languages config (e.g., "en,de,fr")
- EN image now auto-enables English, EU image auto-enables all 9 EU languages

Users can still override via config.yaml with array syntax if needed.

* Update docs: languages are auto-configured per image

* Clarify runtime vs build-time env vars in docs

* Update docs: languages are auto-configured per Docker image

* Remove confusing env var override example from docs

* List specific EU languages in docs instead of 'All 9'

commit | commitdiff | tree

Stefan Gasser [Sat, 17 Jan 2026 11:28:40 +0000 (12:28 +0100)]

docs: add browser extension beta section (#43)

Add beta signup CTA to README and introduction docs to build
early adopter community for the upcoming browser extension.

commit | commitdiff | tree

Stefan Gasser [Sat, 17 Jan 2026 00:10:25 +0000 (01:10 +0100)]

Merge pull request #41 from sgasser/feature/config-refactor-and-wildcard-proxy

Refactor config: providers.upstream → providers.openai, add wildcard proxy

commit | commitdiff | tree

Stefan Gasser [Sat, 17 Jan 2026 00:05:08 +0000 (01:05 +0100)]

Remove secrets detection tests from proxy.test.ts (tested in detect.test.ts)

commit | commitdiff | tree

Stefan Gasser [Fri, 16 Jan 2026 23:53:24 +0000 (00:53 +0100)]

Fix wildcard proxy body forwarding, simplify config example

commit | commitdiff | tree

Stefan Gasser [Fri, 16 Jan 2026 23:36:14 +0000 (00:36 +0100)]

Refactor config: providers.upstream → providers.openai, add wildcard proxy

Config changes:
- Rename providers.upstream to providers.openai for clarity
- Remove routing config section (simplified to: PII → local, no PII → openai)
- Move local provider to top-level config (not under providers)
- Change default secrets action from block to redact

Proxy changes:
- Replace specific /models route with wildcard /* proxy
- Supports all OpenAI endpoints: /models, /embeddings, /audio/*, etc.

Documentation:
- Update all docs to reflect new config structure
- Remove docs/api-reference/models.mdx (now covered by wildcard proxy)

commit | commitdiff | tree

Stefan Gasser [Fri, 16 Jan 2026 22:30:36 +0000 (23:30 +0100)]

fix: log and display API errors in dashboard (#40)

Fixes #35

- Add LLMError class to preserve upstream status code and body
- Add status_code and error_message columns to request logs
- Add Status column to dashboard with OK/error badges
- Pass through upstream errors (429, 401, etc.) with original status
- Return OpenAI-compatible JSON format for all error responses
- Set X-PasteGuard headers consistently via Hono context

commit | commitdiff | tree

Stefan Gasser [Fri, 16 Jan 2026 16:40:34 +0000 (17:40 +0100)]

Merge pull request #39 from sgasser/fix/issue-33-offset-error

Fix overlapping entity conflict resolution (#33)

commit | commitdiff | tree

Stefan Gasser [Fri, 16 Jan 2026 16:39:28 +0000 (17:39 +0100)]

Fix lint: format code, remove unused isContainedIn

commit | commitdiff | tree

Stefan Gasser [Fri, 16 Jan 2026 16:35:42 +0000 (17:35 +0100)]

Fix partial overlap bug: higher score wins for all overlaps

Previous behavior kept both entities when they partially overlapped
but had different types, causing text corruption during masking.

Now: sort by score desc, then length desc, then position. Any overlap
removes the lower-scored entity.

commit | commitdiff | tree

Stefan Gasser [Fri, 16 Jan 2026 16:24:37 +0000 (17:24 +0100)]

Add edge case tests: unsorted input, chain of 3 overlaps

commit | commitdiff | tree

Stefan Gasser [Fri, 16 Jan 2026 16:15:59 +0000 (17:15 +0100)]

Clean up tests: consistent naming, remove comments, add mutation test

commit | commitdiff | tree

Stefan Gasser [Fri, 16 Jan 2026 16:14:37 +0000 (17:14 +0100)]

Clean up comments and rename resolveConflictsSimple to resolveOverlaps

commit | commitdiff | tree

Stefan Gasser [Fri, 16 Jan 2026 16:11:50 +0000 (17:11 +0100)]

Fix overlapping entity conflict resolution (#33)

Implement Presidio-style conflict resolution for overlapping PII entities.

Problem: Presidio returns overlapping entities (e.g., both "Eric" and "Eric's")
which caused text corruption during masking.

Solution: Two-phase algorithm matching Presidio's Anonymizer logic:
1. Merge overlapping entities of same type (expand boundaries, keep highest score)
2. Remove conflicting entities of different types (contained or lower score loses)

- Add conflict-resolver.ts with resolveConflicts() and resolveConflictsSimple()
- Replace old removeOverlappingEntities with new conflict resolver
- Add 18 tests covering all conflict scenarios
- Delete old entities.ts (replaced by conflict-resolver.ts)

commit | commitdiff | tree

Stefan Gasser [Fri, 16 Jan 2026 14:09:32 +0000 (15:09 +0100)]

Fix overlapping entity masking (#33)

Presidio can return overlapping PII detections (e.g., both "Eric" and
"Eric's" as separate PERSON entities). When both were masked, the string
positions became invalid, corrupting the output.

Added removeOverlappingEntities() to filter overlaps before masking:
- Sort by start position (longer first if same start)
- Keep only non-overlapping entities

commit | commitdiff | tree

Stefan Gasser [Fri, 16 Jan 2026 15:40:04 +0000 (16:40 +0100)]

fix: use [[]] delimiters for placeholders to prevent HTML encoding issues (#38)

Fixes #36 - HTML-encoded placeholders now unmask correctly

Changes:
- Changed placeholder format from <TYPE_N> to [[TYPE_N]]
- Created src/constants/placeholders.ts as single source of truth
- Removed configurable redact_placeholder (was bug - streaming hardcoded [[)
- Updated dashboard regex for yellow highlighting
- Added tests for HTML/JSON/URL contexts

The [[]] delimiters are safe in HTML, JSON, and URLs - they don't get
entity-encoded like <> did.

commit | commitdiff | tree

Stefan Gasser [Fri, 16 Jan 2026 13:30:11 +0000 (14:30 +0100)]

feat(dashboard): display secrets detection in logs table (#30)

Add Secrets column to the Recent Requests table showing detected
secret types (e.g., OPENSSH_PRIVATE_KEY, PEM_PRIVATE_KEY) as red
badges. The data was already available from the API but not displayed.

- Add Secrets column header between PII Entities and Scan Time
- Parse secrets_types and secrets_detected from API response
- Display secret types as error-colored badges
- Add CSS utilities for error color variants
- Update colspan values for detail rows

Closes #20

commit | commitdiff | tree

Stefan Gasser [Mon, 12 Jan 2026 15:48:24 +0000 (16:48 +0100)]

Exclude test files from secret scanning (#31)

Test files contain intentional fake secrets for testing secret detection.
This prevents false positive alerts for MongoDB connection strings and
other test fixtures in src/secrets/detect.test.ts.

commit | commitdiff | tree

Max Wolf [Mon, 12 Jan 2026 15:02:29 +0000 (16:02 +0100)]

Add environment variable credential detection (#19)

* Add PatternDetector and DetectionResult interfaces for secrets detection registry

* Move pattern detection utility to new patterns/utils.ts module

* Refactor secrets detection using a registry system

- Create privateKeysDetector, apiKeysDetector, tokensDetector modules
- Refactor detectSecrets() to use the pattern registry
- Re-export types from detect.ts for backwards compatibility

* Change default secrets_detection action to redaction

Hint: The example config still shows `action: block` explicitly, with a comment noting
that `redact` is the default action if not specified

* Implement new pattern detector and add corresponding SecretEntityType options

* Register new detector and extend test suite accordingly

* Add new entity types to config.ts

* Update docs and example config

* Add environment variables section to secrets detection docs

---------

Co-authored-by: Stefan Gasser <redacted>

commit | commitdiff | tree

Stefan Gasser [Mon, 12 Jan 2026 12:32:41 +0000 (13:32 +0100)]

Add animated demo GIF to README and docs (#29)

- New demo.gif showing the complete mask/unmask flow
- Embed in README hero section
- Add to docs introduction page

commit | commitdiff | tree

Stefan Gasser [Mon, 12 Jan 2026 07:13:59 +0000 (08:13 +0100)]

Always run language detection even with single language configured (#26)

Previously, language detection was skipped when only one language was
configured, returning the configured language directly. This made it
impossible to detect misconfiguration (e.g., only EN configured but
receiving DE text).

Now language detection always runs, providing:
- Actual detected language in logs (detectedLanguage field)
- Confidence score for debugging
- usedFallback=true when detected language isn't configured

Performance impact is negligible (~0.01-0.05ms per detection).

commit | commitdiff | tree

Stefan Gasser [Sun, 11 Jan 2026 18:42:09 +0000 (19:42 +0100)]

Fix PII detection to scan all message roles (#25)

Previously, PII detection only scanned the last user message initially,
then did a full scan only if PII was found. This caused PII in system
messages (e.g., RAG context from PDFs) to be missed entirely when the
user message contained no PII.

Changes:
- Consolidate analyzeMessages() to always scan all messages
- Scan system, developer, user, and assistant roles
- Remove analyzeAllMessages() as it's no longer needed
- Simplify decision.ts by removing the redundant full scan call

This ensures PII in system messages (common in RAG patterns) is properly
detected and masked before being sent to upstream LLMs.

Fixes #17

commit | commitdiff | tree

Max Wolf [Sun, 11 Jan 2026 17:55:34 +0000 (18:55 +0100)]

Refactor secrets detection into pattern registry (#18)

* Add PatternDetector and DetectionResult interfaces for secrets detection registry

* Move all interfaces to patterns/types.ts and use the existing SecretesDetectionResult interface instead of the new DetectionResult

* Move pattern detection utility to new patterns/utils.ts module

* Refactor secrets detection using a registry system

- Create privateKeysDetector, apiKeysDetector, tokensDetector modules
- Refactor detectSecrets() to use the pattern registry
- Re-export types from detect.ts for backwards compatibility

* Change default secrets_detection action to redaction

Hint: The example config still shows `action: block` explicitly, with a comment noting
that `redact` is the default action if not specified

* Fix README default action references and improve overall structure / formatting

- Update all references from 'block (default)' to 'redact (default)'
- Fix Bearer token documentation (20+ → 40+ chars)
- Reorganize Configuration section with consistent headers
- Improve table formatting and section descriptions
- Use references to reduce duplications and maintenance overhead

* Improve type safety in PatternDetector interface

Use SecretEntityType instead of string for enabledTypes Set parameter

* Update docs to reflect redact as new default action

Reorder actions to show default first

---------

Co-authored-by: Stefan Gasser <redacted>

commit | commitdiff | tree

Stefan Gasser [Sun, 11 Jan 2026 10:11:54 +0000 (11:11 +0100)]

Add Mintlify documentation and simplify README (#24)

* Add Mintlify documentation and simplify README

Documentation:
- Add complete Mintlify docs with introduction, quickstart, integrations
- Add concept guides: mask mode, route mode, PII detection, secrets detection
- Add API reference: chat completions, models, status, dashboard API
- Add configuration guides: overview, providers, PII, secrets, logging
- Include dashboard screenshot and branding assets

README:
- Simplify structure, move detailed docs to Mintlify
- Add centered badges and navigation links
- Add "What is PasteGuard?" section explaining problem/solution
- Update example to Dr. Sarah Chen (consistent across all docs)
- Reorder integrations (OpenAI SDK, LangChain, LlamaIndex first)
- Move Presidio attribution inline with PII section
- Add Tech Stack section

Code:
- Update description to "Privacy proxy for LLMs" in package.json,
startup banner, and /info endpoint

Closes #21

* docs: fix secrets default to block (matches current code)

commit | commitdiff | tree

Stefan Gasser [Sun, 11 Jan 2026 09:14:06 +0000 (10:14 +0100)]

fix: support developer and function roles for GPT-5.x compatibility (#23)

GPT-5.x models use the 'developer' role instead of 'system' for
instructions. The 'function' role is also added for legacy compatibility.

Fixes #22

commit | commitdiff | tree

Stefan Gasser [Sat, 10 Jan 2026 16:47:28 +0000 (17:47 +0100)]

feat: Update dashboard to match design system (#16)

* feat: Update dashboard to match design system

- Apply design system tokens (colors, typography, radius, shadows)
- Add branded "Redaction Bar Loader" animation for loading/empty states
- Add favicon route serving inline SVG
- Update README with wordmark SVG and optimized screenshot
- Move assets to assets/ folder (logo, wordmark, favicon, dashboard)

* fix: Formatting and browser compatibility fixes

- Fix biome formatting in favicon route (multi-line object)
- Add color-mix() fallback for card hover effect (older browsers)

commit | commitdiff | tree

Stefan Gasser [Sat, 10 Jan 2026 14:00:48 +0000 (15:00 +0100)]

Fix lint errors from multimodal PR (#11)

- Remove unused imports in multimodal.test.ts
- Remove unused secretsConfig variable
- Fix import ordering in masking.ts and content.test.ts
- Remove trailing whitespace in proxy.ts and masking.ts

commit | commitdiff | tree

Stefan Gasser [Sat, 10 Jan 2026 13:23:29 +0000 (14:23 +0100)]

Merge pull request #8 from mkroemer/fix/multimodal-content-support

Fix/multimodal content support

commit | commitdiff | tree

mkroemer [Sat, 10 Jan 2026 08:21:33 +0000 (09:21 +0100)]

You are right, and I (and claude) addressed these issues.

1. Secrets redaction (proxy.ts):
   - Add per-part offset tracking to prevent partial secret leaks
   - Filter and adjust redaction positions for each text part
   - Fixes issue where 'sk-proj-' prefix could remain visible

2. PII masking (masking.ts):
   - Actually mask array content instead of returning original
   - Add per-part offset tracking for accurate entity positions
   - Properly handle multimodal arrays with text and images

3. Tests:
   - Add content.test.ts for text extraction utilities
   - Add multimodal.test.ts with integration tests

commit | commitdiff | tree

mkroemer [Fri, 9 Jan 2026 20:06:49 +0000 (21:06 +0100)]

fix: Update type system for multimodal content support

- Update ChatMessage interface to accept MessageContent type
- Fix masking.ts to handle multimodal content
- Fix redact.ts to extract text before processing
- Fix import order and formatting per biome linter

commit | commitdiff | tree

mkroemer [Fri, 9 Jan 2026 19:58:56 +0000 (20:58 +0100)]

fix: Add support for multimodal content (text + images)

Fixes crashes when processing OpenAI Chat Completion requests with
multimodal content (array format). Previously, the code assumed message
content is always a string, causing Presidio errors and 502/503 responses
when LibreChat Agents sent vision requests.

Changes:
- Add extractTextContent() utility to safely extract text from both
string and array content formats
- Update PII detection to handle multimodal messages
- Update secrets detection to extract text from array content
- Update message redaction to preserve images while redacting text
- Fix dashboard display to show readable content instead of [object Object]

This enables full support for OpenAI's multimodal API format while
maintaining PII/secrets protection on text portions.

Resolves issues with LibreChat Agents feature returning 502 errors.

commit | commitdiff | tree

Stefan Gasser [Fri, 9 Jan 2026 17:26:03 +0000 (18:26 +0100)]

Merge pull request #7 from maximiliancw/main

Add secrets detection to prevent credential leaks

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 16:10:15 +0000 (17:10 +0100)]

Refine bearer token and JWT token detection patterns to enforce minimum character requirements;
Updated tests to reflect changes in bearer token format

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 16:03:55 +0000 (17:03 +0100)]

Update all references to outdated versions

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 16:00:19 +0000 (17:00 +0100)]

Enhance request logging to conditionally include detected secret types based on configuration; ensuring sensitive information is only logged when explicitly allowed, improving security and compliance

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 15:57:49 +0000 (16:57 +0100)]

Clarify max_scan_chars behavior for secrets detection; Added notes on detection limits and performance implications

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 15:56:09 +0000 (16:56 +0100)]

Fix: removed space in joined header string for consistency

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 15:52:46 +0000 (16:52 +0100)]

Use OpenAI-compatible error format for secrets blocking:

- Use 400 status code instead of 422
- Use standard error format {message, type, param, code}
- Remove non-standard 'details' field (secret types already in headers)
- Update tests to match new format

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 15:23:47 +0000 (16:23 +0100)]

Add database migration for secrets detection columns

Existing installations will fail with 'table request_logs has no column
named secrets_detected' since the new columns don't exist in their SQLite
database. This adds a migration check that adds the missing columns if they
don't exist.

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 15:04:32 +0000 (16:04 +0100)]

Implement redact and route_local actions:

- Integrate redact action: redacts secrets before PII detection, unredacts in responses
- Implement route_local action: routes requests with secrets to local provider
- Update stream transformer to handle both PII and secrets contexts
- Add comprehensive tests for secrets routing logic
- Update config.example.yaml with new entity types and action documentation
- Update README.md with complete secrets detection features

New secret entity types (opt-in):

- API_KEY_OPENAI, API_KEY_AWS, API_KEY_GITHUB
- JWT_TOKEN, BEARER_TOKEN

Response headers:
- X-PasteGuard-Secrets-Redacted: true (when action=redact)

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 15:02:22 +0000 (16:02 +0100)]

Add reversible redaction module for secret masking:

- Create redact.ts with RedactionContext for tracking secret mappings
- Implement redactSecrets() with configurable placeholder format
- Implement unredactSecrets() for restoring original secrets in responses
- Add streaming helpers for unredacting SSE responses
- Add comprehensive tests covering roundtrip, multiple messages, and streaming

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 14:52:56 +0000 (15:52 +0100)]

feat(secrets): add detection for API keys, JWT tokens, and Bearer tokens

- Add new secret entity types: API_KEY_OPENAI, API_KEY_AWS, API_KEY_GITHUB, JWT_TOKEN, BEARER_TOKEN
- Extract pattern detection into reusable helper function
- Add comprehensive tests for all new secret types with false positive checks
- Update config schema with typed entity enum

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 14:38:08 +0000 (15:38 +0100)]

Improve error messaging for secrets detection configuration validation on startup

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 14:24:41 +0000 (15:24 +0100)]

Merge remote-tracking branch 'origin/main'

# Conflicts:
# README.md
# src/routes/proxy.ts

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 14:21:23 +0000 (15:21 +0100)]

Prepare for merge with breaking changes from remote origin

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 13:59:18 +0000 (14:59 +0100)]

Add secrets shield feature documentation to README

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 13:55:10 +0000 (14:55 +0100)]

Add simple integration tests for secrets detection blocking behavior in proxy.ts

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 13:52:51 +0000 (14:52 +0100)]

Add comprehensive unit tests for secrets detection

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 13:49:27 +0000 (14:49 +0100)]

Add validation for secrets detection config on startup and update banner

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 13:44:59 +0000 (14:44 +0100)]

Enhance secrets detection functionality in proxy routes:

- Integrate secrets detection logic into the proxy request handling.
- Add configuration checks for enabling/disabling secrets detection.
- Implement logging for detected secrets and their types.
- Update the logger to accommodate new fields for secrets detection.
- Refactor related functions for improved clarity and maintainability

commit | commitdiff | tree

Stefan Gasser [Fri, 9 Jan 2026 13:37:25 +0000 (14:37 +0100)]

Rename setShieldHeaders to setPasteGuardHeaders

commit | commitdiff | tree

Stefan Gasser [Fri, 9 Jan 2026 13:27:51 +0000 (14:27 +0100)]

Update dashboard screenshot with PasteGuard branding (#6)

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 13:02:05 +0000 (14:02 +0100)]

Extend .gitignore

commit | commitdiff | tree

Stefan Gasser [Fri, 9 Jan 2026 12:57:21 +0000 (13:57 +0100)]

Rename project from LLM-Shield to PasteGuard (#5)

After positive community response (100+ GitHub stars), committing fully
to this project. The name PasteGuard better describes the core function:
"guard what you paste" before sending to LLMs.

Changes:
- Update all branding: package.json, README, CONTRIBUTING
- Rename response headers: X-LLM-Shield-* → X-PasteGuard-*
- Update dashboard UI and page title
- Update /info endpoint metadata
- Update startup banner
- Update config files and defaults
- Update CI workflow docker tag
- Regenerate bun.lock with new package name

Domain pasteguard.com secured. Old GitHub links redirect automatically.

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 12:55:01 +0000 (13:55 +0100)]

Add text extraction utility function

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 12:49:45 +0000 (13:49 +0100)]

Implement secrets detection module for OpenSSH and PEM private keys

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 12:33:47 +0000 (13:33 +0100)]

Add secrets_detection section to config.example.yaml

commit | commitdiff | tree

maximiliancw [Fri, 9 Jan 2026 12:31:10 +0000 (13:31 +0100)]

Add secrets detection config schema with validation

commit | commitdiff | tree

Stefan Gasser [Fri, 9 Jan 2026 08:07:56 +0000 (09:07 +0100)]

Rename chat routes to proxy for clarity (#4)

- Rename chat.ts → proxy.ts to better reflect purpose (LLM proxy routing)
- Update CLAUDE.md architecture to match actual file structure

commit | commitdiff | tree

Stefan Gasser [Fri, 9 Jan 2026 07:28:55 +0000 (08:28 +0100)]

Fix EISDIR error when config.yaml is a directory (#3)

Add isFile() check before reading config to give a clear error message
when Docker creates a directory instead of mounting a missing file.

Fixes #2

commit | commitdiff | tree

Stefan Gasser [Thu, 8 Jan 2026 16:15:59 +0000 (17:15 +0100)]

Add PII accuracy benchmark with multi-language phone context (#1)

- Add benchmark framework with precision/recall/F1 metrics
- Add 30 test cases across 5 languages (DE, EN, ES, FR, IT)
- Add phone_context words for all 24 supported languages
- Each language has 5-7 native words for: phone, number, mobile, call

Test with: bun run benchmark:accuracy

commit | commitdiff | tree

Stefan Gasser [Thu, 8 Jan 2026 11:15:42 +0000 (12:15 +0100)]

fix: support environment variables in numeric config values

Use z.coerce.number() to handle string values from environment variable
substitution in YAML config. Add validation constraints:

- port: integer 1-65535
- retention_days: integer >= 0
- score_threshold: float 0-1

commit | commitdiff | tree

Stefan Gasser [Thu, 8 Jan 2026 10:14:12 +0000 (11:14 +0100)]

Initial release

OpenAI-compatible privacy proxy with two modes:
- Mask: Replace PII with placeholders before upstream, unmask in response
- Route: Send PII-containing requests to local LLM

Features:
- 24 language support for PII detection
- Real-time streaming with unmasking
- Dashboard for monitoring
- Microsoft Presidio integration

git clone https://git.99rst.org/PROJECT