Add PII accuracy benchmark with multi-language phone context (#1)
authorStefan Gasser <redacted>
Thu, 8 Jan 2026 16:15:59 +0000 (17:15 +0100)
committerGitHub <redacted>
Thu, 8 Jan 2026 16:15:59 +0000 (17:15 +0100)
commit189e4361fc6b3bf30965de6ff23ce249a18215bf
tree1e6d40c9374bb26dd32e47fa7ddcb69a7878c6e1
parentdaa2c882ff044d2b65ede13b5a3c6c47737c8cc9
Add PII accuracy benchmark with multi-language phone context (#1)

- Add benchmark framework with precision/recall/F1 metrics
- Add 30 test cases across 5 languages (DE, EN, ES, FR, IT)
- Add phone_context words for all 24 supported languages
- Each language has 5-7 native words for: phone, number, mobile, call

Test with: bun run benchmark:accuracy
README.md
benchmarks/pii-accuracy/run.ts [new file with mode: 0644]
benchmarks/pii-accuracy/test-data/de.yaml [new file with mode: 0644]
benchmarks/pii-accuracy/test-data/en.yaml [new file with mode: 0644]
benchmarks/pii-accuracy/test-data/es.yaml [new file with mode: 0644]
benchmarks/pii-accuracy/test-data/fr.yaml [new file with mode: 0644]
benchmarks/pii-accuracy/test-data/global.yaml [new file with mode: 0644]
benchmarks/pii-accuracy/test-data/it.yaml [new file with mode: 0644]
benchmarks/pii-accuracy/types.ts [new file with mode: 0644]
package.json
presidio/languages.yaml
git clone https://git.99rst.org/PROJECT