| Configuration | Precision | Recall | F1 Score | Throughput MB/s | Heap MB |
|---|---|---|---|---|---|
Results are produced by `BenchmarkResult.compute()` in the test suite.
Each configuration is run against a synthetic labeled dataset of 128 samples
spread across 9 PII types (SSN, EMAIL, PHONE, CREDIT_CARD, API_KEY, PASSWORD,
IP_ADDRESS, BANK_ACCOUNT, DATE_OF_BIRTH) plus 20 negative / clean samples.
Matching is overlap-based: a detection counts as a true positive when
its character span overlaps the labeled span and the PII type matches. Macro
precision / recall / F1 are averaged uniformly across types that appear in the
reference data.
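The matching and averaging rules above can be sketched as follows. This is an illustration only, not the project's `BenchmarkResult` code: the class `OverlapScoringSketch`, the `Span` type, and the `macroScores` method are hypothetical names. The sketch also makes assumptions the text does not state: each detection can satisfy at most one label (greedy one-to-one matching), spans use exclusive end offsets, and detections of types absent from the reference data are ignored.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Illustrative sketch; all names are hypothetical, not the project's API. */
final class OverlapScoringSketch {

    /** A typed character span; end offset is exclusive (an assumption). */
    static final class Span {
        final String type;
        final int start;
        final int end;

        Span(String type, int start, int end) {
            this.type = type;
            this.start = start;
            this.end = end;
        }

        /** True when the types match and the character ranges share at least one position. */
        boolean overlaps(Span other) {
            return type.equals(other.type) && start < other.end && other.start < end;
        }
    }

    /** Per-type true positive, false positive, and false negative counts. */
    static final class Counts {
        int tp;
        int fp;
        int fn;
    }

    /** Returns {macroPrecision, macroRecall, macroF1} over the types present in the labels. */
    static double[] macroScores(List<Span> labels, List<Span> detections) {
        Map<String, Counts> byType = new LinkedHashMap<>();
        for (Span label : labels) {
            byType.computeIfAbsent(label.type, t -> new Counts());
        }

        // Greedy one-to-one matching: each detection may satisfy at most one label.
        boolean[] used = new boolean[detections.size()];
        for (Span label : labels) {
            boolean matched = false;
            for (int i = 0; i < detections.size() && !matched; i++) {
                if (!used[i] && detections.get(i).overlaps(label)) {
                    used[i] = true;
                    matched = true;
                }
            }
            Counts c = byType.get(label.type);
            if (matched) {
                c.tp++;
            } else {
                c.fn++;
            }
        }

        // Unmatched detections are false positives, counted only for labeled types.
        for (int i = 0; i < detections.size(); i++) {
            Counts c = byType.get(detections.get(i).type);
            if (!used[i] && c != null) {
                c.fp++;
            }
        }

        // Unweighted (macro) average of the per-type scores.
        double p = 0, r = 0, f = 0;
        for (Counts c : byType.values()) {
            double prec = c.tp + c.fp == 0 ? 0.0 : (double) c.tp / (c.tp + c.fp);
            double rec = c.tp + c.fn == 0 ? 0.0 : (double) c.tp / (c.tp + c.fn);
            double f1 = prec + rec == 0 ? 0.0 : 2 * prec * rec / (prec + rec);
            p += prec;
            r += rec;
            f += f1;
        }
        int n = byType.size();
        return n == 0 ? new double[] {0, 0, 0} : new double[] {p / n, r / n, f / n};
    }
}
```

Because the average is taken per type, a rare type with one missed sample pulls the macro score down as much as a common type with many misses, which is the usual reason to report macro rather than micro figures on an unevenly distributed dataset.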
Throughput assumes 2 bytes per Java char (UTF-16).
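As a rough illustration of the throughput arithmetic, the sketch below converts a character count and an elapsed time into MB/s using the 2-bytes-per-char assumption. The class and method names are hypothetical, and treating 1 MB as 1024 × 1024 bytes is an assumption; the benchmark may use 10^6 instead.

```java
/** Illustrative sketch; names are hypothetical, not the project's API. */
final class ThroughputSketch {

    /** Converts processed chars and elapsed time to MB/s, assuming 2 bytes per UTF-16 char. */
    static double megabytesPerSecond(long totalChars, long elapsedNanos) {
        if (elapsedNanos <= 0) {
            throw new IllegalArgumentException("elapsedNanos must be positive");
        }
        double bytes = totalChars * 2.0;                 // 2 bytes per Java char
        double seconds = elapsedNanos / 1_000_000_000.0; // e.g. from System.nanoTime() deltas
        return bytes / (1024.0 * 1024.0) / seconds;      // assumes 1 MB = 1024 * 1024 bytes
    }
}
```

Note that this measures in-memory `String` size rather than the size of the input as encoded on disk; ASCII-heavy text stored as UTF-8 would be roughly half as large.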
Heap delta is measured via `MemoryMXBean.getHeapMemoryUsage()`
before and after the full dataset pass, after a forced GC cycle.
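A minimal sketch of this kind of measurement is shown below. It assumes a GC is requested before each of the two readings, which is one possible reading of the description; the class and method names are hypothetical. `MemoryMXBean.gc()` is equivalent to `System.gc()`, so the JVM treats it as a request and the resulting delta is approximate.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.MemoryMXBean;

/** Illustrative sketch; names are hypothetical, not the project's API. */
final class HeapDeltaSketch {

    /** Requests a GC, then reads the used heap in bytes. */
    static long usedHeapAfterGc() {
        MemoryMXBean bean = ManagementFactory.getMemoryMXBean();
        bean.gc(); // a request, not a guarantee; equivalent to System.gc()
        return bean.getHeapMemoryUsage().getUsed();
    }

    /** Runs the workload and returns the change in used heap, in MB (1024 * 1024 bytes). */
    static double heapDeltaMb(Runnable workload) {
        long before = usedHeapAfterGc();
        workload.run();
        long after = usedHeapAfterGc();
        return (after - before) / (1024.0 * 1024.0);
    }
}
```

A call such as `HeapDeltaSketch.heapDeltaMb(() -> runFullDatasetPass())`, where `runFullDatasetPass` stands in for the benchmark loop, would return the delta. The value can be negative if the collector reclaims more between readings than the pass retains.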
The benchmark Maven profile targets `**/*BenchmarkTest.java`.
No extra model files or environment variables are required for the heuristic and ML configurations.