A
Public Data Residual PII Detector
4.05
Derivation Chain
Step 1
Tightened public data center safety standards + migration to private cloud
→
Step 2
Residual PII risk in migration-target data
→
Step 3
Automated detection of un-de-identified personal information in public datasets
Problem
Public agencies must de-identify data before migrating to private cloud, but it is extremely difficult to manually find residual PII — names, national ID numbers, phone numbers, addresses — buried in unstructured text fields across hundreds of thousands of records. Even a single de-identification failure constitutes a Personal Information Protection Act violation, risking fines and media exposure.
Solution
(1) Scan DB tables/files to auto-detect PII patterns (national ID numbers, phone numbers, emails, addresses, names) in unstructured text, (2) display results per record with masking recommendations, (3) generate a de-identification completion certification report for audit readiness.
NUMR-V Scores
NUMR-V Scoring System
| N Novelty | 1-5 | How uncommon the service is in market context. |
| U Urgency | 1-5 | How urgently users need this problem solved now. |
| M Market | 1-5 | Market size and growth potential from proxy indicators. |
| R Realizability | 1-5 | Buildability for a small team with realistic constraints. |
| V Validation | 1-5 | Validation signal quality from competition and demand data. |
SaaS N=.15 U=.20 M=.15 R=.30 V=.20
Senior N=.25 U=.25 M=.05 R=.30 V=.15
Feasibility (71%)
Data Availability
21.7/25
Feasibility Breakdown
| Tech Complexity | / 40 | Difficulty of core implementation stack. |
| Data Availability | / 25 | Practical availability and cost of required data. |
| MVP Timeline | / 20 | Expected time to ship a usable MVP. |
| API Bonus | / 15 | Bonus for viable public API leverage. |
Market Validation (68/100)
Validation Breakdown
| Competition | / 20 | Signal quality from competitor landscape. |
| Market Demand | / 20 | Demand proxies from search and mention patterns. |
| Timing | / 20 | Fit with current shifts in tech, behavior, and regulation. |
| Revenue Signals | / 15 | Reference evidence for monetization viability. |
| Pick-Axe Fit | / 15 | How well the concept serves participants in a trend. |
| Solo Buildability | / 10 | Practicality for lean-team implementation. |
Technical Requirements
Backend [medium]
AI/ML [medium]
Frontend [low]