Roadmap

Known limitations and open problems. If something interests you, open an issue or pull request on GitHub.

  • No semantic understanding of privacy. Regex catches structured PII like emails and phone numbers. It misses anything contextual — journal entries, health history, relationship details, anything that's private but has no fixed format.
  • No user-defined sensitive patterns. You can't tell it "treat anything about my company X as private." It only knows what it was built to know.
  • Redaction breaks context. The LLM gets [NAME_1] but loses the relationship between that name and everything else in the prompt. For complex reasoning tasks this degrades the answer quality.
  • Streaming restoration is best-effort. Tokens that get split across SSE chunks can fail to restore correctly.
  • No multi-modal support. Images, PDFs, audio — anything that isn't text passes through unscanned.
  • Single conversation scope. Mappings don't persist across sessions. Long-running agents or multi-session workflows re-expose the same PII under different tokens.