LCORE-1831: Implement Redaction Safety Capability in Pydantic AI by arin-deloatch · Pull Request #1915 · lightspeed-core/lightspeed-stack

arin-deloatch · 2026-06-11T21:57:08Z

Description

Add a regex-based PII redaction capability for pydantic-ai agents. This introduces:

Core engine (core.py): redact_text() function and immutable RedactionResult model for
sequential regex-based text substitution
Configuration (config.py): RedactionRule and RedactionConfig Pydantic models with
compile-time pattern validation and per-rule/global case sensitivity controls
Capability (capability.py): PiiRedactionCapability integrating with pydantic-ai's
AbstractCapability to redact user prompts before model requests and model response text before
returning to the caller

Type of change

Tools used to create PR

Identify any AI code assistants used in this PR (for transparency and review context)

Assisted-by: Claude Code (Claude Opus 4.6)
Generated by: N/A

Related Tickets & Documents

Closes LCORE-1831

Checklist before requesting a review

I have performed a self-review of my code.
PR has passed all pre-merge test jobs.
If it is a core feature, I have added thorough tests.

Testing

uv run make format — passes, no reformats
uv run make verify — all linters pass (black, pylint 10/10, pyright 0 errors, ruff, pydocstyle,
mypy, lint-openapi)
uv run pytest tests/unit/pydantic_ai_lightspeed/capabilities/ -v — 51/51 tests pass
Coverage: 99% (2 uncovered lines in capability.py)

Summary by CodeRabbit

New Features
- Added a configurable PII redaction capability that scans user prompts and model responses, applies ordered regex rules, and replaces matched text according to customizable replacements and case-sensitivity options.
Tests
- Added comprehensive unit tests covering redaction configuration, core redaction behavior, and request/response lifecycle integration.

coderabbitai · 2026-06-11T21:57:21Z

Note

Currently processing new changes in this PR. This may take a few minutes, please wait...

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

Run ID: 8c1143e5-f894-4e27-989f-51c55c0deafe

📥 Commits

Reviewing files that changed from the base of the PR and between d5a97ab and 271c1f2.

📒 Files selected for processing (11)

src/models/config.py
src/pydantic_ai_lightspeed/capabilities/__init__.py
src/pydantic_ai_lightspeed/capabilities/redaction/__init__.py
src/pydantic_ai_lightspeed/capabilities/redaction/capability.py
src/pydantic_ai_lightspeed/capabilities/redaction/config.py
src/pydantic_ai_lightspeed/capabilities/redaction/core.py
tests/unit/pydantic_ai_lightspeed/capabilities/__init__.py
tests/unit/pydantic_ai_lightspeed/capabilities/redaction/__init__.py
tests/unit/pydantic_ai_lightspeed/capabilities/redaction/test_capability.py
tests/unit/pydantic_ai_lightspeed/capabilities/redaction/test_config.py
tests/unit/pydantic_ai_lightspeed/capabilities/redaction/test_core.py

 __________________________________________________________________________________________________________________________
< Unix was not designed to stop its users from doing stupid things, as that would also stop them from doing clever things. >
 --------------------------------------------------------------------------------------------------------------------------
  \
   \   \
        \ /\
        ( )
      .( o ).

Walkthrough

This pull request introduces a configurable PII redaction capability for Pydantic AI Lightspeed Core Stack. The implementation provides regex-based pattern matching and replacement logic, configuration models with compiled pattern caching, and integration with Pydantic AI's request/response lifecycle hooks.

Changes

PII Redaction Capability Implementation

Layer / File(s)	Summary
Core redaction types and text processing `src/pydantic_ai_lightspeed/capabilities/redaction/core.py`, `tests/unit/pydantic_ai_lightspeed/capabilities/redaction/test_core.py`	`CompiledPatterns` type alias and `RedactionResult` frozen model define the redaction output shape. `redact_text` applies compiled regex patterns sequentially to input, accumulating substitution counts and returning redaction metadata. Tests validate immutability, no-match passthrough, single/multiple pattern application, and case-sensitivity behavior.
Configuration models and pattern compilation `src/pydantic_ai_lightspeed/capabilities/redaction/config.py`, `tests/unit/pydantic_ai_lightspeed/capabilities/redaction/test_config.py`	`RedactionRule` captures pattern, replacement, and optional case-sensitive override. `RedactionConfig` holds ordered rules, global case-sensitivity flag, and compiles patterns at model construction time via `@model_validator`, exposing compiled patterns through a defensive copy property. Tests cover rule construction, regex compilation, case-sensitivity handling, and property immutability.
PiiRedactionCapability and message traversal `src/pydantic_ai_lightspeed/capabilities/redaction/capability.py`, `tests/unit/pydantic_ai_lightspeed/capabilities/redaction/test_capability.py`	Helpers (`_redact_string_content`, `_redact_text_content`, `_redact_content_item/list`, `_redact_user_prompt_part`, `_redact_message_parts`, `_redact_model_request`, `_redact_messages`, `_redact_response`) recursively traverse and redact message structures, preserving identity when no changes occur. `PiiRedactionCapability` class implements `before_model_request` (redacts user prompts) and `after_model_request` (redacts response text parts), wiring into Pydantic AI lifecycle. Tests validate redaction across content types and lifecycle hook behavior.
Public API and package structure `src/pydantic_ai_lightspeed/capabilities/__init__.py`, `src/pydantic_ai_lightspeed/capabilities/redaction/__init__.py`, `tests/unit/pydantic_ai_lightspeed/capabilities/__init__.py`, `tests/unit/pydantic_ai_lightspeed/capabilities/redaction/__init__.py`	Module docstrings document package purpose. Redaction subpackage `__all__` list exposes `PiiRedactionCapability`, `RedactionConfig`, `RedactionRule`, `RedactionResult`, and `redact_text` as public API.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Suggested reviewers

asimurka
tisnik

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title directly and accurately describes the main change: implementation of a redaction safety capability for Pydantic AI. It is concise, clear, and includes the ticket identifier.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

✨ Simplify code

Create PR with simplified code

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Tip

You can disable sequence diagrams in the walkthrough.

Disable the reviews.sequence_diagrams setting to disable sequence diagrams in the walkthrough.

anik120

I know this is not part of the scope of this PR, but is src/pydantic_ai... leaking implementation detail again @jrobertboos?

asimurka

LGTM in overall

asimurka · 2026-06-16T09:47:24Z

+
+
+def _redact_message_parts(
+    parts: Sequence[Any], compiled_patterns: CompiledPatterns


Try to avoid Any in the whole module where possible.

asimurka · 2026-06-16T09:55:30Z

+
+def _redact_string_content(
+    text: str, compiled_patterns: CompiledPatterns
+) -> str | None:


Prefer using Optional in the whole module.

Thank you for the feedback; addressed in d5a97ab

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/pydantic_ai_lightspeed/capabilities/redaction/capability.py`:
- Line 1: Add module-level logging support by importing get_logger from log.py
and creating a logger instance at the top of the file after the module docstring
using logger = get_logger(__name__). This logger should then be used throughout
the capability module to audit PII redaction events, such as logging at debug
level when redaction rules match specific patterns and at info level for
redaction statistics and summaries. This approach aligns with coding guidelines
and provides valuable audit trails for the security-sensitive PII redaction
functionality.
- Line 5: Update all type annotations in the module to use modern pipe syntax
instead of Optional. Replace Optional[Type] with Type | None for all return type
annotations in the functions including _redact_text_content,
_redact_content_list, _redact_message_parts, and _redact_model_request. After
updating all function return types throughout the module, remove Optional from
the import statement at the top of the file since it will no longer be needed.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

Run ID: 7e09e4f4-9b9b-4dcf-a8b9-cf61a5e8e1e4

📥 Commits

Reviewing files that changed from the base of the PR and between f5586a7 and d5a97ab.

📒 Files selected for processing (10)

src/pydantic_ai_lightspeed/capabilities/__init__.py
src/pydantic_ai_lightspeed/capabilities/redaction/__init__.py
src/pydantic_ai_lightspeed/capabilities/redaction/capability.py
src/pydantic_ai_lightspeed/capabilities/redaction/config.py
src/pydantic_ai_lightspeed/capabilities/redaction/core.py
tests/unit/pydantic_ai_lightspeed/capabilities/__init__.py
tests/unit/pydantic_ai_lightspeed/capabilities/redaction/__init__.py
tests/unit/pydantic_ai_lightspeed/capabilities/redaction/test_capability.py
tests/unit/pydantic_ai_lightspeed/capabilities/redaction/test_config.py
tests/unit/pydantic_ai_lightspeed/capabilities/redaction/test_core.py

📜 Review details

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (6)

GitHub Check: E2E: server mode / ci / group 1
GitHub Check: E2E: library mode / ci / group 2
GitHub Check: E2E: library mode / ci / group 1
GitHub Check: E2E: server mode / ci / group 2
GitHub Check: E2E: server mode / ci / group 3
GitHub Check: E2E: library mode / ci / group 3

🧰 Additional context used

📓 Path-based instructions (3)

tests/**/*.py

📄 CodeRabbit inference engine (AGENTS.md)

tests/**/*.py: Use pytest for all unit and integration tests; do not use unittest
Use pytest.mark.asyncio marker for async tests

Files:

tests/unit/pydantic_ai_lightspeed/capabilities/__init__.py
tests/unit/pydantic_ai_lightspeed/capabilities/redaction/__init__.py
tests/unit/pydantic_ai_lightspeed/capabilities/redaction/test_core.py
tests/unit/pydantic_ai_lightspeed/capabilities/redaction/test_capability.py
tests/unit/pydantic_ai_lightspeed/capabilities/redaction/test_config.py

src/**/*.py

📄 CodeRabbit inference engine (AGENTS.md)

src/**/*.py: Use absolute imports for internal modules: from authentication import get_auth_dependency
Llama Stack imports: Use from llama_stack_client import AsyncLlamaStackClient
Check constants.py for shared constants before defining new ones
All modules must start with descriptive docstrings explaining purpose
Use logger = get_logger(__name__) from log.py for module logging
All functions must have complete type annotations for parameters and return types, use modern syntax (str | int), and include descriptive docstrings
Use snake_case with descriptive, action-oriented names for functions (get_, validate_, check_)
Avoid in-place parameter modification anti-patterns; return new data structures instead of modifying function parameters
Use async def for I/O operations and external API calls
Use standard log levels with clear purposes: debug() for diagnostic info, info() for program execution, warning() for unexpected events, error() for serious problems
All classes must have descriptive docstrings explaining purpose and use PascalCase with standard suffixes: Configuration, Error/Exception, Resolver, Interface
Abstract classes must use ABC with @abstractmethod decorators
Follow Google Python docstring conventions with required sections: Parameters, Returns, Raises, and Attributes for classes

Files:

src/pydantic_ai_lightspeed/capabilities/__init__.py
src/pydantic_ai_lightspeed/capabilities/redaction/__init__.py
src/pydantic_ai_lightspeed/capabilities/redaction/core.py
src/pydantic_ai_lightspeed/capabilities/redaction/config.py
src/pydantic_ai_lightspeed/capabilities/redaction/capability.py

src/**/__init__.py

📄 CodeRabbit inference engine (AGENTS.md)

Package __init__.py files must contain brief package descriptions

Files:

src/pydantic_ai_lightspeed/capabilities/__init__.py
src/pydantic_ai_lightspeed/capabilities/redaction/__init__.py

🔇 Additional comments (6)

src/pydantic_ai_lightspeed/capabilities/redaction/config.py (1)

14-107: LGTM!

tests/unit/pydantic_ai_lightspeed/capabilities/redaction/test_config.py (1)

1-157: LGTM!

src/pydantic_ai_lightspeed/capabilities/redaction/capability.py (2)

31-258: LGTM!

261-325: LGTM!

tests/unit/pydantic_ai_lightspeed/capabilities/redaction/test_capability.py (1)

1-316: LGTM!

src/pydantic_ai_lightspeed/capabilities/redaction/__init__.py (1)

1-21: LGTM!

asimurka · 2026-06-17T07:00:29Z

/ok-to-test

asimurka

I personally like this decomposition to capability, core and config modules.

asimurka · 2026-06-17T07:40:24Z

+
+
+@dataclass
+class PiiRedactionCapability(AbstractCapability[Any]):


Use None for type argument as this corresponds to current implementation of lightspeed agent (with no dependencies). Use None as type annotation where possible.

tisnik · 2026-06-18T13:05:45Z

/ok-to-test

tisnik · 2026-06-18T13:27:47Z

@@ -0,0 +1,107 @@
+"""Configuration models for PII redaction rules."""


IMHO this config (or rather config classes/models) should be added into src/models/config.py. At least to avoid circular dependencies, to allow us to generate documentation etc.

Understood, it was my initial understanding that we were to keep everything pydantic_ai related separate to avoid any sorts of conflicts with the core source code.

However, I am more than happy to make the changes necessary! Just to be thorough, it is expected that RedactionRule,RedactionConfig and RedactionResult are being moved to src/models/config.py, correct?

Correct. This will be later part of LCORE config so it will be technically implementation agnostic.

…itution

…ern validation

…t tests

…capability. Use Optional for nullable returns and substitute Any with UserContent, ModelRequestPart, and ModelResponsePart for type-safe message handling.

jrobertboos

LGTM

anik120 reviewed Jun 12, 2026

View reviewed changes

asimurka requested changes Jun 16, 2026

View reviewed changes

arin-deloatch force-pushed the feat/LCORE-1831 branch from f5586a7 to d5a97ab Compare June 16, 2026 16:37

coderabbitai Bot reviewed Jun 16, 2026

View reviewed changes

Comment thread src/pydantic_ai_lightspeed/capabilities/redaction/capability.py

Comment thread src/pydantic_ai_lightspeed/capabilities/redaction/capability.py

openshift-ci Bot added the ok-to-test label Jun 17, 2026

asimurka reviewed Jun 17, 2026

View reviewed changes

asimurka approved these changes Jun 18, 2026

View reviewed changes

tisnik requested changes Jun 18, 2026

View reviewed changes

arin-deloatch added 7 commits June 18, 2026 12:27

LCORE-1831: Add core PII redaction engine with regex-based text subst…

901b7a2

…itution

LCORE-1831: Add redaction configuration models with compile-time patt…

5ca3cee

…ern validation

LCORE-1831: Add PiiRedactionCapability for pydantic-ai agents and uni…

eb1000d

…t tests

LCORE-1831: Replace Any with concrete pydantic-ai types in redaction …

e76c6aa

…capability. Use Optional for nullable returns and substitute Any with UserContent, ModelRequestPart, and ModelResponsePart for type-safe message handling.

LCORE-1831: Move redaction config models to src/models/config.py

1b66eca

Update redaction module to re-export from models.config

4f7a369

LCORE-1831: update redaction test imports to use models.config

271c1f2

arin-deloatch force-pushed the feat/LCORE-1831 branch from d5a97ab to 271c1f2 Compare June 18, 2026 19:27

jrobertboos approved these changes Jun 18, 2026

View reviewed changes



		def _redact_message_parts(
		parts: Sequence[Any], compiled_patterns: CompiledPatterns



		@dataclass
		class PiiRedactionCapability(AbstractCapability[Any]):

		@@ -0,0 +1,107 @@
		"""Configuration models for PII redaction rules."""

Conversation

arin-deloatch commented Jun 11, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of change

Tools used to create PR

Related Tickets & Documents

Checklist before requesting a review

Testing

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Jun 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Suggested reviewers

Uh oh!

anik120 left a comment

Choose a reason for hiding this comment

Uh oh!

asimurka left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

asimurka commented Jun 17, 2026

Uh oh!

asimurka left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tisnik commented Jun 18, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jrobertboos left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

arin-deloatch commented Jun 11, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Jun 11, 2026 •

edited

Loading