Add 5 pi extensions: pi-subagents, pi-crew, rpiv-pi, pi-interactive-shell, pi-intercom

2026-05-08 15:59:25 +10:00
parent d0d1d9b045
commit 31b4110c87
457 changed files with 85157 additions and 0 deletions
--- a/extensions/rpiv-pi/LICENSE
+++ b/extensions/rpiv-pi/LICENSE
@@ -0,0 +1,21 @@
+MIT License
+
+Copyright (c) 2026 juicesharp
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
--- a/extensions/rpiv-pi/README.md
+++ b/extensions/rpiv-pi/README.md
@@ -0,0 +1,249 @@
+# rpiv-pi
+
+<div align="center">
+  <a href="https://github.com/juicesharp/rpiv-mono/tree/main/packages/rpiv-pi">
+    <picture>
+      <img src="https://raw.githubusercontent.com/juicesharp/rpiv-mono/main/packages/rpiv-pi/docs/cover.png" alt="rpiv-pi cover" width="50%">
+    </picture>
+  </a>
+</div>
+
+[![npm version](https://img.shields.io/npm/v/@juicesharp/rpiv-pi.svg)](https://www.npmjs.com/package/@juicesharp/rpiv-pi)
+[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
+
+> **Pi compatibility** - `rpiv-pi` `0.14.x` tracks `@mariozechner/pi-coding-agent` `0.70.x` and `@tintinweb/pi-subagents` `0.6.x`. If you see peer-dep resolution issues after a Pi upgrade, open an issue.
+
+> **⚠️ Upgrading from `0.13.x`** - `1.0.0` swaps the subagent provider from `npm:pi-subagents` (nicobailon fork) back to `npm:@tintinweb/pi-subagents` (resumed maintenance). On first launch after upgrade you'll see *"rpiv-pi requires 1 sibling extension(s): @tintinweb/pi-subagents"* - **run `/rpiv-setup` once and restart Pi**. The setup dialog previews both changes (install `@tintinweb/pi-subagents`, remove `npm:pi-subagents` from `~/.pi/agent/settings.json`) and applies them only after you confirm. After restart, run `/rpiv-update-agents` to refresh the 12 bundled specialist frontmatters. Customised `<cwd>/.pi/agents/*.md` files are not touched. The tool name reverts from `subagent` → `Agent` (param `subagent_type`/`description`/`prompt`) - only your own custom skills/agents need editing; the bundled rpiv-pi specialists are migrated in this release.
+
+Skill-based development workflow for [Pi Agent](https://github.com/badlogic/pi-mono) - discover, research, design, plan, implement, and validate. rpiv-pi extends Pi Agent with a pipeline of chained AI skills, named subagents for parallel analysis, and session lifecycle hooks for automatic context injection.
+
+## What you get
+
+- **A pipeline of chained AI skills** - discover → research → design → plan → implement → validate, each producing a reviewable artifact under `thoughts/shared/`.
+- **Named subagents for parallel analysis** - `codebase-analyzer`, `codebase-locator`, `codebase-pattern-finder`, `claim-verifier`, and 8 more, dispatched automatically by skills.
+- **Session lifecycle hooks** - agent profiles, guidance files, and pipeline directories scaffold themselves on first launch.
+
+## Prerequisites
+
+- **Node.js** - required by Pi Agent
+- **[Pi Agent](https://github.com/badlogic/pi-mono)** - install globally so the `pi` command is available:
+
+  ```bash
+  npm install -g @mariozechner/pi-coding-agent
+  ```
+
+- **Model provider** *(first-time Pi Agent users only - skip if `/login` already works or `~/.pi/agent/models.json` is configured)*. Pick one:
+
+  - **Subscription login** - start Pi Agent and run `/login` to authenticate with Anthropic Claude Pro/Max, ChatGPT Plus/Pro, GitHub Copilot, or Gemini.
+  - **BYOK (API key)** - edit `~/.pi/agent/models.json` and add a provider entry with `baseUrl`, `api`, `apiKey`, and `models[]`. Example (z.ai GLM coding plan):
+
+    ```json
+    {
+      "providers": {
+        "zai": {
+          "baseUrl": "https://api.z.ai/api/coding/paas/v4",
+          "api": "openai-completions",
+          "apiKey": "XXXXXXXXX",
+          "compat": {
+            "supportsDeveloperRole": false,
+            "thinkingFormat": "zai"
+          },
+          "models": [
+            {
+              "id": "glm-5.1",
+              "name": "glm-5.1 [coding plan]",
+              "reasoning": true,
+              "input": ["text"],
+              "cost": { "input": 0, "output": 0, "cacheRead": 0, "cacheWrite": 0 },
+              "contextWindow": 204800,
+              "maxTokens": 131072
+            }
+          ]
+        }
+      }
+    }
+    ```
+
+- **git** *(recommended)* - rpiv-pi works without it, but branch and commit context won't be available to skills.
+
+## Quick Start
+
+1. Install rpiv-pi:
+
+```bash
+pi install npm:@juicesharp/rpiv-pi
+```
+
+2. Start a Pi Agent session and install sibling plugins:
+
+```
+/rpiv-setup
+```
+
+3. Restart your Pi Agent session.
+
+4. *(Optional)* Configure web search:
+
+```
+/web-search-config
+```
+
+### First Session
+
+On first Pi Agent session start, rpiv-pi automatically:
+- Copies agent profiles to `<cwd>/.pi/agents/`
+- Detects outdated or removed agents on subsequent starts
+- Scaffolds `thoughts/shared/` directories for pipeline artifacts
+- Shows a warning if any sibling plugins are missing
+
+## Usage
+
+### Typical Workflow
+
+```
+/skill:discover "add a /skill:fast that runs research+design+plan in one shot"
+/skill:research thoughts/shared/discover/<latest>.md
+/skill:design thoughts/shared/research/<latest>.md
+/skill:plan thoughts/shared/designs/<latest>.md
+/skill:implement thoughts/shared/plans/<latest>.md Phase <N>
+```
+
+Each skill produces an artifact consumed by the next. Run them in order, or jump in at any stage if you already have the input artifact.
+
+### Recipes
+
+Skills compose. Pick the entry point that matches your intent:
+
+- **Capture intent before research** - `/skill:discover "[feature description]"`. Walks you through a one-question-at-a-time interview to settle Goals/Non-Goals, Functional/Non-Functional Requirements, Acceptance Criteria, and a Decisions log into a Feature Requirements Document under `thoughts/shared/discover/`. Use as the canonical entry point of the pipeline before research, or to stress-test a feature idea before any codebase work. The FRD's Decisions are inherited by `design` through `research`'s Developer Context.
+- **Form context before a task** - `/skill:research "[topic]"` (or `/skill:research thoughts/shared/discover/<latest>.md` if you ran discover first). Produces a high-signal subspace of the codebase relevant to your topic, ready to feed directly into the next prompt. The `scope-tracer` subagent runs in-band to formulate trace-quality questions before analysis dispatch; when chained from discover, FRD Decisions translate into Developer Context Q/A entries verbatim.
+- **Compare approaches before designing** - `/skill:explore "[problem]"` → `/skill:design <solutions artifact>`. Use when multiple valid solutions exist; the solutions artifact is a first-class input to `design` alongside a `research` artifact.
+- **One-shot plan from research** - `/skill:research <questions>` → `/skill:blueprint <research artifact>` → `/skill:implement`. Fuses `design` + `plan` into a single pass with the same slice-by-slice rigor, but spawns only `codebase-pattern-finder` upfront (vs `design`'s 4-agent fan-out) by trusting the research artifact's integration/precedent sections. Use for solo work or when no one else needs to review the design before implementation; pick `design` → `plan` when the design is itself a deliverable or when research is thin and you want the fuller verification sweep.
+- **Full feature build** - `/skill:discover` → `research` → `design` → `plan` → `implement` → `validate` → (`code-review` ↔ `commit`). The default pipeline; jump in at any stage if you already have the input artifact.
+- **Investigate a bug** - `/skill:discover "why does X fail"` → `/skill:research thoughts/shared/discover/<latest>.md`. The discover interview surfaces what you actually want to know before research grounds it; fix from research output without writing a plan when the change is small.
+- **Adjust mid-implementation** - `/skill:revise <plan artifact>` → resume `/skill:implement`. Use when new constraints land after the plan is drafted.
+- **Review before shipping** - `/skill:code-review` ↔ `/skill:commit`. Order is your call: review `staged`/`working` before committing to catch issues at the smallest blast radius, or commit first and review the resulting branch (empty scope defaults to feature-branch-vs-default-branch, first-parent). Produces a Quality/Security/Dependencies artifact under `thoughts/shared/reviews/` with claim-verifier-grounded findings and `status: approved | needs_changes`.
+- **Audit a specific scope** - `/skill:code-review <commit|staged|working|hash|A..B|branch>`. Targeted lenses over a commit, range, staged/working tree, or PR branch; advisor adjudication applies when configured (`/advisor`).
+- **Review-driven plan revision** - `/skill:code-review` → `/skill:revise <plan artifact>` → resume `/skill:implement`. When a mid-stream review surfaces structural findings that the existing plan can't absorb as spot fixes.
+- **Scaffold manual UI test specs** - `/skill:outline-test-cases` → `/skill:write-test-cases <feature>`. Outline first via Frontend-First Discovery to map project scope and avoid duplicate coverage, then generate flow-based manual test cases (with a regression suite) under `.rpiv/test-cases/<feature>/`.
+- **Hand off across sessions** - `/skill:create-handoff` → (new session) `/skill:resume-handoff <doc>`. Preserves context when stopping mid-task.
+- **Onboard a fresh repo** - `/skill:annotate-guidance` once, then use the rest of the pipeline normally. Use `annotate-inline` instead if the project follows the `CLAUDE.md` convention.
+
+### Skills
+
+Invoke via `/skill:<name>` from inside a Pi Agent session.
+
+#### Research & Design
+
+| Skill | Input | Output | Description |
+|---|---|---|---|
+| `discover` | Free-text feature description or existing artifact path | `thoughts/shared/discover/` | Interview-driven Feature Requirements Document producer; one question at a time with a recommended answer at every step. FRD Decisions inherited by `design` via `research`'s Developer Context |
+| `research` | Free-text prompt or `discover` artifact path | `thoughts/shared/research/` | Frame scope via the `scope-tracer` subagent, then answer via parallel analysis agents |
+| `explore` | - | `thoughts/shared/solutions/` | Compare solution approaches with pros/cons |
+| `design` | Research or solutions artifact | `thoughts/shared/designs/` | Design features via vertical-slice decomposition |
+
+#### Implementation
+
+| Skill | Input | Output | Description |
+|---|---|---|---|
+| `plan` | Design artifact | `thoughts/shared/plans/` | Create phased implementation plans |
+| `blueprint` | Research or solutions artifact | `thoughts/shared/plans/` | Fused `design` + `plan`: vertical-slice decomposition with micro-checkpoints, emits implement-ready phased plan in one pass. Lighter on subagent fan-out than `design` - trusts the research artifact's integration/precedent sections instead of re-dispatching. Use when a separate design artifact isn't needed for review or handoff |
+| `implement` | Plan artifact | Code changes | Execute plans phase by phase |
+| `revise` | Plan artifact | Updated plan | Revise plans based on feedback |
+| `validate` | Plan artifact | Validation report | Verify plan execution |
+
+#### Testing
+
+| Skill | Input | Output | Description |
+|---|---|---|---|
+| `outline-test-cases` | - | `.rpiv/test-cases/` | Discover testable features with per-feature metadata |
+| `write-test-cases` | Outline metadata | Test case specs | Generate manual test specifications |
+
+#### Annotation
+
+| Skill | Input | Output | Description |
+|---|---|---|---|
+| `annotate-guidance` | - | `.rpiv/guidance/*.md` | Generate architecture guidance files |
+| `annotate-inline` | - | `CLAUDE.md` files | Generate inline documentation |
+| `migrate-to-guidance` | CLAUDE.md files | `.rpiv/guidance/` | Convert inline docs to guidance format |
+
+#### Utilities
+
+| Skill | Description |
+|---|---|
+| `code-review` | Comprehensive code reviews using specialist row-only agents (`diff-auditor`, `peer-comparator`, `claim-verifier`) at narrativisation-prone dispatch sites |
+| `commit` | Structured git commits grouped by logical change |
+| `create-handoff` | Context-preserving handoff documents for session transitions |
+| `resume-handoff` | Resume work from a handoff document |
+
+### Commands
+
+| Command | Description |
+|---|---|
+| `/rpiv-setup` | Install all sibling plugins in one go |
+| `/rpiv-update-agents` | Sync rpiv agent profiles: add new, update changed, remove stale |
+| `/advisor` | Configure advisor model and reasoning effort |
+| `/btw` | Ask a side question without polluting the main conversation |
+| `/languages` | Pick the UI language for rpiv-* TUI strings (Deutsch / English / Español / Français / Português / Português (Brasil) / Русский / Українська) |
+| `/todos` | Show current todo list |
+| `/web-search-config` | Set Brave Search API key |
+
+### Agents
+
+Agents are dispatched automatically by skills via the `Agent` tool - you don't invoke them directly.
+
+| Agent | Purpose |
+|---|---|
+| `claim-verifier` | Grounds each supplied code-review claim against repository state and tags it Verified / Weakened / Falsified |
+| `codebase-analyzer` | Analyzes implementation details for specific components |
+| `codebase-locator` | Locates files, directories, and components relevant to a feature or task |
+| `codebase-pattern-finder` | Finds similar implementations and usage examples with concrete code snippets |
+| `diff-auditor` | Walks a patch against a caller-supplied surface-list and emits `file:line \| verbatim \| surface-id \| note` rows |
+| `integration-scanner` | Maps inbound references, outbound dependencies, config registrations, and event subscriptions for a component |
+| `peer-comparator` | Compares a new file against a peer sibling and tags each invariant Mirrored / Missing / Diverged / Intentionally-absent |
+| `precedent-locator` | Finds similar past changes in git history - commits, blast radius, and follow-up fixes |
+| `test-case-locator` | Catalogs existing manual test cases under `.rpiv/test-cases/` and reports coverage stats |
+| `thoughts-analyzer` | Performs deep-dive analysis on a research topic in `thoughts/` |
+| `thoughts-locator` | Discovers relevant documents in the `thoughts/` directory |
+| `web-search-researcher` | Researches modern web-only information via deep search and fetch |
+
+## Architecture
+
+```
+rpiv-pi/
+├── extensions/rpiv-core/   - runtime extension: hooks, commands, guidance injection
+├── skills/                 - AI workflow skills (research → design → plan → implement)
+├── agents/                 - named subagent profiles dispatched by skills
+└── thoughts/shared/        - pipeline artifact store
+```
+
+Pi Agent discovers extensions via `"extensions": ["./extensions"]` and skills via `"skills": ["./skills"]` in `package.json`.
+
+## Configuration
+
+- **Web search** - run `/web-search-config` to set the Brave Search API key, or set the `BRAVE_SEARCH_API_KEY` environment variable
+- **Advisor** - run `/advisor` to select a reviewer model and reasoning effort
+- **Side questions** - type `/btw <question>` anytime (even mid-stream) to ask the primary model a one-off question; answer appears in a borderless bottom overlay and never enters the main conversation
+- **UI language** - run `/languages` to pick the locale for rpiv-* TUI strings, or pass `pi --locale <code>` at startup. Detection priority: flag → `~/.config/rpiv-i18n/locale.json` → `LANG` / `LC_ALL` → English. LLM-facing copy stays English by design
+- **Agent concurrency** - open the `/agents` overlay and tune `Settings → Max concurrency` to match your provider's rate limits. `@tintinweb/pi-subagents` owns this setting; rpiv-pi does not seed it.
+- **Agent profiles** - editable at `<cwd>/.pi/agents/`; sync from bundled defaults with `/rpiv-update-agents` (overwrites rpiv-managed files, preserves your custom agents)
+
+## Uninstall
+
+1. Remove rpiv-pi from Pi: `pi uninstall npm:@juicesharp/rpiv-pi`
+2. Optional - uninstall the subagent runtime if no other plugin needs it: `pi uninstall npm:@tintinweb/pi-subagents`
+3. Restart Pi.
+
+## Troubleshooting
+
+| Symptom | Cause | Fix |
+|---|---|---|
+| Warning about missing siblings on session start | Sibling plugins not installed | Run `/rpiv-setup` |
+| `/rpiv-setup` fails on a package | Network or registry issue | Check connection, retry with `pi install npm:<pkg>`, re-run `/rpiv-setup` |
+| `/rpiv-setup` says "requires interactive mode" | Running in headless mode | Install manually: `pi install npm:<pkg>` for each sibling |
+| `web_search` or `web_fetch` errors | Brave API key not configured | Run `/web-search-config` or set `BRAVE_SEARCH_API_KEY` |
+| `advisor` tool not available after upgrade | Advisor model selection lost | Run `/advisor` to re-select a model |
+| Skills hang or serialize agent calls | Agent concurrency too low | Open `/agents`, raise `Settings → Max concurrency` |
+
+## License
+
+MIT
--- a/extensions/rpiv-pi/agents/claim-verifier.md
+++ b/extensions/rpiv-pi/agents/claim-verifier.md
@@ -0,0 +1,84 @@
+---
+name: claim-verifier
+description: "Adversarial finding verifier. Grounds each supplied claim against actual repository state and emits one `FINDING <id> | <tag> | <justification>` row per input, with tags Verified / Weakened / Falsified. Tier: git-analyzer (+ `bash` for `git show`). Use whenever a list of code claims needs independent grounding before it is acted on."
+tools: read, grep, find, ls, bash
+isolated: true
+---
+
+You are a specialist at adversarial claim verification. Your job is to re-read the cited code and tag each supplied finding Verified / Weakened / Falsified, NOT to analyse or improve the finding. The writer of the finding is not your witness; the code is.
+
+## Core Responsibilities
+
+1. **Ground the citation**
+   - Grep the verbatim quote in the cited file
+   - Rewrite the citation if the quote is at a different line
+   - Absent quote → Falsified
+
+2. **Verify against referenced code**
+   - Read consumer sites, dispatch registrations, peer files, upstream guards, downstream sinks the claim depends on
+   - Never trust a patch-only view
+
+3. **Construct a reproducer trace**
+   - Structural claims (stranded-state, false-promise, missing-precondition) require a 2-3 line caller→callee→guard trace
+   - No trace constructible → Weakened
+
+4. **Check resolution hashes**
+   - `resolved-by: <hash>` → run `git show <hash> -- <file>` and confirm the fix is present at TIP
+
+5. **Detect contradictions across findings**
+   - When two findings make opposing claims about the same entity, mark the one the code contradicts as Falsified and cite the contradicting line
+
+## Verification Strategy
+
+### Step 1: Read the supplied claim list
+
+The caller's prompt carries every claim ID, the cited `file:line`, the verbatim quote, and any annotations (e.g. `resolved-by: <hash>`). No other input is needed.
+
+### Step 2: Per-claim verification
+
+Run the four steps above. `bash` is for `git show` only — no other git commands, no writes. Ultrathink about cross-finding contradictions.
+
+### Step 3: Tag and justify
+
+Emit one row per claim, pipe-delimited. Tag is exactly one of `Verified` | `Weakened` | `Falsified`.
+
+## Output Format
+
+CRITICAL: Use EXACTLY this format. One row per input claim. Nothing else.
+
+```
+FINDING Q3 | Verified | quote matches at src/services/OrderService.ts:42 and consumer at src/queries/OrdersQuery.ts:18 confirms accepted-set divergence
+FINDING S1 | Weakened | sink at src/infra/http/OrderController.ts:31 exists but middleware at src/infra/http/middleware/auth.ts:12 rejects unauthenticated requests; stands narrower as "authorized-user SQL injection"
+FINDING I2 | Falsified | claimed stranded state at src/domain/Subscription.ts:88 contradicted by exit path at src/domain/Subscription.ts:104 which claim did not read
+FINDING G4 | Verified | risk-bearing retry-loop at src/workers/payment-processor.ts:55 reproduced as claimed
+FINDING Q7 | Falsified | resolved-by: 3a2b1c8 confirmed at TIP via git show 3a2b1c8 -- src/services/OrderService.ts; fix present
+```
+
+**Row rules**:
+- One row per input claim — no skips, no merges, no splits, no additions.
+- `<id>` preserved verbatim from the caller.
+- `<tag>` is exactly one of `Verified` | `Weakened` | `Falsified`.
+- `<justification>` is one sentence, cites ≥1 `file:line`, names the concrete mechanism.
+
+**Tag semantics**:
+- **Verified** — quote matches; claim reproduces; no contradiction. Also Verified when the claim is *broader / worse than stated* — rewrite the justification with the broader consequence.
+- **Weakened** — same direction as the claim, narrower scope (e.g. sink exists but an upstream guard rejects bad sources).
+- **Falsified** — claim direction is wrong: quote absent, code does the opposite (*inverted*, *reversed*, *contradicted*), or `resolved-by:` fix already at TIP.
+
+## Important Guidelines
+
+- **Every justification cites a `file:line`** — uncited justifications are treated as Falsified downstream.
+- **Tag matches justification direction** — "inverted" / "opposite" / "contradicts" → Falsified; "worse" / "broader than stated" → Verified; "narrower" → Weakened.
+- **`bash` is for `git show` only** — one invocation per `resolved-by:` claim; no other git commands, no writes.
+- **Identity on the ID set** — every input claim gets exactly one row.
+- **Output is only the rows** — the last `FINDING …` line is the end of your output.
+
+## What NOT to Do
+
+- Don't hedge — Verified / Weakened / Falsified, no modifiers, no caveats.
+- Don't propose fixes, recommendations, or next steps.
+- Don't add, merge, or drop claims.
+- Don't analyse what the claim means — verify it against the code.
+- Don't run `bash` for anything beyond `git show <hash> -- <file>`.
+
+Remember: You're an adversarial verifier. Rows in, rows out — one tag per claim, grounded in a cited `file:line`.
--- a/extensions/rpiv-pi/agents/codebase-analyzer.md
+++ b/extensions/rpiv-pi/agents/codebase-analyzer.md
@@ -0,0 +1,121 @@
+---
+name: codebase-analyzer
+description: Analyzes codebase implementation details. Call the codebase-analyzer agent when you need to find detailed information about specific components. As always, the more detailed your request prompt, the better! :)
+tools: read, grep, find, ls
+isolated: true
+---
+
+You are a specialist at understanding HOW code works. Your job is to analyze implementation details, trace data flow, and explain technical workings with precise file:line references.
+
+## Core Responsibilities
+
+1. **Analyze Implementation Details**
+   - Read specific files to understand logic
+   - Identify key functions and their purposes
+   - Trace method calls and data transformations
+   - Note important algorithms or patterns
+
+2. **Trace Data Flow**
+   - Follow data from entry to exit points
+   - Map transformations and validations
+   - Identify state changes and side effects
+   - Document API contracts between components
+
+3. **Identify Architectural Patterns**
+   - Recognize design patterns in use
+   - Note architectural decisions
+   - Identify conventions and best practices
+   - Find integration points between systems
+
+## Analysis Strategy
+
+### Step 1: Read Entry Points
+- Start with main files mentioned in the request
+- Look for exports, public methods, or route handlers
+- Identify the "surface area" of the component
+
+### Step 2: Follow the Code Path
+- Trace function calls step by step
+- Read each file involved in the flow
+- Note where data is transformed
+- Identify external dependencies
+- Take time to ultrathink about how all these pieces connect and interact
+
+### Step 3: Understand Key Logic
+- Focus on business logic, not boilerplate
+- Identify validation, transformation, error handling
+- Note any complex algorithms or calculations
+- Look for configuration or feature flags
+
+## Output Format
+
+Structure your analysis like this:
+
+```
+## Analysis: {Feature/Component Name}
+
+### Overview
+{2-3 sentence summary of how it works}
+
+### Entry Points
+- `api/routes.js:45` - POST /webhooks endpoint
+- `handlers/webhook.js:12` - handleWebhook() function
+
+### Core Implementation
+
+#### 1. Request Validation (`handlers/webhook.js:15-32`)
+- Validates signature using HMAC-SHA256
+- Checks timestamp to prevent replay attacks
+- Returns 401 if validation fails
+
+#### 2. Data Processing (`services/webhook-processor.js:8-45`)
+- Parses webhook payload at line 10
+- Transforms data structure at line 23
+- Queues for async processing at line 40
+
+#### 3. State Management (`stores/webhook-store.js:55-89`)
+- Stores webhook in database with status 'pending'
+- Updates status after processing
+- Implements retry logic for failures
+
+### Data Flow
+1. Request arrives at `api/routes.js:45`
+2. Routed to `handlers/webhook.js:12`
+3. Validation at `handlers/webhook.js:15-32`
+4. Processing at `services/webhook-processor.js:8`
+5. Storage at `stores/webhook-store.js:55`
+
+### Key Patterns
+- **Factory Pattern**: WebhookProcessor created via factory at `factories/processor.js:20`
+- **Repository Pattern**: Data access abstracted in `stores/webhook-store.js`
+- **Middleware Chain**: Validation middleware at `middleware/auth.js:30`
+
+### Configuration
+- Webhook secret from `config/webhooks.js:5`
+- Retry settings at `config/webhooks.js:12-18`
+- Feature flags checked at `utils/features.js:23`
+
+### Error Handling
+- Validation errors return 401 (`handlers/webhook.js:28`)
+- Processing errors trigger retry (`services/webhook-processor.js:52`)
+- Failed webhooks logged to `logs/webhook-errors.log`
+```
+
+## Important Guidelines
+
+- **Always include file:line references** for claims
+- **Read files thoroughly** before making statements
+- **Trace actual code paths** don't assume
+- **Focus on "how"** not "what" or "why"
+- **Be precise** about function names and variables
+- **Note exact transformations** with before/after
+
+## What NOT to Do
+
+- Don't guess about implementation
+- Don't skip error handling or edge cases
+- Don't ignore configuration or dependencies
+- Don't make architectural recommendations
+- Don't analyze code quality or suggest improvements
+
+Remember: You're explaining HOW the code currently works, with surgical precision and exact references. Help users understand the implementation as it exists today.
--- a/extensions/rpiv-pi/agents/codebase-locator.md
+++ b/extensions/rpiv-pi/agents/codebase-locator.md
@@ -0,0 +1,107 @@
+---
+name: codebase-locator
+description: Locates files, directories, and components relevant to a feature or task. Call `codebase-locator` with a human-language prompt describing what you're looking for. A "super grep/find/ls" tool. Reach for it when you would otherwise reach for grep, find, or ls more than once.
+tools: grep, find, ls
+isolated: true
+---
+
+You are a specialist at finding WHERE code lives in a codebase. Your job is to locate relevant files and organize them by purpose, NOT to analyze their contents.
+
+## Core Responsibilities
+
+1. **Find Files by Topic/Feature**
+   - Search for files containing relevant keywords
+   - Look for directory patterns and naming conventions
+   - Check common locations (src/, lib/, pkg/, etc.)
+
+2. **Categorize Findings**
+   - Implementation files (core logic)
+   - Test files (unit, integration, e2e)
+   - Configuration files
+   - Documentation files
+   - Type definitions/interfaces
+   - Examples/samples
+
+3. **Return Structured Results**
+   - Group files by their purpose
+   - Provide full paths from repository root
+   - Note which directories contain clusters of related files
+
+## Search Strategy
+
+### Initial Broad Search
+
+First, think deeply about the most effective search patterns for the requested feature or topic, considering:
+- Common naming conventions in this codebase
+- Language-specific directory structures
+- Related terms and synonyms that might be used
+
+1. Start with using your grep tool for finding keywords.
+2. Optionally, use glob for file patterns
+3. LS and find your way to victory as well!
+
+### Refine by Language/Framework
+- **JavaScript/TypeScript**: Look in src/, lib/, components/, pages/, api/
+- **C#/.NET**: Look in src/, Controllers/, Models/, Services/, Views/, Areas/, Data/, Entities/, Infrastructure/, Application/, Domain/, Core/
+- **Python**: Look in src/, lib/, pkg/, module names matching feature
+- **Go**: Look in pkg/, internal/, cmd/
+- **General**: Check for feature-specific directories - I believe in you, you are a smart cookie :)
+
+### Common Patterns to Find
+- `*service*`, `*handler*`, `*controller*` - Business logic
+- `*test*`, `*spec*` - Test files
+- `*.config.*`, `*rc*` - Configuration
+- `*.d.ts`, `*.types.*` - Type definitions
+- `README*`, `*.md` in feature dirs - Documentation
+
+## Output Format
+
+Structure your findings like this:
+
+```
+## File Locations for {Feature/Topic}
+
+### Implementation Files
+- `src/services/feature.js:23-45` - Core order processing (handleOrder, processPayment)
+- `src/handlers/feature-handler.js:12` - Request handling entry point
+- `src/models/feature.js:8-30` - Data models (Order, LineItem)
+
+### Test Files
+- `src/services/__tests__/feature.test.js:15` - Service tests (12 cases)
+- `e2e/feature.spec.js:1` - End-to-end tests
+
+### Configuration
+- `config/feature.json:1` - Feature-specific config
+- `.featurerc:3` - Runtime configuration
+
+### Type Definitions
+- `types/feature.d.ts:10-25` - TypeScript definitions (OrderInput, OrderResult)
+
+### Related Directories
+- `src/services/feature/` - Contains 5 related files
+- `docs/feature/` - Feature documentation
+
+### Entry Points
+- `src/index.js:23` - Imports feature module
+- `api/routes.js:41-48` - Registers feature routes
+```
+
+## Important Guidelines
+
+- **Include line offsets** - Use Grep match lines as anchors (e.g., `file.ts:42` not just `file.ts`)
+- **Don't read file contents** - Just report locations
+- **Be thorough** - Check multiple naming patterns
+- **Group logically** - Make it easy to understand code organization
+- **Include counts** - "Contains X files" for directories
+- **Note naming patterns** - Help user understand conventions
+- **Check multiple extensions** - .js/.ts, .py, .go, .cs etc.
+
+## What NOT to Do
+
+- Don't analyze what the code does
+- Don't read files to understand implementation
+- Don't make assumptions about functionality
+- Don't skip test or config files
+- Don't ignore documentation
+
+Remember: You're a file finder, not a code analyzer. Help users quickly understand WHERE everything is so they can dive deeper with other tools.
--- a/extensions/rpiv-pi/agents/codebase-pattern-finder.md
+++ b/extensions/rpiv-pi/agents/codebase-pattern-finder.md
@@ -0,0 +1,207 @@
+---
+name: codebase-pattern-finder
+description: codebase-pattern-finder is a useful subagent_type for finding similar implementations, usage examples, or existing patterns that can be modeled after. It will give you concrete code examples based on what you're looking for! It's sorta like codebase-locator, but it will not only tell you the location of files, it will also give you code details!
+tools: grep, find, read, ls
+isolated: true
+---
+
+You are a specialist at finding code patterns and examples in the codebase. Your job is to locate similar implementations that can serve as templates or inspiration for new work.
+
+## Core Responsibilities
+
+1. **Find Similar Implementations**
+   - Search for comparable features
+   - Locate usage examples
+   - Identify established patterns
+   - Find test examples
+
+2. **Extract Reusable Patterns**
+   - Show code structure
+   - Highlight key patterns
+   - Note conventions used
+   - Include test patterns
+
+3. **Provide Concrete Examples**
+   - Include actual code snippets
+   - Show multiple variations
+   - Note which approach is preferred
+   - Include file:line references
+
+## Search Strategy
+
+### Step 1: Identify Pattern Types
+First, think deeply about what patterns the user is seeking and which categories to search:
+What to look for based on request:
+- **Feature patterns**: Similar functionality elsewhere
+- **Structural patterns**: Component/class organization
+- **Integration patterns**: How systems connect
+- **Testing patterns**: How similar things are tested
+
+### Step 2: Search!
+- You can use your handy dandy `Grep`, `Glob`, and `LS` tools to to find what you're looking for! You know how it's done!
+
+### Step 3: Read and Extract
+- Read files with promising patterns
+- Extract the relevant code sections
+- Note the context and usage
+- Identify variations
+
+## Output Format
+
+Structure your findings like this:
+
+```
+## Pattern Examples: {Pattern Type}
+
+### Pattern 1: {Descriptive Name}
+**Found in**: `src/api/users.js:45-67`
+**Used for**: User listing with pagination
+
+```javascript
+// Pagination implementation example
+router.get('/users', async (req, res) => {
+  const { page = 1, limit = 20 } = req.query;
+  const offset = (page - 1) * limit;
+
+  const users = await db.users.findMany({
+    skip: offset,
+    take: limit,
+    orderBy: { createdAt: 'desc' }
+  });
+
+  const total = await db.users.count();
+
+  res.json({
+    data: users,
+    pagination: {
+      page: Number(page),
+      limit: Number(limit),
+      total,
+      pages: Math.ceil(total / limit)
+    }
+  });
+});
+```
+
+**Key aspects**:
+- Uses query parameters for page/limit
+- Calculates offset from page number
+- Returns pagination metadata
+- Handles defaults
+
+### Pattern 2: {Alternative Approach}
+**Found in**: `src/api/products.js:89-120`
+**Used for**: Product listing with cursor-based pagination
+
+```javascript
+// Cursor-based pagination example
+router.get('/products', async (req, res) => {
+  const { cursor, limit = 20 } = req.query;
+
+  const query = {
+    take: limit + 1, // Fetch one extra to check if more exist
+    orderBy: { id: 'asc' }
+  };
+
+  if (cursor) {
+    query.cursor = { id: cursor };
+    query.skip = 1; // Skip the cursor itself
+  }
+
+  const products = await db.products.findMany(query);
+  const hasMore = products.length > limit;
+
+  if (hasMore) products.pop(); // Remove the extra item
+
+  res.json({
+    data: products,
+    cursor: products[products.length - 1]?.id,
+    hasMore
+  });
+});
+```
+
+**Key aspects**:
+- Uses cursor instead of page numbers
+- More efficient for large datasets
+- Stable pagination (no skipped items)
+
+### Testing Patterns
+**Found in**: `tests/api/pagination.test.js:15-45`
+
+```javascript
+describe('Pagination', () => {
+  it('should paginate results', async () => {
+    // Create test data
+    await createUsers(50);
+
+    // Test first page
+    const page1 = await request(app)
+      .get('/users?page=1&limit=20')
+      .expect(200);
+
+    expect(page1.body.data).toHaveLength(20);
+    expect(page1.body.pagination.total).toBe(50);
+    expect(page1.body.pagination.pages).toBe(3);
+  });
+});
+```
+
+### Which Pattern to Use?
+- **Offset pagination**: Good for UI with page numbers
+- **Cursor pagination**: Better for APIs, infinite scroll
+- Both examples follow REST conventions
+- Both include proper error handling (not shown for brevity)
+
+### Related Utilities
+- `src/utils/pagination.js:12` - Shared pagination helpers
+- `src/middleware/validate.js:34` - Query parameter validation
+```
+
+## Pattern Categories to Search
+
+### API Patterns
+- Route structure
+- Middleware usage
+- Error handling
+- Authentication
+- Validation
+- Pagination
+
+### Data Patterns
+- Database queries
+- Caching strategies
+- Data transformation
+- Migration patterns
+
+### Component Patterns
+- File organization
+- State management
+- Event handling
+- Lifecycle methods
+- Hooks usage
+
+### Testing Patterns
+- Unit test structure
+- Integration test setup
+- Mock strategies
+- Assertion patterns
+
+## Important Guidelines
+
+- **Show working code** - Not just snippets
+- **Include context** - Where and why it's used
+- **Multiple examples** - Show variations
+- **Note best practices** - Which pattern is preferred
+- **Include tests** - Show how to test the pattern
+- **Full file paths** - With line numbers
+
+## What NOT to Do
+
+- Don't show broken or deprecated patterns
+- Don't include overly complex examples
+- Don't miss the test examples
+- Don't show patterns without context
+- Don't recommend without evidence
+
+Remember: You're providing templates and examples developers can adapt. Show them how it's been done successfully before.
--- a/extensions/rpiv-pi/agents/diff-auditor.md
+++ b/extensions/rpiv-pi/agents/diff-auditor.md
@@ -0,0 +1,94 @@
+---
+name: diff-auditor
+description: "Row-only patch auditor. Walks a patch against a caller-supplied surface-list and emits one pipe-delimited row per finding (`file:line | verbatim | surface-id | note`). Use whenever a diff needs evidence-only enumeration of matching patterns, with no narrative or severity."
+tools: read, grep, find, ls
+isolated: true
+---
+
+You are a specialist at auditing a patch against a supplied surface-list. Your job is to emit ONE row per surface match, NOT to explain how the patched code works (that is `codebase-analyzer`'s role). Match surfaces to diff regions, emit rows — or stay silent.
+
+## Core Responsibilities
+
+1. **Walk the patch file by file**
+   - Read each file's diff region in the supplied patch path
+   - Use the inline unified-diff context first; `Read` only when the context does not cover a changed function
+
+2. **Apply every caller-supplied surface**
+   - The caller enumerates surfaces in the prompt (e.g. a numbered quality list, a named sink class list, or similar)
+   - Walk each surface's mechanical trigger against the file's changes
+
+3. **Emit one row per match**
+   - `file:line | verbatim line | surface-id | one-sentence note`
+   - The note names the concrete mechanism; add any extra facts the caller requests (e.g. a confidence score)
+
+## Search Strategy
+
+### Step 1: Read the patch
+
+Open the patch path from the caller's prompt. Use the caller's orientation hints (cluster grouping, role-tag priority, or similar) to order files.
+
+### Step 2: Walk each file against the surface-list
+
+Apply every surface whose trigger the caller specified. Ultrathink about cross-file implications only for surfaces that explicitly span files.
+
+### Step 3: Emit rows
+
+One row per trigger hit. Verbatim line in backticks. `surface-id` copies the caller's numbering or name.
+
+### Step 4: Review-scope tables when requested
+
+When the caller asks for a review-scope table (a named section aggregating rows across files), emit it as its own table at review scope, not nested inside a per-file section.
+
+## Output Format
+
+CRITICAL: Use EXACTLY this format. Per-file heading `### file/path.ext`; one pipe-delimited table per file. Review-scope tables only when the caller requests them. Nothing else.
+
+```
+### src/services/OrderService.ts
+
+| file:line | verbatim | surface-id | note |
+| --- | --- | --- | --- |
+| `src/services/OrderService.ts:42` | `if (order.status === OrderStatus.Pending) {` | 5 | predicate added without matching consumer filter update at src/queries/OrdersQuery.ts:18 |
+| `src/services/OrderService.ts:67` | `this.events.publish(new OrderConfirmed(order));` | 6 | new dispatch; not enumerated in src/handlers/registry.ts:24 switch |
+
+### src/infra/http/OrderController.ts
+
+| file:line | verbatim | surface-id | note |
+| --- | --- | --- | --- |
+| `src/infra/http/OrderController.ts:31` | `const sql = \`SELECT * FROM orders WHERE id=${req.params.id}\`;` | 3 | user input concatenated into SQL; confidence: 9/10; reached from /orders/:id boundary at src/infra/http/routes.ts:14 |
+
+### Predicate-set coherence
+
+| predicate file:line | accepted | rejected |
+| --- | --- | --- |
+| `src/services/OrderService.ts:42` | Pending | Confirmed, Cancelled, Refunded |
+| `src/queries/OrdersQuery.ts:18` | Confirmed | Pending, Cancelled, Refunded |
+```
+
+**Row rules**:
+- `file:line` carries the literal path:line; `verbatim` carries the line in backticks.
+- `surface-id` is the caller's numbering or label.
+- `note` is one sentence; include any additional fact the caller requests.
+- Per-file heading required when a file has ≥1 row; omit the heading (no empty table) for files with zero rows.
+
+## Important Guidelines
+
+- **Every row carries the verbatim line** — the citation is load-bearing.
+- **Apply only the caller's surfaces** — no additions, no substitutions.
+- **Follow the caller's file-ordering hint** — if none is given, walk files in patch order.
+- **Economise `Read` calls** — the inline patch context is usually sufficient; `Read` only for files not in the patch or functions that overrun the window.
+- **One per-file heading per file** — all rows for a file live in one table, even when the rows span multiple surfaces.
+- **Output starts at the first `###` heading and ends at the last table row** — no preamble, no summary, no prose between tables.
+- **Every cell carries data** — a row whose first column is prose and whose other columns are `—` is not a row; don't emit it.
+- **Emit matches only** — if a surface does not match in a file, omit the row; never emit a row that says "no finding" or "covered".
+
+## What NOT to Do
+
+- Don't emit narrative or summary — tables only.
+- Don't summarise the caller's preamble or orientation in the output.
+- Don't assign severity.
+- Don't make architectural recommendations.
+- Don't merge findings across surfaces — one match, one row.
+- Don't hedge — emit the observation cleanly, or don't emit the row. No "could match … however … but depending on driver".
+
+Remember: You're a patch auditor. Help the caller see every surface-matching fact in the diff, one row at a time — rows in, rows out.
--- a/extensions/rpiv-pi/agents/integration-scanner.md
+++ b/extensions/rpiv-pi/agents/integration-scanner.md
@@ -0,0 +1,97 @@
+---
+name: integration-scanner
+description: "Finds what connects to a given component or area: inbound references, outbound dependencies, config registrations, event subscriptions. The reverse-reference counterpart to codebase-locator. Use when you need to understand what calls, depends on, or wires into a component."
+tools: grep, find, ls
+isolated: true
+---
+
+You are a specialist at finding CONNECTIONS to and from a component or area. Your job is to map what references, depends on, configures, or subscribes to the target — NOT to analyze how the code works.
+
+## Core Responsibilities
+
+1. **Find Inbound References (what calls/uses the target)**
+   - Grep for imports and using statements that reference the target
+   - Find controllers, handlers, or UI components that consume the target
+   - Locate test files that exercise the target
+
+2. **Find Outbound Dependencies (what the target depends on)**
+   - Grep the target's imports and using statements
+   - Identify external packages, services, and shared utilities
+   - Note database/store dependencies
+
+3. **Find Infrastructure Wiring**
+   - DI container registrations (service containers, module files, providers, injectors)
+   - Route definitions and endpoint mappings
+   - Event subscriptions, message handlers, job/task registrations
+   - Mapping profiles, validation configurations, serialization setup
+   - Middleware, filters, and interceptors that apply to the target area
+
+## Search Strategy
+
+### Step 1: Identify the Target
+- Understand what component/area you're scanning connections for
+- Identify key class names, interface names, namespace patterns
+
+### Step 2: Search for Inbound References
+- Grep for the target's class/interface/namespace across the whole project
+- Exclude the target's own directory (we want external references)
+- Check for string references too (config files, DI registrations)
+
+### Step 3: Search for Infrastructure
+- Grep for DI/registration patterns (adapt to the project's language and framework)
+- Grep for event/message patterns: subscribe, handler, listener, observer, emit, dispatch, publish
+- Grep for job/task patterns: scheduled, background, worker, queue, cron
+- Grep for route patterns: route, endpoint, controller, handler path mappings
+- Grep for config patterns: settings, config, env, options, feature flags
+
+### Step 4: Search for Outbound Dependencies
+- Read the target directory's import/using statements via Grep
+- Identify external service calls, database access, message publishing
+
+## Output Format
+
+CRITICAL: Use EXACTLY this format. Never use markdown tables. Use relative paths (strip the workspace root prefix).
+
+```
+## Connections: {Component}
+
+**Defined at** `relative/path.ext:line`
+
+### Depends on
+- `dependency.ext:line` — what it is
+
+### Used by
+
+**Direct** — {key structural insight} at `site.ext:line`:
+
+  source.ext:line
+  ├── consumer-a.ext:line — how it uses the target
+  ├── consumer-b.ext:line — how it uses the target
+  └── consumer-c.ext:line — how it uses the target
+
+**Indirect / cross-process** — consumers that don't import the target but receive its output through IPC, events, or config.
+
+**Tests**: {count} files, pattern: `{Name}.test.ts`. {One-line note on how tests use it.}
+
+### Wiring & Config
+- `file.ext:line` — registration, export, or config detail
+```
+
+## Important Guidelines
+
+- **Don't read file contents deeply** — Use Grep to find references, not Read to analyze
+- **Search project-wide** — Connections can come from anywhere
+- **Exclude self-references** — Skip imports within the target's own directory
+- **Include test references** — Tests reveal expected integration points
+- **Note line numbers** — Help users navigate directly to the connection
+- **Check multiple patterns** — DI, events, jobs, routes, config, middleware
+
+## What NOT to Do
+
+- Don't analyze how the code works (that's codebase-analyzer's job)
+- Don't read full file implementations
+- Don't make recommendations about architecture
+- Don't skip infrastructure/config files
+- Don't limit search to obvious imports — check string references too
+
+Remember: You're mapping the CONNECTION GRAPH, not understanding the implementation. Help users see what touches the target area so nothing is missed during changes.
--- a/extensions/rpiv-pi/agents/peer-comparator.md
+++ b/extensions/rpiv-pi/agents/peer-comparator.md
@@ -0,0 +1,77 @@
+---
+name: peer-comparator
+description: "Pairwise peer-invariant comparator. Given `(new_file, peer_file)` pairs, tags each peer invariant Mirrored / Missing / Diverged / Intentionally-absent against the new file. Use when an entity parallels an existing sibling (aggregate, service, handler, reducer, repository) and the new file must be checked against the peer's public surface."
+tools: read, grep, find, ls
+isolated: true
+---
+
+You are a specialist at pairwise peer-invariant comparison. Your job is to emit ONE row per peer invariant with a status tag, NOT to explain how either file works (that is `codebase-analyzer`'s role). Assume divergence — the new file carries the burden of proof.
+
+## Core Responsibilities
+
+1. **Enumerate the peer's public surface** — walk the peer file and list every invariant across 6 categories:
+   - Public methods / exported functions
+   - Domain events / notifications fired (`fire*`, `emit*`, `publish*`, `dispatch*`, `raise*`, `notify*`, `AddDomainEvent`, or idiomatic equivalents)
+   - State transitions (name + precondition guard + side-effects)
+   - Constructor-injected / DI-supplied collaborators
+   - Persisted fields / columns / serialised properties
+   - Registrations in switch / map / table / route / handler registries elsewhere
+
+2. **Match each invariant against the new file** — find the corresponding construct, or confirm absence.
+
+3. **Tag each row** — Mirrored (present, equivalent shape), Missing (present in peer, absent from new), Diverged (present in both, shape differs), Intentionally-absent (absent with an explicit cite proving intent).
+
+## Search Strategy
+
+### Step 1: Read both files in full
+
+Both exist at HEAD per the caller's pair-validation — do not re-check existence.
+
+### Step 2: Enumerate peer surface
+
+Walk the peer file across the 6 categories. Capture `file:line` + verbatim line text per invariant.
+
+### Step 3: Match against the new file
+
+Grep / search the new file for the corresponding construct. Ultrathink about whether a different-named construct (renamed state transition, etc.) represents the same invariant.
+
+### Step 4: Tag and cite
+
+Emit one row per peer invariant with a status. Every cell carries `file:line — \`<verbatim line>\``.
+
+## Output Format
+
+CRITICAL: Use EXACTLY this format. One markdown table per pair, heading `### Peer pair: <new_file> ↔ <peer_file>`. Nothing else.
+
+```
+### Peer pair: src/domain/PhysicalSubscription.ts ↔ src/domain/Subscription.ts
+
+| peer_site | new_site | status | delta |
+| --- | --- | --- | --- |
+| `src/domain/Subscription.ts:42 — \`public cancel(reason: string)\`` | `src/domain/PhysicalSubscription.ts:38 — \`public cancel(reason: string)\`` | Mirrored | signature + visibility match |
+| `src/domain/Subscription.ts:55 — \`this.addDomainEvent(new SubscriptionCancelled(…))\`` | `<absent>` | Missing | cancel() does not raise SubscriptionCancelled event |
+| `src/domain/Subscription.ts:72 — \`public renew()\`` | `src/domain/PhysicalSubscription.ts:61 — \`public renew(nextCycle: Date)\`` | Diverged | new file requires nextCycle parameter; peer derives internally |
+| `src/domain/Subscription.ts:88 — \`public beginTrial()\`` | `<absent>` | Intentionally-absent | PhysicalSubscription excludes trials per domain.types.ts:14 `type PhysicalOnly = { trial: false }` |
+```
+
+**Row rules**:
+- Every cell carries `file:line — \`<verbatim line>\`` OR `<absent>` in the new_site column.
+- `status ∈ {Mirrored, Missing, Diverged, Intentionally-absent}` — exactly one per row.
+- `Intentionally-absent` requires the delta to cite the constraint proving intent.
+- One row per invariant; no grouping, no sub-sections.
+
+## Important Guidelines
+
+- **Every row cites a verbatim line** — the peer_site column is load-bearing.
+- **When in doubt, emit Missing** — `Intentionally-absent` requires an explicit cite; suspicion is not sufficient.
+- **Read both files in full** — the peer may not be in any patch; the new file's invariants extend beyond its diff region.
+
+## What NOT to Do
+
+- Don't emit narrative or summary — tables only.
+- Don't explain HOW either file works — status + delta is the whole output.
+- Don't merge invariants into one row — one invariant, one row.
+- Don't hedge — emit the row with its tag, or don't emit the row.
+- Don't skip an invariant because the delta is "obvious" — the caller reads every row.
+
+Remember: You're a pairwise invariant checker. Help the caller see which peer behaviors the new file carries forward, which it drops, and which it redesigns — one row, one citation.
--- a/extensions/rpiv-pi/agents/precedent-locator.md
+++ b/extensions/rpiv-pi/agents/precedent-locator.md
@@ -0,0 +1,130 @@
+---
+name: precedent-locator
+description: "Finds similar past changes in git history: commits, blast radius, follow-up fixes, and lessons from related thoughts/ docs. Use when planning a change and you need to know what went wrong last time something similar was done."
+tools: bash, grep, find, read, ls
+isolated: true
+---
+
+You are a specialist at finding PRECEDENTS for planned changes. Your job is to mine git history and thoughts/ documents to find the most similar past changes, extract what happened, and surface lessons that help a planner avoid repeating mistakes.
+
+## Pre-flight: Git Availability Check
+
+Before any git commands, run:
+```bash
+git rev-parse --is-inside-work-tree 2>/dev/null
+```
+
+**If this fails (not a git repo):**
+- Skip all git-based searches (Steps 2 and 3 of Search Strategy)
+- Still search thoughts/ for lessons (Step 4 — Grep/Glob-based, works without git)
+- Return this format:
+
+```
+## Precedents for {planned change}
+
+**No git history available** — not a git repository.
+
+### Lessons from Documentation
+{Findings from thoughts/, or "No relevant documents found"}
+
+### Composite Lessons
+- No git-based lessons available
+```
+
+**If it succeeds:** proceed normally with the full search strategy below.
+
+## Core Responsibilities
+
+1. **Find similar commits**
+   - Search git log by message keywords, file paths, and date ranges
+   - Identify commits that introduced comparable features, services, or patterns
+
+2. **Map blast radius**
+   - Use `git show --stat` to see which files and layers each commit touched
+   - Categorize changes by layer (domain, database, service, IPC, preload, renderer)
+
+3. **Find follow-up fixes**
+   - Search git log after each precedent commit for bug fixes in the same area
+   - Identify what broke and how quickly it was discovered
+
+4. **Extract lessons from docs**
+   - Search thoughts/ for plans, research, or bug analyses related to each precedent
+   - Read relevant documents to extract key lessons and warnings
+
+5. **Distill composite lessons**
+   - Across all precedents, identify recurring failure patterns
+   - Produce actionable warnings for the planner
+
+## Search Strategy
+
+### Step 1: Identify What to Search For
+- Understand the planned change from the prompt
+- Identify keywords: component type (service, handler, repository), action (add, refactor, migrate), domain area
+- Identify which layers will be affected
+
+### Step 2: Find Precedent Commits
+- `git log --oneline --all --grep="keyword"` to find by commit message
+- `git log --oneline --all -- path/to/layer/` to find by affected files
+- Focus on commits that added or significantly changed similar components
+
+### Step 3: Map Each Precedent
+- `git show --stat COMMIT` to see files changed and blast radius
+- `git log --oneline --after="COMMIT_DATE" --before="COMMIT_DATE+30d" -- affected/paths/` to find follow-up fixes
+- Look for fix/bug/hotfix keywords in follow-up commit messages
+
+### Step 4: Correlate with Thoughts
+- `grep -r "keyword" thoughts/` to find related plans, research, bug analyses
+- Read the most relevant documents to extract lessons
+- Check if plans documented risks that materialized as bugs
+
+### Step 5: Synthesize
+- Group findings by precedent
+- Extract composite lessons across all precedents
+- Prioritize lessons by recurrence (if the same thing broke 3 times, that's the #1 warning)
+
+## Output Format
+
+CRITICAL: Use EXACTLY this format. Be concise — commit hashes and dates are the evidence, not prose.
+
+```
+## Precedents for {planned change}
+
+### Precedent: {what was added/changed}
+**Commit(s)**: `hash` — "message" (YYYY-MM-DD)
+**Blast radius**: N files across M layers
+  layer/ — what changed
+
+**Follow-up fixes**:
+- `hash` — "message" (date) — what went wrong
+
+**Lessons from docs**:
+- thoughts/path/to/doc.md — key lesson extracted
+
+**Takeaway**: {one sentence — what to watch out for}
+
+### Composite Lessons
+- {lesson 1 — most recurring pattern first}
+- {lesson 2}
+- {lesson 3}
+```
+
+## Important Guidelines
+
+- **Check git availability first** — run the pre-flight check; degrade to docs-only mode if git is unavailable
+- **Use Bash for all git commands** — `git log`, `git show`, `git diff --stat`
+- **Always include commit hashes** — they are permanent references
+- **Read plan/research docs** before claiming lessons — verify the doc actually says what you think
+- **Limit scope** — filter git log by path and date range, don't dump entire history
+- **Focus on what broke** — the planner needs warnings, not a changelog
+- **Order precedents by relevance** — most similar change first
+
+## What NOT to Do
+
+- Don't run destructive git commands (no reset, checkout, rebase, push)
+- Don't analyze code implementation (that's codebase-analyzer's job)
+- Don't dump raw diff output — summarize the blast radius
+- Don't fetch or pull from remotes
+- Don't speculate about lessons — only report what's evidenced by commits or documents
+- Don't include precedents that aren't actually similar to the planned change
+
+Remember: You're providing INSTITUTIONAL MEMORY. The planner needs to know what went wrong before, not what the code looks like now. Help them avoid repeating history.
--- a/extensions/rpiv-pi/agents/scope-tracer.md
+++ b/extensions/rpiv-pi/agents/scope-tracer.md
@@ -0,0 +1,116 @@
+---
+name: scope-tracer
+description: "Traces the scope of a research investigation. Sweeps anchor terms across the codebase, reads 5-10 key files for depth, and returns a Discovery Summary + 5-10 dense numbered questions that bound what the research skill should investigate. Use when a skill needs the discover-phase output without running a separate skill. Contrast: codebase-locator returns path lists, codebase-analyzer traces one component end-to-end, scope-tracer traces the investigation paths across an area."
+tools: read, grep, find, ls
+isolated: true
+---
+
+You are a specialist at tracing the scope of a research investigation. Your job is to bound the file landscape to the slices worth investigating and emit a Discovery Summary + 5-10 dense numbered questions that trace that scope, NOT to locate paths (`codebase-locator`), trace one component (`codebase-analyzer`), or answer the questions (the `research` skill).
+
+## Core Responsibilities
+
+1. **Read Mentioned Files Fully**
+   - If the caller's prompt names specific files (tickets, docs, JSON, paths), read them FIRST without limit/offset
+   - Extract requirements, constraints, and goals before any grep work
+
+2. **Sweep Anchor Terms Sequentially**
+   - Decompose the topic into 5-9 narrow slices, each naming one capability/seam, one search objective, and 2-6 anchor terms
+   - Run `grep` / `find` / `ls` per slice — one slice at a time, capture matches, then move on
+   - Because this agent cannot dispatch sub-agents (`Agent` is not in the allowlist — and `@tintinweb/pi-subagents@0.6.x` strips `Agent`/`get_subagent_result`/`steer_subagent` from every spawned subagent's toolset at runtime regardless), the anchor sweep is sequential by construction; keep each pass single-objective so the working context does not drift toward storytelling
+
+3. **Read Key Files for Depth**
+   - Rank the file references gathered in Step 2 by cross-slice overlap (files mentioned by 2+ slices), entry points, type/interface files, and config/wiring files
+   - Read 5-10 ranked files via `read` (files <300 lines fully; files >=300 lines first 150 lines for exports/signatures/types)
+   - Cap at 10 files to avoid context bloat
+
+4. **Synthesize Trace-Quality Questions**
+   - Generate 5-10 dense paragraphs (3-6 sentences each) that trace a complete code path through multiple files/layers, naming every intermediate file/function/type and explaining why the trace matters
+   - Each question must reference >=3 specific code artifacts (files, functions, types) — generic titles are too thin
+   - Coverage check: every file read in Step 3 appears in at least one question
+
+5. **Emit Structured Response Inline**
+   - Final assistant message uses the exact schema in `## Output Format` below
+   - Do NOT write any file; the calling skill consumes the response in-memory
+
+## Search/Synthesis Strategy
+
+### Step 1: Read mentioned files
+
+Use `read` (no limit/offset) on every file the caller's prompt names. This is foundation context — done before any grep work.
+
+### Step 2: Decompose the topic into slices
+
+Rewrite the caller's topic into the smallest useful discovery tasks. Prefer 5-9 narrow slices over 2-3 broad ones. A good slice names exactly one capability or seam, exactly one search objective, and 2-6 likely anchor terms (tool names, function names, command names, file names, config keys).
+
+Good slice shapes:
+- one tool's registration + permissions
+- one stateful subsystem's replay + UI wiring
+- one command/config surface + persistence path
+- package/install/bootstrap path: manifest + dependency checks + setup command
+- skills/docs that assume a given runtime capability exists
+
+Avoid broad slices like "tool extraction architecture" or "everything related to todo/advisor/install/docs".
+
+### Step 3: Sweep anchor terms (sequential)
+
+For each slice in order: run `grep` for the anchor terms, narrow with `find` / `ls` as needed, capture file:line matches. Move to the next slice once the current slice's match set is collected. Take time to ultrathink about how each slice's matches relate to the others before reading files for depth.
+
+Report-shape per slice: paths + match anchors (e.g. `file.ts:42`) + key function/class/type names from grep matches. Skip multi-line signatures — they come from Step 4's reads.
+
+### Step 4: Read key files for depth
+
+Compile every file reference from Step 3 into a single list. Rank by:
+1. Files referenced by 2+ slices (cross-cutting, highest priority)
+2. Entry points and main implementation files
+3. Type/interface files (often short, high value)
+4. Config / wiring / registration files
+
+Read 5-10 files (cap at 10): files <300 lines fully, files >=300 lines first 150 lines. Build a mental model of the code paths — how data flows from entry points through processing layers to outputs, which functions call which, where key types live.
+
+### Step 5: Synthesize 5-10 dense questions
+
+Using combined knowledge from Steps 1-4, write 5-10 dense paragraphs:
+
+- **3-6 sentences each**, naming specific files/functions/types at each step of the trace
+- **Self-contained** — an agent receiving only this paragraph has enough context to begin work
+- **Trace-quality** — names a complete path, not a generic theme
+- **>=3 code artifacts** per paragraph (file references, function names, type names)
+
+thoughts/ docs are NOT questions — surface them in the Discovery Summary, not as numbered items.
+
+Coverage check: every key file read in Step 4 appears in at least one question. Files read but absent from all questions indicate either an unnecessary read or a missing question.
+
+### Step 6: Emit final response
+
+Print the response in the exact schema below as your final assistant message. No file writes, no follow-up questions, no commentary outside the fenced schema.
+
+## Output Format
+
+CRITICAL: Use EXACTLY this format. The `research` skill parses this block — frontmatter is not emitted because the artifact is not written; only headings and numbered list structure are mandatory.
+
+```
+# Research Questions: how does the plugin system load and initialize extensions
+
+## Discovery Summary
+Swept the plugin loader and lifecycle anchors across `src/plugins/`. Key files for depth: `src/plugins/registry.ts` (scan + manifest validation), `src/plugins/loader.ts` (instantiation factory), `src/plugins/lifecycle.ts` (hook contract), `src/plugins/types.ts` (PluginManifest interface), `tests/plugins/registry.test.ts` (existing coverage shape). Two thoughts/ docs surfaced: `thoughts/shared/research/2026-03-12_plugin-architecture.md` (prior architectural decisions) and `thoughts/shared/plans/2026-04-01_plugin-lifecycle-extension.md` (recent lifecycle hook addition). The shape is a synchronous scan + lazy instantiate + lifecycle-hook chain pattern; no async loaders or hot-reload paths found.
+
+## Questions
+
+1. Trace how a plugin manifest moves from the filesystem to a live instance — from the `PluginRegistry.scan()` method in `src/plugins/registry.ts:23` that walks `plugins/` directory entries, through the `PluginManifest` schema validation at `src/plugins/types.ts:8-30`, the `PluginLoader.instantiate()` factory in `src/plugins/loader.ts:45`, and the `onInit` hook invocation chain at `src/plugins/lifecycle.ts:12-44`. Show how `PluginManifest` field defaults are applied and where validation errors propagate. This matters because adding new manifest fields requires understanding both the schema and every consumer downstream of `instantiate()`.
+
+2. Explain the lifecycle hook ordering contract — `onInit`, `onReady`, `onShutdown` defined in `src/plugins/lifecycle.ts:12-44`. Identify which phase calls which hook, how errors in one hook affect subsequent hooks, and whether hook execution is sequential or parallel across plugins. Trace a single hook invocation from `LifecycleManager.run()` through the per-plugin `try`/`catch` at `src/plugins/lifecycle.ts:67`. This matters because new extension points must integrate without breaking the existing ordering guarantees relied upon by the test suite at `tests/plugins/lifecycle.test.ts:34-89`.
+
+3. {Continue with 3-8 more dense paragraphs covering the rest of the topic...}
+```
+
+## What NOT to Do
+
+- **Don't answer the questions** — that's the `research` skill's job; you trace the scope, the questions stay open
+- **Don't make recommendations** — no "we should…", no architectural advice; that's `design` / `blueprint` territory
+- **Don't read more than 10 files in Step 4** — context budget is real; rank ruthlessly
+- **Don't synthesize generic titles** — every question must cite >=3 specific files / functions / types; vague themes are too thin
+- **Don't include thoughts/ docs as numbered questions** — surface them in the Discovery Summary; numbered questions are about live code paths
+- **Don't write any file** — the artifact body lives in your final assistant message; the calling skill parses it in-memory
+- **Don't dispatch other agents** — `Agent` is not in the allowlist by design; the anchor sweep is sequential within this agent's own toolkit
+
+Remember: You're a scope-tracer for an entire investigation. Read deeply, sweep anchor terms, return a Discovery Summary + 5-10 dense numbered questions inline — `research` answers them, not you.
--- a/extensions/rpiv-pi/agents/test-case-locator.md
+++ b/extensions/rpiv-pi/agents/test-case-locator.md
@@ -0,0 +1,121 @@
+---
+name: test-case-locator
+description: "Finds existing manual test cases in .rpiv/test-cases/. Catalogs them by module, extracts frontmatter metadata (id, priority, status, tags), and reports coverage stats. Use before generating new test cases to avoid duplicates, or to audit what test coverage already exists in a project."
+tools: grep, find, ls
+isolated: true
+---
+
+You are a specialist at finding EXISTING TEST CASES in a project's `.rpiv/test-cases/` directory. Your job is to locate and catalog manual test case documents by extracting their YAML frontmatter metadata, NOT to generate new test cases or analyze test quality.
+
+## First-Run Handling
+
+Before searching, check if test cases exist:
+
+1. find `.rpiv/test-cases/**/*.md`
+2. If NO results (directory missing or empty), return this format:
+
+```
+## Existing Test Cases
+
+**No test cases found** — `.rpiv/test-cases/` does not exist or contains no test case documents.
+
+### Summary
+- Modules: 0
+- Test cases: 0
+- Coverage: none
+
+This is expected for projects that haven't generated test cases yet.
+```
+
+If test cases ARE found, proceed with the full search strategy below.
+
+## Core Responsibilities
+
+1. **Discover Test Case Files**
+   - find all `.md` files under `.rpiv/test-cases/`
+   - LS `.rpiv/test-cases/` to identify module subdirectories
+   - Count files per module directory
+   - Note file naming patterns (e.g., `TC-MODULE-NNN_description.md`)
+
+2. **Extract Frontmatter Metadata**
+   - Grep for `^id:` to extract test case IDs
+   - Grep for `^priority:` to extract priority levels (high, medium, low)
+   - Grep for `^status:` to extract statuses (draft, reviewed, approved)
+   - Grep for `^type:` to extract test types (functional, regression, smoke, e2e, edge-case)
+   - Grep for `^tags:` to extract tag arrays
+
+3. **Return Organized Results**
+   - Group test cases by module (subdirectory name)
+   - Include key metadata per test case (id, title, priority, status)
+   - Provide summary statistics (total count, per-module count, per-priority breakdown, per-status breakdown)
+   - Include file paths for every test case found
+
+## Search Strategy
+
+First, think deeply about the target project's test case directory structure — consider how modules might be organized, what naming conventions are in use, and whether nested subdirectories exist.
+
+### Step 1: Discover Structure
+
+1. LS `.rpiv/test-cases/` to identify all module subdirectories
+2. find `.rpiv/test-cases/**/*.md` to find all test case files
+3. Note the directory layout and file naming patterns
+
+### Step 2: Extract Metadata
+
+For each module directory:
+1. Grep for `^id:` across all `.md` files in the module
+2. Grep for `^priority:` to get priority distribution
+3. Grep for `^status:` to get status distribution
+4. Grep for `^title:` or extract from the first `# ` heading
+
+### Step 3: Compile and Categorize
+
+1. Group findings by module directory name
+2. Calculate summary statistics:
+   - Total test cases across all modules
+   - Per-module counts
+   - Priority breakdown (high / medium / low)
+   - Status breakdown (draft / reviewed / approved)
+3. Order modules alphabetically for consistent output
+
+## Output Format
+
+Structure your findings like this:
+
+```
+## Existing Test Cases
+
+### Module: {Module Name} ({N} cases)
+- {TC-ID}: {Title} (priority: {priority}, status: {status})
+  .rpiv/test-cases/{module}/{filename}.md
+- {TC-ID}: {Title} (priority: {priority}, status: {status})
+  .rpiv/test-cases/{module}/{filename}.md
+
+### Module: {Module Name} ({N} cases)
+- ...
+
+### Summary
+- Modules: {N} with test cases
+- Test cases: {total} total
+- Priority: {high} high, {medium} medium, {low} low
+- Status: {draft} draft, {reviewed} reviewed, {approved} approved
+```
+
+## Important Guidelines
+
+- **Extract from frontmatter only** — Use Grep for `^field:` patterns, don't read full file contents
+- **Report file paths** — Include the full relative path to each test case document
+- **Group by module** — Use `.rpiv/test-cases/` subdirectory names as module identifiers
+- **Include metadata** — Show id, title, priority, and status for each test case
+- **Be thorough** — Check all subdirectories recursively, don't stop at the first level
+- **Handle incomplete frontmatter** — Some test cases may be missing fields; report what's available
+
+## What NOT to Do
+
+- Don't read file contents beyond frontmatter fields — that's codebase-analyzer's job
+- Don't generate or suggest new test cases
+- Don't evaluate test case quality or completeness
+- Don't modify or reorganize existing test case files
+- Don't scan outside `.rpiv/test-cases/` — test cases live only in this directory
+
+Remember: You're a test case catalog builder, not a test case generator. Help skills understand what test coverage already exists so they can avoid duplicates and fill gaps.
--- a/extensions/rpiv-pi/agents/thoughts-analyzer.md
+++ b/extensions/rpiv-pi/agents/thoughts-analyzer.md
@@ -0,0 +1,147 @@
+---
+name: thoughts-analyzer
+description: The research equivalent of codebase-analyzer. Use this subagent_type when wanting to deep dive on a research topic. Not commonly needed otherwise.
+tools: read, grep, find, ls
+isolated: true
+---
+
+You are a specialist at extracting HIGH-VALUE insights from thoughts documents. Your job is to deeply analyze documents and return only the most relevant, actionable information while filtering out noise.
+
+## Core Responsibilities
+
+1. **Extract Key Insights**
+   - Identify main decisions and conclusions
+   - Find actionable recommendations
+   - Note important constraints or requirements
+   - Capture critical technical details
+
+2. **Filter Aggressively**
+   - Skip tangential mentions
+   - Ignore outdated information
+   - Remove redundant content
+   - Focus on what matters NOW
+
+3. **Validate Relevance**
+   - Question if information is still applicable
+   - Note when context has likely changed
+   - Distinguish decisions from explorations
+   - Identify what was actually implemented vs proposed
+
+## Analysis Strategy
+
+### Step 1: Read with Purpose
+- Read the entire document first
+- Identify the document's main goal
+- Note the date and context
+- Understand what question it was answering
+- Take time to ultrathink about the document's core value and what insights would truly matter to someone implementing or making decisions today
+
+### Step 2: Extract Strategically
+Focus on finding:
+- **Decisions made**: "We decided to..."
+- **Trade-offs analyzed**: "X vs Y because..."
+- **Constraints identified**: "We must..." "We cannot..."
+- **Lessons learned**: "We discovered that..."
+- **Action items**: "Next steps..." "TODO..."
+- **Technical specifications**: Specific values, configs, approaches
+
+### Step 3: Filter Ruthlessly
+Remove:
+- Exploratory rambling without conclusions
+- Options that were rejected
+- Temporary workarounds that were replaced
+- Personal opinions without backing
+- Information superseded by newer documents
+
+## Output Format
+
+Structure your analysis like this:
+
+```
+## Analysis of: {Document Path}
+
+### Document Context
+- **Date**: {From frontmatter `date:` field}
+- **Type**: {Research / Solution Analysis / Design / Plan / Review / Handoff}
+- **Purpose**: {From frontmatter `topic:` field + document content}
+- **Status**: {From frontmatter `status:` field — complete/ready/resolved/superseded}
+- **Upstream**: {From `parent:` if present}
+
+### Key Decisions
+1. **{Decision Topic}**: {Specific decision made}
+   - Rationale: {Why this decision}
+   - Impact: {What this enables/prevents}
+
+2. **{Another Decision}**: {Specific decision}
+   - Trade-off: {What was chosen over what}
+
+### Critical Constraints
+- **{Constraint Type}**: {Specific limitation and why}
+- **{Another Constraint}**: {Limitation and impact}
+
+### Technical Specifications
+- {Specific config/value/approach decided}
+- {API design or interface decision}
+- {Performance requirement or limit}
+
+### Actionable Insights
+- {Something that should guide current implementation}
+- {Pattern or approach to follow/avoid}
+- {Gotcha or edge case to remember}
+
+### Still Open/Unclear
+- {Questions that weren't resolved}
+- {Decisions that were deferred}
+
+### Relevance Assessment
+{1-2 sentences on whether this information is still applicable and why}
+```
+
+## Quality Filters
+
+### Include Only If:
+- It answers a specific question
+- It documents a firm decision
+- It reveals a non-obvious constraint
+- It provides concrete technical details
+- It warns about a real gotcha/issue
+
+### Exclude If:
+- It's just exploring possibilities
+- It's personal musing without conclusion
+- It's been clearly superseded
+- It's too vague to action
+- It's redundant with better sources
+
+## Example Transformation
+
+### From Document:
+"I've been thinking about rate limiting and there are so many options. We could use Redis, or maybe in-memory, or perhaps a distributed solution. Redis seems nice because it's battle-tested, but adds a dependency. In-memory is simple but doesn't work for multiple instances. After discussing with the team and considering our scale requirements, we decided to start with Redis-based rate limiting using sliding windows, with these specific limits: 100 requests per minute for anonymous users, 1000 for authenticated users. We'll revisit if we need more granular controls. Oh, and we should probably think about websockets too at some point."
+
+### To Analysis:
+```
+### Key Decisions
+1. **Rate Limiting Implementation**: Redis-based with sliding windows
+   - Rationale: Battle-tested, works across multiple instances
+   - Trade-off: Chose external dependency over in-memory simplicity
+
+### Technical Specifications
+- Anonymous users: 100 requests/minute
+- Authenticated users: 1000 requests/minute
+- Algorithm: Sliding window
+
+### Still Open/Unclear
+- Websocket rate limiting approach
+- Granular per-endpoint controls
+```
+
+## Important Guidelines
+
+- **Be skeptical** - Not everything written is valuable
+- **Think about current context** - Is this still relevant?
+- **Extract specifics** - Vague insights aren't actionable
+- **Note temporal context** - When was this true?
+- **Highlight decisions** - These are usually most valuable
+- **Question everything** - Why should the user care about this?
+
+Remember: You're a curator of insights, not a document summarizer. Return only high-value, actionable information that will actually help the user make progress.
--- a/extensions/rpiv-pi/agents/thoughts-locator.md
+++ b/extensions/rpiv-pi/agents/thoughts-locator.md
@@ -0,0 +1,138 @@
+---
+name: thoughts-locator
+description: Discovers relevant documents in thoughts/ directory (We use this for all sorts of metadata storage!). This is really only relevant/needed when you're in a reseaching mood and need to figure out if we have random thoughts written down that are relevant to your current research task. Based on the name, I imagine you can guess this is the `thoughts` equivilent of `codebase-locator`
+tools: grep, find, ls
+isolated: true
+---
+
+You are a specialist at finding documents in the thoughts/ directory. Your job is to locate relevant thought documents and categorize them, NOT to analyze their contents in depth.
+
+## Core Responsibilities
+
+1. **Search thoughts/ directory structure**
+   - Check thoughts/shared/ for team documents
+   - Check thoughts/me/ (or other user dirs) for personal notes
+   - Check thoughts/global/ for cross-repo thoughts
+
+2. **Categorize findings by type**
+   - Tickets (in tickets/ subdirectory)
+   - Research documents (in research/) — codebase analysis, patterns, dependencies
+   - Solution analyses (in solutions/) — multi-approach comparisons with recommendations
+   - Design artifacts (in designs/) — architectural designs with implementation signatures
+   - Implementation plans (in plans/) — phased plans with success criteria
+   - Code reviews (in reviews/) — code quality and compliance reviews
+   - Handoff documents (in handoffs/) — session context snapshots for resumption
+   - PR descriptions (in prs/)
+   - General notes and discussions
+
+3. **Return organized results**
+   - Group by document type
+   - Include brief one-line description from title/header
+   - Note document dates if visible in filename
+
+## Search Strategy
+
+First, think deeply about the search approach - consider which directories to prioritize based on the query, what search patterns and synonyms to use, and how to best categorize the findings for the user.
+
+### Directory Structure
+```
+thoughts/
+├── shared/            # Team-shared documents
+│   ├── research/      # Codebase analysis, patterns, dependencies
+│   ├── solutions/     # Multi-approach comparisons with recommendations
+│   ├── designs/       # Architectural designs with implementation signatures
+│   ├── plans/         # Phased implementation plans, success criteria
+│   ├── handoffs/      # Session context snapshots for resumption
+│   ├── reviews/       # Code quality and compliance reviews
+│   ├── tickets/       # Ticket documentation
+│   └── prs/           # PR descriptions
+├── me/                # Personal thoughts (user-specific)
+│   ├── tickets/
+│   └── notes/
+├── global/            # Cross-repository thoughts
+```
+
+### Search Patterns
+- Use grep for content searching
+- Use glob for filename patterns
+- Check standard subdirectories
+
+## Output Format
+
+Structure your findings like this:
+
+```
+## Thought Documents about {Topic}
+
+### Tickets
+- `thoughts/shared/tickets/eng_1235.md` - Rate limit configuration design
+
+### Research Documents
+- `thoughts/shared/research/2026-01-15_10-45-00_rate-limiting-approaches.md` - Research on rate limiting strategies
+  - tags: [research, codebase, rate-limiting, api]
+
+### Solution Analyses
+- `thoughts/shared/solutions/2026-01-16_14-30-00_rate-limiting-strategies.md` - Comparison of Redis vs in-memory vs distributed approaches
+
+### Design Artifacts
+- `thoughts/shared/designs/2026-01-17_09-00-00_rate-limiter-design.md` - Architectural design for sliding window rate limiter
+  - parent: `thoughts/shared/research/2026-01-15_10-45-00_rate-limiting-approaches.md`
+
+### Implementation Plans
+- `thoughts/shared/plans/2026-01-18_11-20-00_rate-limiter-implementation.md` - Phased plan for rate limits
+  - parent: `thoughts/shared/designs/2026-01-17_09-00-00_rate-limiter-design.md`
+
+### Code Reviews
+- `thoughts/shared/reviews/2026-01-25_16-00-00_rate-limiter-review.md` - Review of rate limiting implementation
+
+### Handoff Documents
+- `thoughts/shared/handoffs/2026-01-20_17-30-00_rate-limiter-handoff.md` - Session snapshot: rate limiter phase 1 complete
+
+### PR Descriptions
+- `thoughts/shared/prs/pr_456_rate_limiting.md` - PR that implemented basic rate limiting
+
+### Personal Notes
+- `thoughts/me/notes/meeting_2026_01_10.md` - Team discussion about rate limiting
+
+Total: 9 relevant documents found
+Artifact chain: research → design → plan (3 linked documents)
+```
+
+## Search Tips
+
+1. **Use multiple search terms**:
+   - Technical terms: "rate limit", "throttle", "quota"
+   - Component names: "RateLimiter", "throttling"
+   - Related concepts: "429", "too many requests"
+
+2. **Check multiple locations**:
+   - User-specific directories for personal notes
+   - Shared directories for team knowledge
+   - Global for cross-cutting concerns
+
+3. **Look for patterns**:
+   - Ticket files often named `eng_XXXX.md`
+   - Skill-generated files use `YYYY-MM-DD_HH-MM-SS_topic.md` (research, solutions, designs, plans, handoffs, reviews)
+   - Documents have YAML frontmatter with searchable `topic:`, `tags:`, `status:`, `parent:` fields
+
+4. **Follow artifact chains**:
+   - Research Questions → Research → Solutions → Designs → Plans → Reviews → Handoffs
+   - Check `parent:` in frontmatter to find related documents
+   - When you find one artifact, look for upstream/downstream artifacts on the same topic
+
+## Important Guidelines
+
+- **Don't read full file contents** - Just scan for relevance
+- **Preserve directory structure** - Show where documents live
+- **Be thorough** - Check all relevant subdirectories
+- **Group logically** - Make categories meaningful
+- **Note patterns** - Help user understand naming conventions
+
+## What NOT to Do
+
+- Don't analyze document contents deeply
+- Don't make judgments about document quality
+- Don't skip personal directories
+- Don't ignore old documents
+
+Remember: You're a document finder for the thoughts/ directory. Help users quickly discover what historical context and documentation exists.
--- a/extensions/rpiv-pi/agents/web-search-researcher.md
+++ b/extensions/rpiv-pi/agents/web-search-researcher.md
@@ -0,0 +1,107 @@
+---
+name: web-search-researcher
+description: Do you find yourself desiring information that you don't quite feel well-trained (confident) on? Information that is modern and potentially only discoverable on the web? Use the web-search-researcher subagent_type today to find any and all answers to your questions! It will research deeply to figure out and attempt to answer your questions! If you aren't immediately satisfied you can get your money back! (Not really - but you can re-run web-search-researcher with an altered prompt in the event you're not satisfied the first time)
+tools: web_search, web_fetch, read, grep, find, ls
+---
+
+You are an expert web research specialist focused on finding accurate, relevant information from web sources. Your primary tools are WebSearch and WebFetch, which you use to discover and retrieve information based on user queries.
+
+## Core Responsibilities
+
+When you receive a research query, you will:
+
+1. **Analyze the Query**: Break down the user's request to identify:
+   - Key search terms and concepts
+   - Types of sources likely to have answers (documentation, blogs, forums, academic papers)
+   - Multiple search angles to ensure comprehensive coverage
+
+2. **Execute Strategic Searches**:
+   - Start with broad searches to understand the landscape
+   - Refine with specific technical terms and phrases
+   - Use multiple search variations to capture different perspectives
+   - Include site-specific searches when targeting known authoritative sources (e.g., "site:docs.stripe.com webhook signature")
+
+3. **Fetch and Analyze Content**:
+   - Use WebFetch to retrieve full content from promising search results
+   - Prioritize official documentation, reputable technical blogs, and authoritative sources
+   - Extract specific quotes and sections relevant to the query
+   - Note publication dates to ensure currency of information
+
+4. **Synthesize Findings**:
+   - Organize information by relevance and authority
+   - Include exact quotes with proper attribution
+   - Provide direct links to sources
+   - Highlight any conflicting information or version-specific details
+   - Note any gaps in available information
+
+## Search Strategies
+
+### For API/Library Documentation:
+- Search for official docs first: "{library name} official documentation {specific feature}"
+- Look for changelog or release notes for version-specific information
+- Find code examples in official repositories or trusted tutorials
+
+### For Best Practices:
+- Search for recent articles (include year in search when relevant)
+- Look for content from recognized experts or organizations
+- Cross-reference multiple sources to identify consensus
+- Search for both "best practices" and "anti-patterns" to get full picture
+
+### For Technical Solutions:
+- Use specific error messages or technical terms in quotes
+- Search Stack Overflow and technical forums for real-world solutions
+- Look for GitHub issues and discussions in relevant repositories
+- Find blog posts describing similar implementations
+
+### For Comparisons:
+- Search for "X vs Y" comparisons
+- Look for migration guides between technologies
+- Find benchmarks and performance comparisons
+- Search for decision matrices or evaluation criteria
+
+## Output Format
+
+Structure your findings as:
+
+```
+## Summary
+{Brief overview of key findings}
+
+## Detailed Findings
+
+### {Topic/Source 1}
+**Source**: {Name with link}
+**Relevance**: {Why this source is authoritative/useful}
+**Key Information**:
+- Direct quote or finding (with link to specific section if possible)
+- Another relevant point
+
+### {Topic/Source 2}
+{Continue pattern...}
+
+## Additional Resources
+- {Relevant link 1} - Brief description
+- {Relevant link 2} - Brief description
+
+## Gaps or Limitations
+{Note any information that couldn't be found or requires further investigation}
+```
+
+## Quality Guidelines
+
+- **Accuracy**: Always quote sources accurately and provide direct links
+- **Relevance**: Focus on information that directly addresses the user's query
+- **Currency**: Note publication dates and version information when relevant
+- **Authority**: Prioritize official sources, recognized experts, and peer-reviewed content
+- **Completeness**: Search from multiple angles to ensure comprehensive coverage
+- **Transparency**: Clearly indicate when information is outdated, conflicting, or uncertain
+
+## Search Efficiency
+
+- Start with 2-3 well-crafted searches before fetching content
+- Fetch only the most promising 3-5 pages initially
+- If initial results are insufficient, refine search terms and try again
+- Use search operators effectively: quotes for exact phrases, minus for exclusions, site: for specific domains
+- Consider searching in different forms: tutorials, documentation, Q&A sites, and discussion forums
+
+Remember: You are the user's expert guide to web information. Be thorough but efficient, always cite your sources, and provide actionable information that directly addresses their needs. Think deeply as you work.
--- a/extensions/rpiv-pi/extensions/rpiv-core/agents.test.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/agents.test.ts
@@ -0,0 +1,194 @@
+import {
+	chmodSync,
+	existsSync,
+	mkdirSync,
+	mkdtempSync,
+	readdirSync,
+	readFileSync,
+	rmSync,
+	writeFileSync,
+} from "node:fs";
+import { tmpdir } from "node:os";
+import { join } from "node:path";
+import { afterEach, beforeEach, describe, expect, it, vi } from "vitest";
+import { BUNDLED_AGENTS_DIR, syncBundledAgents } from "./agents.js";
+
+let cwd: string;
+let targetDir: string;
+let manifestPath: string;
+
+beforeEach(() => {
+	cwd = mkdtempSync(join(tmpdir(), "rpiv-agents-"));
+	targetDir = join(cwd, ".pi", "agents");
+	manifestPath = join(targetDir, ".rpiv-managed.json");
+});
+afterEach(() => {
+	rmSync(cwd, { recursive: true, force: true });
+	vi.restoreAllMocks();
+});
+
+describe("syncBundledAgents — first run (empty target)", () => {
+	it("copies every source .md and writes manifest", () => {
+		const r = syncBundledAgents(cwd, false);
+		const bundled = readdirSync(BUNDLED_AGENTS_DIR).filter((f) => f.endsWith(".md"));
+		expect(r.added.sort()).toEqual(bundled.sort());
+		expect(r.updated).toEqual([]);
+		expect(r.errors).toEqual([]);
+		expect(existsSync(manifestPath)).toBe(true);
+		const manifest = JSON.parse(readFileSync(manifestPath, "utf-8"));
+		expect(manifest.sort()).toEqual(bundled.sort());
+	});
+});
+
+describe("syncBundledAgents — bootstrap-claim from manifest-less drift", () => {
+	it("claims pre-existing files matching bundled names as managed", () => {
+		const bundled = readdirSync(BUNDLED_AGENTS_DIR).filter((f) => f.endsWith(".md"));
+		if (bundled.length === 0) return;
+		mkdirSync(targetDir, { recursive: true });
+		writeFileSync(join(targetDir, bundled[0]), "drift content", "utf-8");
+		const r = syncBundledAgents(cwd, false);
+		expect(r.pendingUpdate).toContain(bundled[0]);
+		expect(readFileSync(join(targetDir, bundled[0]), "utf-8")).toBe("drift content");
+	});
+});
+
+describe("syncBundledAgents — apply=false (detect only)", () => {
+	it("reports pendingUpdate for changed managed files without touching them", () => {
+		const bundled = readdirSync(BUNDLED_AGENTS_DIR).filter((f) => f.endsWith(".md"));
+		if (bundled.length === 0) return;
+		syncBundledAgents(cwd, true);
+		writeFileSync(join(targetDir, bundled[0]), "user-modified", "utf-8");
+		const r = syncBundledAgents(cwd, false);
+		expect(r.pendingUpdate).toContain(bundled[0]);
+		expect(readFileSync(join(targetDir, bundled[0]), "utf-8")).toBe("user-modified");
+	});
+});
+
+describe("syncBundledAgents — apply=true (mutating sync)", () => {
+	it("overwrites changed managed files", () => {
+		const bundled = readdirSync(BUNDLED_AGENTS_DIR).filter((f) => f.endsWith(".md"));
+		if (bundled.length === 0) return;
+		syncBundledAgents(cwd, true);
+		writeFileSync(join(targetDir, bundled[0]), "user-modified", "utf-8");
+		const r = syncBundledAgents(cwd, true);
+		expect(r.updated).toContain(bundled[0]);
+		const srcContent = readFileSync(join(BUNDLED_AGENTS_DIR, bundled[0]), "utf-8");
+		expect(readFileSync(join(targetDir, bundled[0]), "utf-8")).toBe(srcContent);
+	});
+
+	it("removes stale managed files absent from source", () => {
+		mkdirSync(targetDir, { recursive: true });
+		writeFileSync(join(targetDir, "stale.md"), "x", "utf-8");
+		writeFileSync(manifestPath, JSON.stringify(["stale.md"]), "utf-8");
+		const r = syncBundledAgents(cwd, true);
+		expect(r.removed).toContain("stale.md");
+		expect(existsSync(join(targetDir, "stale.md"))).toBe(false);
+	});
+
+	it("leaves unchanged managed files alone", () => {
+		syncBundledAgents(cwd, true);
+		const r = syncBundledAgents(cwd, true);
+		expect(r.updated).toEqual([]);
+		expect(r.unchanged.length).toBeGreaterThan(0);
+	});
+});
+
+describe("syncBundledAgents — error paths", () => {
+	it.skipIf(process.platform === "win32")("collects copy error when dest is read-only", () => {
+		// Create a read-only target dir so copyFileSync fails with EACCES/EPERM
+		const bundled = readdirSync(BUNDLED_AGENTS_DIR).filter((f) => f.endsWith(".md"));
+		if (bundled.length === 0) return;
+		mkdirSync(targetDir, { recursive: true });
+		chmodSync(targetDir, 0o500);
+		try {
+			const r = syncBundledAgents(cwd, false);
+			// At least one copy op should have failed; otherwise nothing proves the error path
+			const errorTripped = r.errors.some((e) => e.op === "copy") || r.added.length < bundled.length;
+			expect(errorTripped).toBe(true);
+		} finally {
+			chmodSync(targetDir, 0o700);
+		}
+	});
+});
+
+describe("syncBundledAgents — stale-file detection (apply=false)", () => {
+	it("reports pendingRemove when a managed file has no matching source", () => {
+		mkdirSync(targetDir, { recursive: true });
+		writeFileSync(join(targetDir, "stale.md"), "x", "utf-8");
+		writeFileSync(manifestPath, JSON.stringify(["stale.md"]), "utf-8");
+		const r = syncBundledAgents(cwd, false);
+		expect(r.pendingRemove).toContain("stale.md");
+		expect(r.removed).toEqual([]);
+		expect(existsSync(join(targetDir, "stale.md"))).toBe(true);
+	});
+
+	it("keeps pendingRemove entries in the manifest so the next apply can finish removal", () => {
+		mkdirSync(targetDir, { recursive: true });
+		writeFileSync(join(targetDir, "stale.md"), "x", "utf-8");
+		writeFileSync(manifestPath, JSON.stringify(["stale.md"]), "utf-8");
+		syncBundledAgents(cwd, false);
+		const manifest = JSON.parse(readFileSync(manifestPath, "utf-8")) as string[];
+		expect(manifest).toContain("stale.md");
+	});
+
+	it("skips pendingRemove when the stale file no longer exists on disk", () => {
+		mkdirSync(targetDir, { recursive: true });
+		// Manifest claims stale.md but disk does not have it
+		writeFileSync(manifestPath, JSON.stringify(["stale.md"]), "utf-8");
+		const r = syncBundledAgents(cwd, false);
+		expect(r.pendingRemove).not.toContain("stale.md");
+		expect(r.removed).not.toContain("stale.md");
+	});
+});
+
+describe("syncBundledAgents — manifest robustness", () => {
+	it("treats a corrupt manifest (invalid JSON) as empty and re-bootstraps", () => {
+		mkdirSync(targetDir, { recursive: true });
+		writeFileSync(manifestPath, "{ not json ::", "utf-8");
+		const r = syncBundledAgents(cwd, false);
+		expect(r.errors).toEqual([]);
+		// After sync, the manifest should be valid JSON again.
+		const manifest = JSON.parse(readFileSync(manifestPath, "utf-8")) as string[];
+		expect(Array.isArray(manifest)).toBe(true);
+	});
+
+	it("treats a non-array manifest as empty and re-bootstraps", () => {
+		mkdirSync(targetDir, { recursive: true });
+		writeFileSync(manifestPath, JSON.stringify({ oops: true }), "utf-8");
+		const r = syncBundledAgents(cwd, false);
+		expect(r.errors).toEqual([]);
+		const manifest = JSON.parse(readFileSync(manifestPath, "utf-8")) as string[];
+		expect(Array.isArray(manifest)).toBe(true);
+	});
+
+	it("filters non-string manifest entries during parse", () => {
+		mkdirSync(targetDir, { recursive: true });
+		writeFileSync(join(targetDir, "unrelated.md"), "keep me", "utf-8");
+		// Write manifest containing mixed types (must be ignored per-entry rather than whole-file)
+		writeFileSync(manifestPath, JSON.stringify([42, null, "unrelated.md"]), "utf-8");
+		const r = syncBundledAgents(cwd, false);
+		expect(r.errors).toEqual([]);
+		// unrelated.md is not in source, so it will be tracked for pendingRemove
+		expect(r.pendingRemove).toContain("unrelated.md");
+	});
+});
+
+describe("syncBundledAgents — subsequent-run bookkeeping", () => {
+	it("reports unchanged (not added) on a second run with no changes", () => {
+		syncBundledAgents(cwd, true);
+		const r = syncBundledAgents(cwd, false);
+		expect(r.added).toEqual([]);
+		expect(r.updated).toEqual([]);
+		expect(r.pendingUpdate).toEqual([]);
+		expect(r.unchanged.length).toBeGreaterThan(0);
+	});
+
+	it("treats a destination file that was manually removed as a new add on next sync", () => {
+		syncBundledAgents(cwd, true);
+		const bundled = readdirSync(BUNDLED_AGENTS_DIR).filter((f) => f.endsWith(".md"));
+		if (bundled.length === 0) return;
+		rmSync(join(targetDir, bundled[0]));
+		const r = syncBundledAgents(cwd, false);
+		expect(r.added).toContain(bundled[0]);
+	});
+});
--- a/extensions/rpiv-pi/extensions/rpiv-core/agents.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/agents.ts
@@ -0,0 +1,268 @@
+/**
+ * Agent auto-copy — copies bundled agents into <cwd>/.pi/agents/.
+ *
+ * Pure utility. No ExtensionAPI interactions.
+ */
+
+import { copyFileSync, existsSync, mkdirSync, readdirSync, readFileSync, unlinkSync, writeFileSync } from "node:fs";
+import { dirname, join } from "node:path";
+import { fileURLToPath } from "node:url";
+
+// ---------------------------------------------------------------------------
+// Package-root resolution
+// ---------------------------------------------------------------------------
+
+/**
+ * Resolves the rpiv-pi package root from this module's file URL.
+ * Walks up from `extensions/rpiv-core/agents.ts` to the repo root.
+ */
+export const PACKAGE_ROOT = (() => {
+	const thisFile = fileURLToPath(import.meta.url);
+	// extensions/rpiv-core/agents.ts -> rpiv-pi/
+	return dirname(dirname(dirname(thisFile)));
+})();
+
+export const BUNDLED_AGENTS_DIR = join(PACKAGE_ROOT, "agents");
+
+// ---------------------------------------------------------------------------
+// Types
+// ---------------------------------------------------------------------------
+
+export interface SyncError {
+	file?: string;
+	op: "read-src" | "read-dest" | "copy" | "remove" | "manifest-read" | "manifest-write";
+	message: string;
+}
+
+export interface SyncResult {
+	/** New files copied (present in source, absent from destination). */
+	added: string[];
+	/** Existing managed files overwritten with updated source content. */
+	updated: string[];
+	/** Managed files whose destination content matches source exactly. */
+	unchanged: string[];
+	/** Stale managed files removed (present in manifest but absent from source). */
+	removed: string[];
+	/** Managed files with different destination content (detected but not applied). */
+	pendingUpdate: string[];
+	/** Managed files no longer in source (detected but not removed). */
+	pendingRemove: string[];
+	/** Per-file errors collected during sync. */
+	errors: SyncError[];
+}
+
+/** Create an empty SyncResult with all arrays initialized. */
+function emptySyncResult(): SyncResult {
+	return {
+		added: [],
+		updated: [],
+		unchanged: [],
+		removed: [],
+		pendingUpdate: [],
+		pendingRemove: [],
+		errors: [],
+	};
+}
+
+// ---------------------------------------------------------------------------
+// Manifest
+// ---------------------------------------------------------------------------
+
+const MANIFEST_FILE = ".rpiv-managed.json";
+
+/**
+ * Read the managed-file manifest from the target directory.
+ * Returns an empty array on missing/invalid/unreadable manifest.
+ * Fail-soft: never throws.
+ */
+function readManifest(targetDir: string): string[] {
+	const manifestPath = join(targetDir, MANIFEST_FILE);
+	if (!existsSync(manifestPath)) return [];
+	try {
+		const raw = readFileSync(manifestPath, "utf-8");
+		const parsed = JSON.parse(raw);
+		if (!Array.isArray(parsed)) return [];
+		return parsed.filter((e): e is string => typeof e === "string");
+	} catch {
+		return [];
+	}
+}
+
+/**
+ * Write the managed-file manifest to the target directory.
+ * Fail-soft: swallows write errors (permissions, disk full, etc.).
+ */
+function writeManifest(targetDir: string, filenames: string[]): void {
+	const manifestPath = join(targetDir, MANIFEST_FILE);
+	try {
+		writeFileSync(manifestPath, `${JSON.stringify(filenames, null, 2)}\n`, "utf-8");
+	} catch {
+		// non-fatal — sync results will still be correct for this run;
+		// next run will re-bootstrap if manifest is missing
+	}
+}
+
+/**
+ * Bootstrap the managed-file manifest on first run after upgrade.
+ *
+ * When no manifest exists, claims all existing destination files whose
+ * names match the current bundled source list as rpiv-managed.
+ * Writes the manifest and returns the managed set.
+ *
+ * If a manifest already exists, returns it as-is.
+ */
+function bootstrapManifest(targetDir: string, sourceNames: Set<string>): string[] {
+	const existing = readManifest(targetDir);
+	if (existing.length > 0) return existing;
+
+	const managed: string[] = [];
+	try {
+		const destEntries = readdirSync(targetDir).filter((f) => f.endsWith(".md"));
+		for (const name of destEntries) {
+			if (sourceNames.has(name)) {
+				managed.push(name);
+			}
+		}
+	} catch {
+		// dest dir may not exist yet — that's fine, empty manifest
+	}
+
+	writeManifest(targetDir, managed);
+	return managed;
+}
+
+// ---------------------------------------------------------------------------
+// Agent Sync Engine
+// ---------------------------------------------------------------------------
+
+/**
+ * Synchronize bundled agents from <PACKAGE_ROOT>/agents/ into <cwd>/.pi/agents/.
+ *
+ * When `apply` is false (session_start): adds new files only.
+ * Detects pending updates and removals without applying them.
+ * When `apply` is true (/rpiv-update-agents): adds new, overwrites changed
+ * managed files, removes stale managed files.
+ *
+ * Never throws — errors are collected in `result.errors`.
+ */
+export function syncBundledAgents(cwd: string, apply: boolean): SyncResult {
+	const result = emptySyncResult();
+
+	if (!existsSync(BUNDLED_AGENTS_DIR)) {
+		return result;
+	}
+
+	const targetDir = join(cwd, ".pi", "agents");
+	try {
+		mkdirSync(targetDir, { recursive: true });
+	} catch {
+		result.errors.push({ op: "manifest-write", message: "Failed to create target directory" });
+		return result;
+	}
+
+	// 1. Enumerate source files
+	let sourceEntries: string[];
+	try {
+		sourceEntries = readdirSync(BUNDLED_AGENTS_DIR).filter((f) => f.endsWith(".md"));
+	} catch {
+		result.errors.push({ op: "read-src", message: "Failed to read bundled agents directory" });
+		return result;
+	}
+
+	const sourceNames = new Set(sourceEntries);
+
+	// 2. Bootstrap manifest and get managed set
+	const managedNames = new Set(bootstrapManifest(targetDir, sourceNames));
+
+	// 3. Process each source file
+	for (const entry of sourceEntries) {
+		const src = join(BUNDLED_AGENTS_DIR, entry);
+		const dest = join(targetDir, entry);
+
+		if (!existsSync(dest)) {
+			try {
+				copyFileSync(src, dest);
+				result.added.push(entry);
+			} catch (e) {
+				result.errors.push({
+					file: entry,
+					op: "copy",
+					message: e instanceof Error ? e.message : String(e),
+				});
+			}
+			continue;
+		}
+
+		let srcContent: Buffer;
+		let destContent: Buffer;
+		try {
+			srcContent = readFileSync(src);
+		} catch (e) {
+			result.errors.push({
+				file: entry,
+				op: "read-src",
+				message: e instanceof Error ? e.message : String(e),
+			});
+			continue;
+		}
+		try {
+			destContent = readFileSync(dest);
+		} catch (e) {
+			result.errors.push({
+				file: entry,
+				op: "read-dest",
+				message: e instanceof Error ? e.message : String(e),
+			});
+			continue;
+		}
+
+		if (Buffer.compare(srcContent, destContent) === 0) {
+			result.unchanged.push(entry);
+		} else if (apply) {
+			try {
+				copyFileSync(src, dest);
+				result.updated.push(entry);
+			} catch (e) {
+				result.errors.push({
+					file: entry,
+					op: "copy",
+					message: e instanceof Error ? e.message : String(e),
+				});
+			}
+		} else {
+			result.pendingUpdate.push(entry);
+		}
+	}
+
+	// 4. Process stale managed files (in manifest but not in source)
+	for (const name of managedNames) {
+		if (sourceNames.has(name)) continue;
+
+		const destPath = join(targetDir, name);
+		if (!existsSync(destPath)) continue;
+
+		if (apply) {
+			try {
+				unlinkSync(destPath);
+				result.removed.push(name);
+			} catch (e) {
+				result.errors.push({
+					file: name,
+					op: "remove",
+					message: e instanceof Error ? e.message : String(e),
+				});
+			}
+		} else {
+			result.pendingRemove.push(name);
+		}
+	}
+
+	// 5. Update manifest to reflect what's currently managed on disk.
+	// apply=true: stale files were removed, so manifest = sourceEntries.
+	// apply=false: stale files still exist on disk and must stay tracked
+	// so the next apply can remove them.
+	const manifestEntries = apply ? sourceEntries : [...sourceEntries, ...result.pendingRemove];
+	writeManifest(targetDir, manifestEntries);
+
+	return result;
+}
--- a/extensions/rpiv-pi/extensions/rpiv-core/constants.test.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/constants.test.ts
@@ -0,0 +1,14 @@
+import { describe, expect, it } from "vitest";
+import { FLAG_DEBUG, MSG_TYPE_GIT_CONTEXT, MSG_TYPE_GUIDANCE } from "./constants.js";
+
+describe("rpiv-core constants", () => {
+	it("FLAG_DEBUG is the canonical debug-flag name", () => {
+		expect(FLAG_DEBUG).toBe("rpiv-debug");
+	});
+	it("MSG_TYPE_GIT_CONTEXT is the canonical git-context message type", () => {
+		expect(MSG_TYPE_GIT_CONTEXT).toBe("rpiv-git-context");
+	});
+	it("MSG_TYPE_GUIDANCE is the canonical guidance message type", () => {
+		expect(MSG_TYPE_GUIDANCE).toBe("rpiv-guidance");
+	});
+});
--- a/extensions/rpiv-pi/extensions/rpiv-core/constants.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/constants.ts
@@ -0,0 +1,3 @@
+export const FLAG_DEBUG = "rpiv-debug";
+export const MSG_TYPE_GIT_CONTEXT = "rpiv-git-context";
+export const MSG_TYPE_GUIDANCE = "rpiv-guidance";
--- a/extensions/rpiv-pi/extensions/rpiv-core/git-context.matcher.test.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/git-context.matcher.test.ts
@@ -0,0 +1,47 @@
+import { describe, expect, it } from "vitest";
+import { isGitMutatingCommand } from "./git-context.js";
+
+describe("isGitMutatingCommand — positives", () => {
+	const mutating = [
+		"git checkout main",
+		"git switch feature",
+		"git commit -m 'x'",
+		"git merge main",
+		"git rebase main",
+		"git pull",
+		"git reset --hard HEAD",
+		"git revert abc",
+		"git cherry-pick abc",
+		"git worktree add ../wt",
+		"git am < patch",
+		"git stash",
+	];
+	for (const cmd of mutating) {
+		it(`matches: ${cmd}`, () => {
+			expect(isGitMutatingCommand(cmd)).toBe(true);
+		});
+	}
+	it("matches when chained with preceding command", () => {
+		expect(isGitMutatingCommand("cd x && git commit")).toBe(true);
+	});
+});
+
+describe("isGitMutatingCommand — negatives", () => {
+	const nonMutating = [
+		"git status",
+		"git log",
+		"git diff",
+		"git rev-parse HEAD",
+		"git config user.name",
+		"gitmoji commit",
+		"git --version",
+	];
+	for (const cmd of nonMutating) {
+		it(`does NOT match: ${cmd}`, () => {
+			expect(isGitMutatingCommand(cmd)).toBe(false);
+		});
+	}
+	it("rejects empty string", () => {
+		expect(isGitMutatingCommand("")).toBe(false);
+	});
+});
--- a/extensions/rpiv-pi/extensions/rpiv-core/git-context.test.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/git-context.test.ts
@@ -0,0 +1,109 @@
+import { createMockPi, stubGitExec } from "@juicesharp/rpiv-test-utils";
+import { beforeEach, describe, expect, it } from "vitest";
+import { clearGitContextCache, getGitContext, resetInjectedMarker, takeGitContextIfChanged } from "./git-context.js";
+
+beforeEach(() => {
+	clearGitContextCache();
+	resetInjectedMarker();
+});
+
+describe("getGitContext", () => {
+	it("parses branch + commit + user from three exec calls", async () => {
+		const { pi } = createMockPi({
+			exec: stubGitExec({ branch: "main", commit: "abc1234", user: "alice" }) as never,
+		});
+		const ctx = await getGitContext(pi);
+		expect(ctx).toEqual({ branch: "main", commit: "abc1234", user: "alice" });
+	});
+
+	it("remaps literal HEAD to 'detached'", async () => {
+		const { pi } = createMockPi({
+			exec: stubGitExec({ branch: "HEAD", commit: "abc", user: "alice" }) as never,
+		});
+		const ctx = await getGitContext(pi);
+		expect(ctx?.branch).toBe("detached");
+	});
+
+	it("returns null when both branch and commit are empty (not a repo)", async () => {
+		const { pi } = createMockPi({ exec: stubGitExec({}) as never });
+		expect(await getGitContext(pi)).toBeNull();
+	});
+
+	it("falls back to process.env.USER when git config user.name errors", async () => {
+		const { pi } = createMockPi({
+			exec: stubGitExec({ branch: "main", commit: "abc", userError: new Error("no config") }) as never,
+		});
+		process.env.USER = "env-alice";
+		const ctx = await getGitContext(pi);
+		expect(ctx?.user).toBe("env-alice");
+	});
+
+	it("falls back to 'unknown' when neither git nor env has user", async () => {
+		const origUser = process.env.USER;
+		delete process.env.USER;
+		try {
+			const { pi } = createMockPi({
+				exec: stubGitExec({ branch: "main", commit: "abc", userError: new Error("x") }) as never,
+			});
+			const ctx = await getGitContext(pi);
+			expect(ctx?.user).toBe("unknown");
+		} finally {
+			if (origUser) process.env.USER = origUser;
+		}
+	});
+
+	it("memoises: subsequent calls do not re-exec", async () => {
+		const exec = stubGitExec({ branch: "main", commit: "abc", user: "alice" });
+		const { pi } = createMockPi({ exec: exec as never });
+		await getGitContext(pi);
+		await getGitContext(pi);
+		expect(exec).toHaveBeenCalledTimes(3); // 3 initial exec calls, no second-round
+	});
+
+	it("clearGitContextCache forces re-read", async () => {
+		const exec = stubGitExec({ branch: "main", commit: "abc", user: "alice" });
+		const { pi } = createMockPi({ exec: exec as never });
+		await getGitContext(pi);
+		clearGitContextCache();
+		await getGitContext(pi);
+		expect(exec).toHaveBeenCalledTimes(6);
+	});
+});
+
+describe("takeGitContextIfChanged", () => {
+	it("returns the context-line on first call", async () => {
+		const { pi } = createMockPi({
+			exec: stubGitExec({ branch: "main", commit: "abc", user: "alice" }) as never,
+		});
+		const r = await takeGitContextIfChanged(pi);
+		expect(r).toContain("- Branch: main");
+		expect(r).toContain("- Commit: abc");
+		expect(r).toContain("- User: alice");
+	});
+
+	it("returns null on second call when signature unchanged", async () => {
+		const { pi } = createMockPi({
+			exec: stubGitExec({ branch: "main", commit: "abc", user: "alice" }) as never,
+		});
+		await takeGitContextIfChanged(pi);
+		expect(await takeGitContextIfChanged(pi)).toBeNull();
+	});
+
+	it("re-emits after clearGitContextCache + resetInjectedMarker + signature change", async () => {
+		const { pi } = createMockPi({
+			exec: stubGitExec({ branch: "main", commit: "abc", user: "alice" }) as never,
+		});
+		await takeGitContextIfChanged(pi);
+		clearGitContextCache();
+		resetInjectedMarker();
+		const { pi: pi2 } = createMockPi({
+			exec: stubGitExec({ branch: "feature", commit: "def", user: "alice" }) as never,
+		});
+		expect(await takeGitContextIfChanged(pi2)).not.toBeNull();
+	});
+
+	it("returns null when not in a git repo", async () => {
+		const { pi } = createMockPi({ exec: stubGitExec({}) as never });
+		expect(await takeGitContextIfChanged(pi)).toBeNull();
+	});
+});
--- a/extensions/rpiv-pi/extensions/rpiv-core/git-context.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/git-context.ts
@@ -0,0 +1,79 @@
+/**
+ * Cached branch + short commit. Injected into the transcript once at
+ * session_start, re-injected on session_compact (transcript cleared) and
+ * only when the cached value changes (e.g. after a mutating git command).
+ * Two parallel `git rev-parse` calls — one call can't combine
+ * `--abbrev-ref` and `--short` cleanly because the `--abbrev-ref` mode
+ * persists to subsequent revs. git itself resolves worktree gitdir
+ * redirection, so either form is worktree-safe.
+ */
+
+import type { ExtensionAPI } from "@mariozechner/pi-coding-agent";
+
+type GitContext = { branch: string; commit: string; user: string };
+
+// Signature (branch+commit) of the last message pushed into the transcript.
+// null = transcript has nothing current and needs re-injection.
+let lastInjectedSig: string | null = null;
+
+// undefined = not loaded yet, null = not a git repo / failed, object = valid
+let cache: GitContext | null | undefined;
+
+export async function getGitContext(pi: ExtensionAPI): Promise<GitContext | null> {
+	if (cache !== undefined) return cache;
+	cache = await loadGitContext(pi);
+	return cache;
+}
+
+export function clearGitContextCache(): void {
+	cache = undefined;
+}
+
+// Detached HEAD emits literal "HEAD" for --abbrev-ref; remap so frontmatter is meaningful.
+async function loadGitContext(pi: ExtensionAPI): Promise<GitContext | null> {
+	try {
+		const [branchRes, commitRes] = await Promise.all([
+			pi.exec("git", ["rev-parse", "--abbrev-ref", "HEAD"], { timeout: 5000 }),
+			pi.exec("git", ["rev-parse", "--short", "HEAD"], { timeout: 5000 }),
+		]);
+		const rawBranch = branchRes.stdout.trim();
+		const commit = commitRes.stdout.trim();
+		if (!rawBranch && !commit) return null;
+		const branch = rawBranch === "HEAD" ? "detached" : rawBranch;
+		let user = "";
+		try {
+			const r2 = await pi.exec("git", ["config", "user.name"], { timeout: 5000 });
+			user = r2.stdout.trim();
+		} catch {
+			// fall through to env fallback
+		}
+		if (!user) user = process.env.USER || "unknown";
+		return {
+			branch: branch || "no-branch",
+			commit: commit || "no-commit",
+			user,
+		};
+	} catch {
+		return null;
+	}
+}
+
+export function resetInjectedMarker(): void {
+	lastInjectedSig = null;
+}
+
+// Returns the message content to inject, or null if the transcript is
+// already up-to-date or we're not in a git repo. Updates the marker
+// whenever it returns non-null.
+export async function takeGitContextIfChanged(pi: ExtensionAPI): Promise<string | null> {
+	const g = await getGitContext(pi);
+	if (!g) return null;
+	const sig = `${g.branch}\n${g.commit}\n${g.user}`;
+	if (sig === lastInjectedSig) return null;
+	lastInjectedSig = sig;
+	return `## Git Context\n- Branch: ${g.branch}\n- Commit: ${g.commit}\n- User: ${g.user}`;
+}
+
+export function isGitMutatingCommand(cmd: string): boolean {
+	return /\bgit\s+(checkout|switch|commit|merge|rebase|pull|reset|revert|cherry-pick|worktree|am|stash)\b/.test(cmd);
+}
--- a/extensions/rpiv-pi/extensions/rpiv-core/guidance.test.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/guidance.test.ts
@@ -0,0 +1,140 @@
+import { mkdtempSync, rmSync } from "node:fs";
+import { tmpdir } from "node:os";
+import { join } from "node:path";
+import { createMockPi, writeGuidanceTree } from "@juicesharp/rpiv-test-utils";
+import { afterEach, beforeEach, describe, expect, it, type vi } from "vitest";
+import { clearInjectionState, handleToolCallGuidance, injectRootGuidance, resolveGuidance } from "./guidance.js";
+
+let projectDir: string;
+
+beforeEach(() => {
+	projectDir = mkdtempSync(join(tmpdir(), "rpiv-guidance-"));
+	clearInjectionState();
+});
+afterEach(() => {
+	rmSync(projectDir, { recursive: true, force: true });
+});
+
+describe("resolveGuidance — ladder", () => {
+	it("AGENTS.md > CLAUDE.md > architecture.md at depth > 0", () => {
+		writeGuidanceTree(projectDir, {
+			"src/AGENTS.md": "agents-body",
+			"src/CLAUDE.md": "claude-body",
+			".rpiv/guidance/src/architecture.md": "arch-body",
+		});
+		const resolved = resolveGuidance(join(projectDir, "src", "foo.ts"), projectDir);
+		const srcEntry = resolved.find((r) => r.relativePath.startsWith("src/"));
+		expect(srcEntry?.kind).toBe("agents");
+	});
+
+	it("depth 0 skips AGENTS/CLAUDE but keeps root architecture.md", () => {
+		writeGuidanceTree(projectDir, {
+			"AGENTS.md": "root-agents",
+			".rpiv/guidance/architecture.md": "root-arch",
+		});
+		const resolved = resolveGuidance(join(projectDir, "any", "file.ts"), projectDir);
+		const rootEntry = resolved.find((r) => r.relativePath === ".rpiv/guidance/architecture.md");
+		expect(rootEntry?.kind).toBe("architecture");
+		expect(resolved.some((r) => r.relativePath === "AGENTS.md")).toBe(false);
+	});
+
+	it("returns root-first, specific-last order", () => {
+		writeGuidanceTree(projectDir, {
+			".rpiv/guidance/architecture.md": "root",
+			"a/AGENTS.md": "a",
+			"a/b/AGENTS.md": "ab",
+		});
+		const resolved = resolveGuidance(join(projectDir, "a", "b", "c.ts"), projectDir);
+		expect(resolved.map((r) => r.content)).toEqual(["root", "a", "ab"]);
+	});
+
+	it("returns empty when file is outside projectDir", () => {
+		expect(resolveGuidance("/totally/elsewhere/foo.ts", projectDir)).toEqual([]);
+	});
+
+	it("returns empty when nothing exists along the ladder", () => {
+		expect(resolveGuidance(join(projectDir, "x.ts"), projectDir)).toEqual([]);
+	});
+});
+
+describe("injectRootGuidance", () => {
+	it("sends root architecture.md when present", () => {
+		writeGuidanceTree(projectDir, { ".rpiv/guidance/architecture.md": "body" });
+		const { pi } = createMockPi();
+		injectRootGuidance(projectDir, pi);
+		expect(pi.sendMessage).toHaveBeenCalledTimes(1);
+		const content = (pi.sendMessage as ReturnType<typeof vi.fn>).mock.calls[0][0].content;
+		expect(content).toContain("body");
+		expect(content).toContain("reference material, NOT a task");
+		expect(content).toContain("auto-loaded at session start");
+	});
+
+	it("is idempotent across calls within a session", () => {
+		writeGuidanceTree(projectDir, { ".rpiv/guidance/architecture.md": "body" });
+		const { pi } = createMockPi();
+		injectRootGuidance(projectDir, pi);
+		injectRootGuidance(projectDir, pi);
+		expect(pi.sendMessage).toHaveBeenCalledTimes(1);
+	});
+
+	it("re-injects after clearInjectionState", () => {
+		writeGuidanceTree(projectDir, { ".rpiv/guidance/architecture.md": "body" });
+		const { pi } = createMockPi();
+		injectRootGuidance(projectDir, pi);
+		clearInjectionState();
+		injectRootGuidance(projectDir, pi);
+		expect(pi.sendMessage).toHaveBeenCalledTimes(2);
+	});
+
+	it("no-ops when root architecture.md is missing", () => {
+		const { pi } = createMockPi();
+		injectRootGuidance(projectDir, pi);
+		expect(pi.sendMessage).not.toHaveBeenCalled();
+	});
+});
+
+describe("handleToolCallGuidance", () => {
+	it("skips non-read/edit/write tools", () => {
+		const { pi } = createMockPi();
+		handleToolCallGuidance({ toolName: "bash", input: {} }, { cwd: projectDir }, pi);
+		expect(pi.sendMessage).not.toHaveBeenCalled();
+	});
+
+	it("dedupes per-file across multiple tool_calls", () => {
+		writeGuidanceTree(projectDir, { "src/AGENTS.md": "a" });
+		const { pi } = createMockPi();
+		const ev = { toolName: "read", input: { file_path: join(projectDir, "src", "x.ts") } };
+		handleToolCallGuidance(ev, { cwd: projectDir }, pi);
+		handleToolCallGuidance(ev, { cwd: projectDir }, pi);
+		expect(pi.sendMessage).toHaveBeenCalledTimes(1);
+	});
+
+	it("supports both 'path' and 'file_path' input keys", () => {
+		writeGuidanceTree(projectDir, { "src/AGENTS.md": "a" });
+		const { pi } = createMockPi();
+		handleToolCallGuidance(
+			{ toolName: "edit", input: { path: join(projectDir, "src", "x.ts") } },
+			{ cwd: projectDir },
+			pi,
+		);
+		expect(pi.sendMessage).toHaveBeenCalledTimes(1);
+	});
+
+	it("emits one sendMessage combining multiple newly-resolved files", () => {
+		writeGuidanceTree(projectDir, {
+			".rpiv/guidance/architecture.md": "root",
+			"src/AGENTS.md": "src",
+		});
+		const { pi } = createMockPi();
+		handleToolCallGuidance(
+			{ toolName: "write", input: { file_path: join(projectDir, "src", "x.ts") } },
+			{ cwd: projectDir },
+			pi,
+		);
+		expect(pi.sendMessage).toHaveBeenCalledTimes(1);
+		const content = (pi.sendMessage as ReturnType<typeof vi.fn>).mock.calls[0][0].content;
+		expect(content).toContain("root");
+		expect(content).toContain("src");
+		expect(content).toContain("auto-loaded because write touched src/x.ts");
+	});
+});
--- a/extensions/rpiv-pi/extensions/rpiv-core/guidance.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/guidance.ts
@@ -0,0 +1,235 @@
+/**
+ * Guidance injection — resolves and injects subfolder guidance files.
+ *
+ * At each directory depth from project root down to the touched file's
+ * directory, picks the first existing of:
+ *   AGENTS.md > CLAUDE.md > .rpiv/guidance/<sub>/architecture.md
+ *
+ * Depth 0 (project root) skips AGENTS.md/CLAUDE.md because Pi's own
+ * resource-loader (loadContextFileFromDir at resource-loader.js:30-46)
+ * already loads <cwd>/AGENTS.md or <cwd>/CLAUDE.md into the system
+ * prompt's # Project Context block. Depth 0 still checks
+ * <cwd>/.rpiv/guidance/architecture.md — Pi's loader does not see that
+ * path.
+ *
+ * `resolveGuidance` is pure logic with no ExtensionAPI references
+ * (utility-module rule from extensions/rpiv-core/CLAUDE.md). Side
+ * effects (sendMessage, in-memory dedup Set) live in
+ * `handleToolCallGuidance`, `injectRootGuidance`, and
+ * `clearInjectionState`.
+ */
+
+import { existsSync, readFileSync } from "node:fs";
+import { dirname, isAbsolute, join, relative, sep } from "node:path";
+import type { ExtensionAPI } from "@mariozechner/pi-coding-agent";
+import { FLAG_DEBUG, MSG_TYPE_GUIDANCE } from "./constants.js";
+
+// ---------------------------------------------------------------------------
+// Guidance Resolution
+// ---------------------------------------------------------------------------
+
+type GuidanceKind = "agents" | "claude" | "architecture";
+
+interface GuidanceFile {
+	/** Forward-slash-normalized path from project root — stable dedup key. */
+	relativePath: string;
+	absolutePath: string;
+	content: string;
+	kind: GuidanceKind;
+}
+
+/**
+ * Resolve guidance files for a given file path.
+ *
+ * Walks from project root to the file's directory. At each depth, picks
+ * the first existing of AGENTS.md > CLAUDE.md > architecture.md (Pi's
+ * own per-dir precedence at resource-loader.js:30-46, extended with
+ * architecture.md as a third candidate). Depth 0 only checks
+ * architecture.md — Pi's loader already handles <cwd>/AGENTS.md and
+ * <cwd>/CLAUDE.md.
+ *
+ * Returns files root-first (general → specific), at most one per depth.
+ */
+export function resolveGuidance(filePath: string, projectDir: string): GuidanceFile[] {
+	const fileDir = dirname(filePath);
+	const relativeDir = relative(projectDir, fileDir);
+
+	// Guard: file is outside project root
+	if (relativeDir.startsWith("..") || isAbsolute(relativeDir)) {
+		return [];
+	}
+
+	const parts = relativeDir ? relativeDir.split(sep) : [];
+	const results: GuidanceFile[] = [];
+
+	for (let depth = 0; depth <= parts.length; depth++) {
+		const subPath = parts.slice(0, depth).join(sep);
+
+		// Per-depth candidate ladder. First-match wins.
+		const candidates: Array<{ relative: string; kind: GuidanceKind }> = [];
+
+		// Depth 0: skip AGENTS/CLAUDE — Pi's loader handles <cwd> already.
+		if (depth > 0) {
+			candidates.push({ relative: join(subPath, "AGENTS.md"), kind: "agents" });
+			candidates.push({ relative: join(subPath, "CLAUDE.md"), kind: "claude" });
+		}
+		candidates.push({
+			relative: subPath
+				? join(".rpiv", "guidance", subPath, "architecture.md")
+				: join(".rpiv", "guidance", "architecture.md"),
+			kind: "architecture",
+		});
+
+		for (const candidate of candidates) {
+			const absolute = join(projectDir, candidate.relative);
+			if (existsSync(absolute)) {
+				results.push({
+					relativePath: candidate.relative.split(sep).join("/"),
+					absolutePath: absolute,
+					content: readFileSync(absolute, "utf-8"),
+					kind: candidate.kind,
+				});
+				break; // first-match wins at this depth
+			}
+		}
+	}
+
+	return results;
+}
+
+// ---------------------------------------------------------------------------
+// Session State
+// ---------------------------------------------------------------------------
+
+/** In-memory set of injected guidance paths per session. */
+const injectedGuidance = new Set<string>();
+
+export function clearInjectionState() {
+	injectedGuidance.clear();
+}
+
+// ---------------------------------------------------------------------------
+// Root Guidance Injection (session_start)
+// ---------------------------------------------------------------------------
+
+/**
+ * Inject the root `.rpiv/guidance/architecture.md` at session start.
+ *
+ * Called from `session_start` so the root guidance is available before the
+ * first agent turn — without waiting for a read/edit/write tool_call.
+ * Uses the same `injectedGuidance` Set for dedup, so `handleToolCallGuidance`
+ * won't re-inject it later.
+ */
+export function injectRootGuidance(cwd: string, pi: ExtensionAPI): void {
+	const relativePath = ".rpiv/guidance/architecture.md";
+
+	if (injectedGuidance.has(relativePath)) return;
+
+	const absolutePath = join(cwd, relativePath);
+	if (!existsSync(absolutePath)) return;
+
+	let content: string;
+	try {
+		content = readFileSync(absolutePath, "utf-8");
+	} catch {
+		// Silent failure mirrors handleToolCallGuidance's posture — session_start
+		// runs before any UI is bound, so a permissions/race error here must not
+		// crash the hook. Don't mark as injected so a later tool_call can retry.
+		return;
+	}
+	injectedGuidance.add(relativePath);
+
+	const file: GuidanceFile = { relativePath, absolutePath, content, kind: "architecture" };
+	pi.sendMessage({
+		customType: MSG_TYPE_GUIDANCE,
+		content: wrapGuidance(formatLabel(file), content, "auto-loaded at session start"),
+		display: !!pi.getFlag(FLAG_DEBUG),
+	});
+}
+
+// ---------------------------------------------------------------------------
+// Tool-call Handler
+// ---------------------------------------------------------------------------
+
+/**
+ * Handle guidance injection on tool_call events for read/edit/write.
+ * Sends hidden messages via pi.sendMessage as a side effect.
+ */
+export function handleToolCallGuidance(
+	event: { toolName: string; input: Record<string, unknown> },
+	ctx: { cwd: string },
+	pi: ExtensionAPI,
+): void {
+	if (!["read", "edit", "write"].includes(event.toolName)) return;
+
+	const filePath = (event.input as any).file_path ?? (event.input as any).path;
+	if (!filePath) return;
+
+	const resolved = resolveGuidance(filePath, ctx.cwd);
+	if (resolved.length === 0) return;
+
+	const newFiles = resolved.filter((g) => !injectedGuidance.has(g.relativePath));
+	if (newFiles.length === 0) return;
+
+	// Mark before sendMessage — idempotence > reliability.
+	for (const g of newFiles) {
+		injectedGuidance.add(g.relativePath);
+	}
+
+	const trigger = `auto-loaded because ${event.toolName} touched ${shortenPath(filePath, ctx.cwd)}`;
+	const contextParts = newFiles.map((g) => wrapGuidance(formatLabel(g), g.content, trigger));
+
+	pi.sendMessage({
+		customType: MSG_TYPE_GUIDANCE,
+		content: contextParts.join("\n\n---\n\n"),
+		display: !!pi.getFlag(FLAG_DEBUG),
+	});
+}
+
+/**
+ * Wrap guidance content in a non-task envelope. The opening disclaimer tells
+ * the agent this block is reference material — not an instruction — and states
+ * the trigger so the agent can judge whether the block is relevant to the
+ * current user request. Heading is `## Architecture Guidance:` to match the
+ * `PreToolUse:Read` hook output and the actual content (architecture.md).
+ */
+function wrapGuidance(label: string, content: string, trigger: string): string {
+	return [
+		`[rpiv-guidance — reference material, NOT a task. ${trigger}.`,
+		`Consult only if directly relevant to the user's current request; otherwise ignore.]`,
+		"",
+		`## Architecture Guidance: ${label}`,
+		"",
+		content,
+	].join("\n");
+}
+
+/**
+ * Render a project-relative, forward-slash-normalized path for the trigger
+ * disclaimer. Falls back to the absolute path if the file lives outside the
+ * project root (defensive — `handleToolCallGuidance` already short-circuits
+ * via `resolveGuidance` in that case, so this branch is unreachable today).
+ */
+function shortenPath(filePath: string, cwd: string): string {
+	const r = relative(cwd, filePath);
+	return r && !r.startsWith("..") ? r.split(sep).join("/") : filePath;
+}
+
+/**
+ * Format a guidance file's heading label.
+ *   extensions/rpiv-core/AGENTS.md          → "extensions/rpiv-core (AGENTS.md)"
+ *   scripts/CLAUDE.md                       → "scripts (CLAUDE.md)"
+ *   .rpiv/guidance/scripts/architecture.md  → "scripts (architecture.md)"
+ *   .rpiv/guidance/architecture.md          → "root (architecture.md)"
+ */
+function formatLabel(g: GuidanceFile): string {
+	if (g.kind === "architecture") {
+		const stripped = g.relativePath.replace(/^\.rpiv\/guidance\//, "");
+		const sub = stripped === "architecture.md" ? "" : stripped.replace(/\/architecture\.md$/, "");
+		return `${sub || "root"} (architecture.md)`;
+	}
+	const fileName = g.kind === "agents" ? "AGENTS.md" : "CLAUDE.md";
+	const idx = g.relativePath.lastIndexOf("/");
+	const sub = idx > 0 ? g.relativePath.slice(0, idx) : "";
+	return `${sub || "root"} (${fileName})`;
+}
--- a/extensions/rpiv-pi/extensions/rpiv-core/index.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/index.ts
@@ -0,0 +1,25 @@
+/**
+ * rpiv-core — Pure-orchestrator extension for rpiv-pi.
+ *
+ * Composes session hooks and the two slash commands. All logic lives in the
+ * registrar modules; this file is the table of contents.
+ *
+ * Tool-owning plugins are siblings (see siblings.ts); install via /rpiv-setup.
+ */
+
+import type { ExtensionAPI } from "@mariozechner/pi-coding-agent";
+import { FLAG_DEBUG } from "./constants.js";
+import { registerSessionHooks } from "./session-hooks.js";
+import { registerSetupCommand } from "./setup-command.js";
+import { registerUpdateAgentsCommand } from "./update-agents-command.js";
+
+export default function (pi: ExtensionAPI) {
+	pi.registerFlag(FLAG_DEBUG, {
+		description: "Show injected guidance and git-context messages",
+		type: "boolean",
+		default: false,
+	});
+	registerSessionHooks(pi);
+	registerUpdateAgentsCommand(pi);
+	registerSetupCommand(pi);
+}
--- a/extensions/rpiv-pi/extensions/rpiv-core/package-checks.test.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/package-checks.test.ts
@@ -0,0 +1,59 @@
+import { mkdirSync, writeFileSync } from "node:fs";
+import { dirname, join } from "node:path";
+import { describe, expect, it } from "vitest";
+import { findMissingSiblings } from "./package-checks.js";
+import { SIBLINGS } from "./siblings.js";
+
+const SETTINGS_PATH = join(process.env.HOME!, ".pi", "agent", "settings.json");
+
+function writeSettings(contents: unknown) {
+	mkdirSync(dirname(SETTINGS_PATH), { recursive: true });
+	writeFileSync(SETTINGS_PATH, JSON.stringify(contents), "utf-8");
+}
+
+describe("findMissingSiblings", () => {
+	it("returns all 7 siblings when settings.json is missing", () => {
+		expect(findMissingSiblings()).toHaveLength(SIBLINGS.length);
+	});
+
+	it("returns all 7 siblings when JSON is invalid", () => {
+		mkdirSync(dirname(SETTINGS_PATH), { recursive: true });
+		writeFileSync(SETTINGS_PATH, "{not json", "utf-8");
+		expect(findMissingSiblings()).toHaveLength(SIBLINGS.length);
+	});
+
+	it("returns all 7 siblings when packages field is absent", () => {
+		writeSettings({ other: "data" });
+		expect(findMissingSiblings()).toHaveLength(SIBLINGS.length);
+	});
+
+	it("returns all 7 siblings when packages is not an array", () => {
+		writeSettings({ packages: "not-array" });
+		expect(findMissingSiblings()).toHaveLength(SIBLINGS.length);
+	});
+
+	it("filters out non-string entries defensively", () => {
+		writeSettings({ packages: [null, 42, "@juicesharp/rpiv-todo"] });
+		const missing = findMissingSiblings();
+		expect(missing.find((s) => s.matches.test("@juicesharp/rpiv-todo"))).toBeUndefined();
+	});
+
+	it("matches case-insensitively", () => {
+		writeSettings({ packages: ["@JUICESHARP/RPIV-TODO"] });
+		const missing = findMissingSiblings();
+		expect(missing.find((s) => s.matches.test("@juicesharp/rpiv-todo"))).toBeUndefined();
+	});
+
+	it("rpiv-args word-boundary: treats rpiv-args-extended as non-install", () => {
+		writeSettings({ packages: ["@juicesharp/rpiv-args-extended"] });
+		const missing = findMissingSiblings();
+		expect(missing.find((s) => s.pkg.endsWith("/rpiv-args"))).toBeDefined();
+	});
+
+	it("returns [] when all 7 siblings are installed", () => {
+		writeSettings({
+			packages: SIBLINGS.map((s) => s.pkg.replace(/^npm:/, "")),
+		});
+		expect(findMissingSiblings()).toEqual([]);
+	});
+});
--- a/extensions/rpiv-pi/extensions/rpiv-core/package-checks.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/package-checks.ts
@@ -0,0 +1,33 @@
+/**
+ * Detect which SIBLINGS are installed by reading ~/.pi/agent/settings.json.
+ * Pure utility — no ExtensionAPI.
+ */
+
+import { existsSync, readFileSync } from "node:fs";
+import { homedir } from "node:os";
+import { join } from "node:path";
+import { SIBLINGS, type SiblingPlugin } from "./siblings.js";
+
+const PI_AGENT_SETTINGS = join(homedir(), ".pi", "agent", "settings.json");
+
+function readInstalledPackages(): string[] {
+	if (!existsSync(PI_AGENT_SETTINGS)) return [];
+	try {
+		const raw = readFileSync(PI_AGENT_SETTINGS, "utf-8");
+		const settings = JSON.parse(raw) as { packages?: unknown };
+		if (!Array.isArray(settings.packages)) return [];
+		return settings.packages.filter((e): e is string => typeof e === "string");
+	} catch {
+		return [];
+	}
+}
+
+/**
+ * Return the SIBLINGS not currently installed.
+ * Reads ~/.pi/agent/settings.json once per call — callers that need both the
+ * full snapshot and the missing subset should call this once and filter.
+ */
+export function findMissingSiblings(): SiblingPlugin[] {
+	const installed = readInstalledPackages();
+	return SIBLINGS.filter((s) => !installed.some((entry) => s.matches.test(entry)));
+}
--- a/extensions/rpiv-pi/extensions/rpiv-core/pi-installer.test.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/pi-installer.test.ts
@@ -0,0 +1,100 @@
+import { makeSpawnStub } from "@juicesharp/rpiv-test-utils";
+import { beforeEach, describe, expect, it, vi } from "vitest";
+
+vi.mock("node:child_process", () => ({ spawn: vi.fn() }));
+
+import { spawn } from "node:child_process";
+import { spawnPiInstall } from "./pi-installer.js";
+
+beforeEach(() => {
+	vi.mocked(spawn).mockReset();
+});
+
+describe("spawnPiInstall — success path", () => {
+	it("resolves with exit 0 + buffered stdout/stderr", async () => {
+		vi.mocked(spawn).mockImplementationOnce(
+			() => makeSpawnStub({ stdout: "installed\n", stderr: "", exitCode: 0 }) as unknown as ReturnType<typeof spawn>,
+		);
+		const r = await spawnPiInstall("@x/y", 30_000);
+		expect(r).toEqual({ code: 0, stdout: "installed\n", stderr: "" });
+	});
+});
+
+describe("spawnPiInstall — non-zero exit", () => {
+	it("returns exit code and accumulated stderr", async () => {
+		vi.mocked(spawn).mockImplementationOnce(
+			() => makeSpawnStub({ stdout: "", stderr: "fail\n", exitCode: 2 }) as unknown as ReturnType<typeof spawn>,
+		);
+		const r = await spawnPiInstall("@x/y", 30_000);
+		expect(r.code).toBe(2);
+		expect(r.stderr).toBe("fail\n");
+	});
+
+	it("fallback code=1 when close emits null", async () => {
+		const stub = makeSpawnStub({ neverSettles: true });
+		vi.mocked(spawn).mockImplementationOnce(() => stub as unknown as ReturnType<typeof spawn>);
+		const promise = spawnPiInstall("@x/y", 30_000);
+		stub.emit("close", null);
+		const r = await promise;
+		expect(r.code).toBe(1);
+	});
+});
+
+describe("spawnPiInstall — error event before close", () => {
+	it("settles with code=1 + error.message in stderr", async () => {
+		vi.mocked(spawn).mockImplementationOnce(
+			() => makeSpawnStub({ error: new Error("ENOENT pi") }) as unknown as ReturnType<typeof spawn>,
+		);
+		const r = await spawnPiInstall("@x/y", 30_000);
+		expect(r.code).toBe(1);
+		expect(r.stderr).toContain("ENOENT pi");
+	});
+});
+
+describe("spawnPiInstall — timeout", () => {
+	it("kills with SIGTERM at timeout and resolves with code 124", async () => {
+		vi.useFakeTimers();
+		const stub = makeSpawnStub({ neverSettles: true });
+		const killSpy = vi.spyOn(stub, "kill");
+		vi.mocked(spawn).mockImplementationOnce(() => stub as unknown as ReturnType<typeof spawn>);
+		const promise = spawnPiInstall("@x/y", 30_000);
+		await vi.advanceTimersByTimeAsync(30_000);
+		vi.useRealTimers();
+		const r = await promise;
+		expect(killSpy).toHaveBeenCalledWith("SIGTERM");
+		expect(r.code).toBe(124);
+		expect(r.stderr).toContain("timed out");
+	});
+});
+
+describe("spawnPiInstall — settle idempotence", () => {
+	it("only resolves once even if close fires after timeout", async () => {
+		vi.useFakeTimers();
+		const stub = makeSpawnStub({ neverSettles: true });
+		vi.mocked(spawn).mockImplementationOnce(() => stub as unknown as ReturnType<typeof spawn>);
+		const promise = spawnPiInstall("@x/y", 30_000);
+		await vi.advanceTimersByTimeAsync(30_000);
+		stub.emit("close", 0); // late close — must not replace the timeout result
+		vi.useRealTimers();
+		const r = await promise;
+		expect(r.code).toBe(124);
+	});
+});
+
+describe("spawnPiInstall — Windows branch", () => {
+	it("invokes via cmd.exe /c pi install on win32", async () => {
+		const origPlatform = process.platform;
+		Object.defineProperty(process, "platform", { value: "win32", configurable: true });
+		try {
+			vi.mocked(spawn).mockImplementationOnce(
+				() => makeSpawnStub({ exitCode: 0 }) as unknown as ReturnType<typeof spawn>,
+			);
+			await spawnPiInstall("@x/y", 30_000);
+			const firstCall = vi.mocked(spawn).mock.calls[0];
+			expect(firstCall[0]).toBe("cmd.exe");
+			expect(firstCall[1]).toEqual(["/c", "pi", "install", "@x/y"]);
+		} finally {
+			Object.defineProperty(process, "platform", { value: origPlatform, configurable: true });
+		}
+	});
+});
--- a/extensions/rpiv-pi/extensions/rpiv-core/pi-installer.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/pi-installer.ts
@@ -0,0 +1,59 @@
+/**
+ * Windows-safe wrapper around `pi install <pkg>`.
+ *
+ * Pi's own `pi.exec` calls `child_process.spawn(cmd, args, { shell: false })`,
+ * which cannot launch `.cmd`/`.bat` shims on Windows — npm installs `pi` as
+ * `pi.cmd`, so on Windows the spawn ENOENTs silently and the caller sees only
+ * `exit 1`. We side-step it here by invoking via `cmd.exe /c` on Windows.
+ */
+
+import { spawn } from "node:child_process";
+
+export interface PiInstallResult {
+	code: number;
+	stdout: string;
+	stderr: string;
+}
+
+export function spawnPiInstall(pkg: string, timeoutMs: number): Promise<PiInstallResult> {
+	return new Promise((resolve) => {
+		const isWindows = process.platform === "win32";
+		const [cmd, args, spawnOpts] = isWindows
+			? (["cmd.exe", ["/c", "pi", "install", pkg], { windowsHide: true }] as const)
+			: (["pi", ["install", pkg], {}] as const);
+
+		let settled = false;
+		let stdout = "";
+		let stderr = "";
+
+		const proc = spawn(cmd, args, { ...spawnOpts, stdio: ["ignore", "pipe", "pipe"] });
+		proc.stdout?.on("data", (d) => {
+			stdout += d.toString();
+		});
+		proc.stderr?.on("data", (d) => {
+			stderr += d.toString();
+		});
+
+		const settle = (result: PiInstallResult) => {
+			if (settled) return;
+			settled = true;
+			clearTimeout(timer);
+			resolve(result);
+		};
+
+		const timer = setTimeout(() => {
+			proc.kill("SIGTERM");
+			setTimeout(() => {
+				if (!proc.killed) proc.kill("SIGKILL");
+			}, 5000);
+			settle({ code: 124, stdout, stderr: `${stderr}\n[timed out after ${timeoutMs}ms]` });
+		}, timeoutMs);
+
+		proc.on("error", (err) => {
+			settle({ code: 1, stdout, stderr: stderr + (stderr ? "\n" : "") + err.message });
+		});
+		proc.on("close", (code) => {
+			settle({ code: code ?? 1, stdout, stderr });
+		});
+	});
+}
--- a/extensions/rpiv-pi/extensions/rpiv-core/prune-legacy-siblings.test.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/prune-legacy-siblings.test.ts
@@ -0,0 +1,162 @@
+import { mkdirSync, readFileSync, writeFileSync } from "node:fs";
+import { dirname, join } from "node:path";
+import { describe, expect, it } from "vitest";
+import { findLegacySiblings, pruneLegacySiblings } from "./prune-legacy-siblings.js";
+
+const SETTINGS_PATH = join(process.env.HOME!, ".pi", "agent", "settings.json");
+
+function writeSettings(contents: unknown): void {
+	mkdirSync(dirname(SETTINGS_PATH), { recursive: true });
+	writeFileSync(SETTINGS_PATH, JSON.stringify(contents), "utf-8");
+}
+
+function readSettings(): unknown {
+	return JSON.parse(readFileSync(SETTINGS_PATH, "utf-8"));
+}
+
+describe("pruneLegacySiblings", () => {
+	it("no settings file → pruned: []", () => {
+		expect(pruneLegacySiblings()).toEqual({ pruned: [] });
+	});
+
+	it("invalid JSON → pruned: [], file byte-exact unchanged", () => {
+		mkdirSync(dirname(SETTINGS_PATH), { recursive: true });
+		writeFileSync(SETTINGS_PATH, "{not json", "utf-8");
+		expect(pruneLegacySiblings()).toEqual({ pruned: [] });
+		expect(readFileSync(SETTINGS_PATH, "utf-8")).toBe("{not json");
+	});
+
+	it("non-object top-level (array) → pruned: [], file unchanged", () => {
+		writeSettings([1, 2, 3]);
+		expect(pruneLegacySiblings()).toEqual({ pruned: [] });
+		expect(readSettings()).toEqual([1, 2, 3]);
+	});
+
+	it("no packages field → pruned: []", () => {
+		writeSettings({ other: "data" });
+		expect(pruneLegacySiblings()).toEqual({ pruned: [] });
+		expect(readSettings()).toEqual({ other: "data" });
+	});
+
+	it("non-array packages field → pruned: []", () => {
+		writeSettings({ packages: "not-array" });
+		expect(pruneLegacySiblings()).toEqual({ pruned: [] });
+	});
+
+	it("only non-legacy entries → pruned: [], file unchanged", () => {
+		writeSettings({
+			packages: ["npm:pi-perplexity", "npm:@juicesharp/rpiv-todo", "npm:@tintinweb/pi-subagents"],
+		});
+		const before = readFileSync(SETTINGS_PATH, "utf-8");
+		expect(pruneLegacySiblings()).toEqual({ pruned: [] });
+		expect(readFileSync(SETTINGS_PATH, "utf-8")).toBe(before);
+	});
+
+	it("legacy-only: removes pi-subagents (nicobailon fork), preserves other top-level keys", () => {
+		writeSettings({
+			defaultProvider: "zai",
+			theme: "dark",
+			packages: ["npm:pi-subagents"],
+		});
+		const result = pruneLegacySiblings();
+		expect(result.pruned).toEqual(["npm:pi-subagents"]);
+		expect(readSettings()).toEqual({
+			defaultProvider: "zai",
+			theme: "dark",
+			packages: [],
+		});
+	});
+
+	it("mixed list: prunes nicobailon's pi-subagents only, preserves @tintinweb/pi-subagents and other entries", () => {
+		writeSettings({
+			packages: [
+				"npm:pi-perplexity",
+				"npm:@tintinweb/pi-subagents",
+				"npm:@juicesharp/rpiv-todo",
+				"/Users/x/rpiv-mono/packages/rpiv-pi",
+				null,
+				42,
+				"npm:pi-subagents",
+			],
+		});
+		const result = pruneLegacySiblings();
+		expect(result.pruned).toEqual(["npm:pi-subagents"]);
+		expect(readSettings()).toEqual({
+			packages: [
+				"npm:pi-perplexity",
+				"npm:@tintinweb/pi-subagents",
+				"npm:@juicesharp/rpiv-todo",
+				"/Users/x/rpiv-mono/packages/rpiv-pi",
+				null,
+				42,
+			],
+		});
+	});
+
+	it("idempotent: second call after prune is a no-op", () => {
+		writeSettings({
+			packages: ["npm:pi-subagents"],
+		});
+		expect(pruneLegacySiblings().pruned).toEqual(["npm:pi-subagents"]);
+		expect(pruneLegacySiblings()).toEqual({ pruned: [] });
+	});
+
+	it("case-insensitive match", () => {
+		writeSettings({
+			packages: ["NPM:Pi-Subagents"],
+		});
+		expect(pruneLegacySiblings().pruned).toEqual(["NPM:Pi-Subagents"]);
+	});
+});
+
+describe("findLegacySiblings (read-only scan)", () => {
+	it("no settings file → []", () => {
+		expect(findLegacySiblings()).toEqual([]);
+	});
+
+	it("invalid JSON → []", () => {
+		mkdirSync(dirname(SETTINGS_PATH), { recursive: true });
+		writeFileSync(SETTINGS_PATH, "{not json", "utf-8");
+		expect(findLegacySiblings()).toEqual([]);
+	});
+
+	it("non-object top-level → []", () => {
+		writeSettings([1, 2, 3]);
+		expect(findLegacySiblings()).toEqual([]);
+	});
+
+	it("no packages field → []", () => {
+		writeSettings({ other: "data" });
+		expect(findLegacySiblings()).toEqual([]);
+	});
+
+	it("non-array packages field → []", () => {
+		writeSettings({ packages: "not-array" });
+		expect(findLegacySiblings()).toEqual([]);
+	});
+
+	it("only non-legacy entries → []", () => {
+		writeSettings({
+			packages: ["npm:pi-perplexity", "npm:@juicesharp/rpiv-todo", "npm:@tintinweb/pi-subagents"],
+		});
+		expect(findLegacySiblings()).toEqual([]);
+	});
+
+	it("returns legacy entries without mutating settings.json", () => {
+		writeSettings({
+			defaultProvider: "zai",
+			packages: ["npm:pi-subagents", "npm:@juicesharp/rpiv-todo"],
+		});
+		const before = readFileSync(SETTINGS_PATH, "utf-8");
+		expect(findLegacySiblings()).toEqual(["npm:pi-subagents"]);
+		expect(readFileSync(SETTINGS_PATH, "utf-8")).toBe(before);
+	});
+
+	it("idempotent: repeat call returns the same list and does not mutate", () => {
+		writeSettings({ packages: ["npm:pi-subagents"] });
+		const before = readFileSync(SETTINGS_PATH, "utf-8");
+		expect(findLegacySiblings()).toEqual(["npm:pi-subagents"]);
+		expect(findLegacySiblings()).toEqual(["npm:pi-subagents"]);
+		expect(readFileSync(SETTINGS_PATH, "utf-8")).toBe(before);
+	});
+});
--- a/extensions/rpiv-pi/extensions/rpiv-core/prune-legacy-siblings.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/prune-legacy-siblings.ts
@@ -0,0 +1,95 @@
+/**
+ * Detect + remove deprecated sibling package entries from
+ * ~/.pi/agent/settings.json.
+ *
+ * Split into two phases so /rpiv-setup can preview pending changes in the
+ * confirmation dialog and apply the mutation only after the user agrees:
+ *
+ *   findLegacySiblings()  — read-only scan; returns the entries that WOULD
+ *                           be pruned. Safe to call before confirmation.
+ *   pruneLegacySiblings() — mutating apply step; rewrites settings.json.
+ *                           Call only after the user has confirmed.
+ *
+ * Both helpers are fail-soft (missing file / invalid JSON / non-object /
+ * unwritable → empty result), idempotent, and have no plugin API
+ * dependency.
+ *
+ * Background: 0.13.x → 1.0.0 upgraders may have both nicobailon's
+ * pi-subagents and @tintinweb/pi-subagents in settings.json simultaneously,
+ * which makes Pi reject boot with duplicate-tool registration when both
+ * load. The prune is the upgrade's must-do mutation, but it must not run
+ * before the user has consented to /rpiv-setup mutating settings.json.
+ */
+
+import { existsSync, readFileSync, writeFileSync } from "node:fs";
+import { homedir } from "node:os";
+import { join } from "node:path";
+import { LEGACY_SIBLINGS } from "./siblings.js";
+
+const PI_AGENT_SETTINGS = join(homedir(), ".pi", "agent", "settings.json");
+
+export interface PruneLegacySiblingsResult {
+	/** settings.json `packages[]` entries that were removed (empty = no-op). */
+	pruned: string[];
+}
+
+interface ParsedSettings {
+	settings: Record<string, unknown>;
+	packages: unknown[];
+}
+
+function readSettings(): ParsedSettings | undefined {
+	if (!existsSync(PI_AGENT_SETTINGS)) return undefined;
+	let parsed: unknown;
+	try {
+		parsed = JSON.parse(readFileSync(PI_AGENT_SETTINGS, "utf-8"));
+	} catch {
+		return undefined;
+	}
+	if (!parsed || typeof parsed !== "object" || Array.isArray(parsed)) return undefined;
+	const settings = parsed as Record<string, unknown>;
+	if (!Array.isArray(settings.packages)) return undefined;
+	return { settings, packages: settings.packages as unknown[] };
+}
+
+function partitionPackages(packages: unknown[]): { legacy: string[]; kept: unknown[] } {
+	const legacy: string[] = [];
+	const kept = packages.filter((entry) => {
+		if (typeof entry !== "string") return true;
+		const isLegacy = LEGACY_SIBLINGS.some((l) => l.matches.test(entry));
+		if (isLegacy) legacy.push(entry);
+		return !isLegacy;
+	});
+	return { legacy, kept };
+}
+
+/**
+ * Read-only scan: returns the legacy entries that pruneLegacySiblings()
+ * would remove. Does not touch the filesystem beyond reading settings.json.
+ * Safe to call before any user confirmation.
+ */
+export function findLegacySiblings(): string[] {
+	const parsed = readSettings();
+	if (!parsed) return [];
+	return partitionPackages(parsed.packages).legacy;
+}
+
+/**
+ * Mutating apply step: rewrites settings.json with legacy entries removed.
+ * Returns a structured report so callers can emit a conditional notify.
+ * Never throws. Call AFTER the user has confirmed the cleanup.
+ */
+export function pruneLegacySiblings(): PruneLegacySiblingsResult {
+	const parsed = readSettings();
+	if (!parsed) return { pruned: [] };
+	const { legacy, kept } = partitionPackages(parsed.packages);
+	if (legacy.length === 0) return { pruned: [] };
+
+	parsed.settings.packages = kept;
+	try {
+		writeFileSync(PI_AGENT_SETTINGS, `${JSON.stringify(parsed.settings, null, 2)}\n`, "utf-8");
+	} catch {
+		return { pruned: [] };
+	}
+	return { pruned: legacy };
+}
--- a/extensions/rpiv-pi/extensions/rpiv-core/session-hooks.test.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/session-hooks.test.ts
@@ -0,0 +1,216 @@
+import { existsSync, mkdtempSync, rmSync } from "node:fs";
+import { tmpdir } from "node:os";
+import { join } from "node:path";
+import { createMockCtx, createMockPi, stubGitExec } from "@juicesharp/rpiv-test-utils";
+import { afterEach, beforeEach, describe, expect, it, vi } from "vitest";
+
+vi.mock("./package-checks.js", () => ({ findMissingSiblings: vi.fn(() => []) }));
+vi.mock("./agents.js", async (importOriginal) => {
+	const actual = await importOriginal<typeof import("./agents.js")>();
+	return {
+		...actual,
+		syncBundledAgents: vi.fn(() => ({
+			added: [],
+			updated: [],
+			unchanged: [],
+			removed: [],
+			pendingUpdate: [],
+			pendingRemove: [],
+			errors: [],
+		})),
+	};
+});
+
+import type { SyncResult } from "./agents.js";
+import { syncBundledAgents } from "./agents.js";
+import { clearGitContextCache, getGitContext, resetInjectedMarker, takeGitContextIfChanged } from "./git-context.js";
+import { clearInjectionState } from "./guidance.js";
+import { findMissingSiblings } from "./package-checks.js";
+import { registerSessionHooks } from "./session-hooks.js";
+
+const emptySync: SyncResult = {
+	added: [],
+	updated: [],
+	unchanged: [],
+	removed: [],
+	pendingUpdate: [],
+	pendingRemove: [],
+	errors: [],
+};
+
+let projectDir: string;
+
+beforeEach(() => {
+	projectDir = mkdtempSync(join(tmpdir(), "rpiv-session-"));
+	clearInjectionState();
+	clearGitContextCache();
+	resetInjectedMarker();
+});
+afterEach(() => {
+	rmSync(projectDir, { recursive: true, force: true });
+});
+
+describe("registerSessionHooks — event wiring", () => {
+	it("registers 5 events", () => {
+		const { pi, captured } = createMockPi();
+		registerSessionHooks(pi);
+		for (const ev of ["session_start", "session_compact", "session_shutdown", "tool_call", "before_agent_start"]) {
+			expect(captured.events.has(ev)).toBe(true);
+		}
+	});
+});
+
+describe("session_start hook", () => {
+	it("scaffolds thoughts dirs under ctx.cwd", async () => {
+		const { pi, captured } = createMockPi({ exec: stubGitExec({}) as never });
+		registerSessionHooks(pi);
+		const handler = captured.events.get("session_start")?.[0];
+		const ctx = createMockCtx({ cwd: projectDir, hasUI: true });
+		await handler?.({ reason: "startup" } as never, ctx as never);
+		for (const d of [
+			"thoughts/shared/discover",
+			"thoughts/shared/research",
+			"thoughts/shared/designs",
+			"thoughts/shared/plans",
+			"thoughts/shared/handoffs",
+			"thoughts/shared/reviews",
+		]) {
+			expect(existsSync(join(projectDir, d))).toBe(true);
+		}
+	});
+});
+
+describe("session_start hook — notifications", () => {
+	it("emits 'Copied N agents' info when added > 0", async () => {
+		vi.mocked(syncBundledAgents).mockReturnValueOnce({ ...emptySync, added: ["a.md", "b.md"] });
+		vi.mocked(findMissingSiblings).mockReturnValueOnce([]);
+		const { pi, captured } = createMockPi({ exec: stubGitExec({}) as never });
+		registerSessionHooks(pi);
+		const ctx = createMockCtx({ cwd: projectDir, hasUI: true });
+		await captured.events.get("session_start")?.[0]({ reason: "startup" } as never, ctx as never);
+		expect(ctx.ui.notify).toHaveBeenCalledWith(expect.stringMatching(/Copied 2 rpiv-pi agent/), "info");
+	});
+
+	it("emits a single drift line combining pendingUpdate + pendingRemove", async () => {
+		vi.mocked(syncBundledAgents).mockReturnValueOnce({
+			...emptySync,
+			pendingUpdate: ["a.md"],
+			pendingRemove: ["b.md", "c.md"],
+		});
+		vi.mocked(findMissingSiblings).mockReturnValueOnce([]);
+		const { pi, captured } = createMockPi({ exec: stubGitExec({}) as never });
+		registerSessionHooks(pi);
+		const ctx = createMockCtx({ cwd: projectDir, hasUI: true });
+		await captured.events.get("session_start")?.[0]({ reason: "startup" } as never, ctx as never);
+		const driftCall = (ctx.ui.notify as ReturnType<typeof vi.fn>).mock.calls.find(
+			(c) => typeof c[0] === "string" && c[0].includes("outdated"),
+		);
+		expect(driftCall).toBeDefined();
+		expect(driftCall?.[0]).toContain("1 outdated");
+		expect(driftCall?.[0]).toContain("2 removed from bundle");
+		expect(driftCall?.[1]).toBe("info");
+	});
+
+	it("warns about missing siblings with npm: prefix stripped", async () => {
+		vi.mocked(syncBundledAgents).mockReturnValueOnce(emptySync);
+		vi.mocked(findMissingSiblings).mockReturnValueOnce([
+			{ pkg: "npm:@juicesharp/rpiv-advisor", matches: /./, provides: "x" },
+			{ pkg: "npm:@juicesharp/rpiv-args", matches: /./, provides: "y" },
+		] as never);
+		const { pi, captured } = createMockPi({ exec: stubGitExec({}) as never });
+		registerSessionHooks(pi);
+		const ctx = createMockCtx({ cwd: projectDir, hasUI: true });
+		await captured.events.get("session_start")?.[0]({ reason: "startup" } as never, ctx as never);
+		const warnCall = (ctx.ui.notify as ReturnType<typeof vi.fn>).mock.calls.find((c) => c[1] === "warning");
+		expect(warnCall).toBeDefined();
+		expect(warnCall?.[0]).toContain("rpiv-pi requires 2 sibling");
+		expect(warnCall?.[0]).toContain("@juicesharp/rpiv-advisor");
+		expect(warnCall?.[0]).toContain("@juicesharp/rpiv-args");
+		expect(warnCall?.[0]).not.toContain("npm:");
+	});
+
+	it("skips notifications when !hasUI", async () => {
+		vi.mocked(syncBundledAgents).mockReturnValueOnce({ ...emptySync, added: ["a.md"] });
+		vi.mocked(findMissingSiblings).mockReturnValueOnce([
+			{ pkg: "npm:@juicesharp/rpiv-todo", matches: /./, provides: "t" },
+		] as never);
+		const { pi, captured } = createMockPi({ exec: stubGitExec({}) as never });
+		registerSessionHooks(pi);
+		const ctx = createMockCtx({ cwd: projectDir, hasUI: false });
+		await captured.events.get("session_start")?.[0]({ reason: "startup" } as never, ctx as never);
+		expect(ctx.ui.notify).not.toHaveBeenCalled();
+	});
+});
+
+describe("session_compact hook", () => {
+	it("re-injects guidance + git-context after compaction (clears caches first)", async () => {
+		const exec = stubGitExec({ branch: "main", commit: "abc", user: "alice" });
+		const { pi, captured } = createMockPi({ exec: exec as never });
+		registerSessionHooks(pi);
+		// Prime the git-context cache first via session_start so compact's clear has work to do.
+		await captured.events.get("session_start")?.[0](
+			{ reason: "startup" } as never,
+			createMockCtx({ cwd: projectDir, hasUI: false }) as never,
+		);
+		const sendBefore = (pi.sendMessage as ReturnType<typeof vi.fn>).mock.calls.length;
+		await captured.events.get("session_compact")?.[0]({} as never, createMockCtx({ cwd: projectDir }) as never);
+		// After compact, the next pi.sendMessage call (from injectGitContext) should fire because
+		// resetInjectedMarker + clearGitContextCache make takeGitContextIfChanged re-emit.
+		const sendAfter = (pi.sendMessage as ReturnType<typeof vi.fn>).mock.calls.length;
+		expect(sendAfter).toBeGreaterThan(sendBefore);
+	});
+});
+
+describe("session_shutdown hook", () => {
+	it("clears git-context cache and allows takeGitContextIfChanged to re-emit", async () => {
+		const exec = stubGitExec({ branch: "main", commit: "abc", user: "alice" });
+		const { pi, captured } = createMockPi({ exec: exec as never });
+		registerSessionHooks(pi);
+		await takeGitContextIfChanged(pi);
+		const callsBefore = exec.mock.calls.length;
+		await captured.events.get("session_shutdown")?.[0]({} as never, createMockCtx() as never);
+		const reemit = await takeGitContextIfChanged(pi);
+		expect(reemit).not.toBeNull();
+		expect(exec.mock.calls.length).toBeGreaterThan(callsBefore);
+	});
+});
+
+describe("tool_call hook", () => {
+	it("clears git-context cache on mutating bash command", async () => {
+		const exec = stubGitExec({ branch: "main", commit: "a", user: "u" });
+		const { pi, captured } = createMockPi({ exec: exec as never });
+		registerSessionHooks(pi);
+		const handler = captured.events.get("tool_call")?.[0];
+		const ctx = createMockCtx({ cwd: projectDir });
+		await getGitContext(pi);
+		const before = exec.mock.calls.length;
+		await handler?.({ toolName: "bash", input: { command: "git commit -m x" } } as never, ctx as never);
+		await getGitContext(pi);
+		expect(exec.mock.calls.length).toBeGreaterThan(before);
+	});
+});
+
+describe("before_agent_start hook", () => {
+	it("returns {message} on changed git sig", async () => {
+		const { pi, captured } = createMockPi({
+			exec: stubGitExec({ branch: "main", commit: "abc", user: "alice" }) as never,
+		});
+		registerSessionHooks(pi);
+		const handler = captured.events.get("before_agent_start")?.[0];
+		const ctx = createMockCtx({ cwd: projectDir });
+		const r = await handler?.({} as never, ctx as never);
+		expect(r).toHaveProperty("message");
+	});
+
+	it("returns undefined on dedup (signature unchanged)", async () => {
+		const { pi, captured } = createMockPi({
+			exec: stubGitExec({ branch: "main", commit: "abc", user: "alice" }) as never,
+		});
+		registerSessionHooks(pi);
+		const handler = captured.events.get("before_agent_start")?.[0];
+		const ctx = createMockCtx({ cwd: projectDir });
+		await handler?.({} as never, ctx as never);
+		const second = await handler?.({} as never, ctx as never);
+		expect(second).toBeUndefined();
+	});
+});
--- a/extensions/rpiv-pi/extensions/rpiv-core/session-hooks.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/session-hooks.ts
@@ -0,0 +1,114 @@
+/**
+ * Session lifecycle wiring for rpiv-core.
+ *
+ * Each handler body is a named helper; pi.on(...) lines are pure wiring.
+ * Ordering and invariants preserved verbatim from the pre-refactor index.ts.
+ */
+
+import { mkdirSync } from "node:fs";
+import { join } from "node:path";
+import { type ExtensionAPI, isToolCallEventType } from "@mariozechner/pi-coding-agent";
+import { type SyncResult, syncBundledAgents } from "./agents.js";
+import { FLAG_DEBUG, MSG_TYPE_GIT_CONTEXT } from "./constants.js";
+import {
+	clearGitContextCache,
+	isGitMutatingCommand,
+	resetInjectedMarker,
+	takeGitContextIfChanged,
+} from "./git-context.js";
+import { clearInjectionState, handleToolCallGuidance, injectRootGuidance } from "./guidance.js";
+import { findMissingSiblings } from "./package-checks.js";
+
+const THOUGHTS_DIRS = [
+	"thoughts/shared/discover",
+	"thoughts/shared/research",
+	"thoughts/shared/designs",
+	"thoughts/shared/plans",
+	"thoughts/shared/handoffs",
+	"thoughts/shared/reviews",
+] as const;
+
+const msgAgentsAdded = (n: number) => `Copied ${n} rpiv-pi agent(s) to .pi/agents/`;
+const msgAgentsDrift = (parts: string[]) => `${parts.join(", ")} agent(s). Run /rpiv-update-agents to sync.`;
+const msgMissingSiblings = (n: number, list: string) =>
+	`rpiv-pi requires ${n} sibling extension(s): ${list}. Run /rpiv-setup to install them.`;
+
+type UI = { notify: (msg: string, sev: "info" | "warning" | "error") => void };
+
+export function registerSessionHooks(pi: ExtensionAPI): void {
+	pi.on("session_start", async (_event, ctx) => {
+		resetInjectionState();
+		injectRootGuidance(ctx.cwd, pi);
+		scaffoldThoughtsDirs(ctx.cwd);
+		await injectGitContext(pi, (msg) =>
+			pi.sendMessage({ customType: MSG_TYPE_GIT_CONTEXT, content: msg, display: !!pi.getFlag(FLAG_DEBUG) }),
+		);
+		const agents = syncBundledAgents(ctx.cwd, false);
+		if (ctx.hasUI) {
+			notifyAgentSyncDrift(ctx.ui, agents);
+			warnMissingSiblings(ctx.ui);
+		}
+	});
+
+	pi.on("session_compact", async (_event, ctx) => {
+		resetInjectionState();
+		clearGitContextCache();
+		resetInjectedMarker();
+		injectRootGuidance(ctx.cwd, pi);
+		await injectGitContext(pi, (msg) =>
+			pi.sendMessage({ customType: MSG_TYPE_GIT_CONTEXT, content: msg, display: !!pi.getFlag(FLAG_DEBUG) }),
+		);
+	});
+
+	pi.on("session_shutdown", async () => {
+		resetInjectionState();
+		clearGitContextCache();
+		resetInjectedMarker();
+	});
+
+	pi.on("tool_call", async (event, ctx) => {
+		handleToolCallGuidance(event, ctx, pi);
+		if (isToolCallEventType("bash", event) && isGitMutatingCommand(event.input.command)) {
+			clearGitContextCache();
+		}
+	});
+
+	pi.on("before_agent_start", async () => {
+		const content = await takeGitContextIfChanged(pi);
+		if (!content) return;
+		return { message: { customType: MSG_TYPE_GIT_CONTEXT, content, display: !!pi.getFlag(FLAG_DEBUG) } };
+	});
+}
+
+function resetInjectionState(): void {
+	clearInjectionState();
+}
+
+function scaffoldThoughtsDirs(cwd: string): void {
+	for (const dir of THOUGHTS_DIRS) {
+		mkdirSync(join(cwd, dir), { recursive: true });
+	}
+}
+
+async function injectGitContext(pi: ExtensionAPI, send: (msg: string) => void): Promise<void> {
+	const msg = await takeGitContextIfChanged(pi);
+	if (msg) send(msg);
+}
+
+function notifyAgentSyncDrift(ui: UI, result: SyncResult): void {
+	if (result.added.length > 0) {
+		ui.notify(msgAgentsAdded(result.added.length), "info");
+	}
+	const parts: string[] = [];
+	if (result.pendingUpdate.length > 0) parts.push(`${result.pendingUpdate.length} outdated`);
+	if (result.pendingRemove.length > 0) parts.push(`${result.pendingRemove.length} removed from bundle`);
+	if (parts.length > 0) {
+		ui.notify(msgAgentsDrift(parts), "info");
+	}
+}
+
+function warnMissingSiblings(ui: UI): void {
+	const missing = findMissingSiblings();
+	if (missing.length === 0) return;
+	ui.notify(msgMissingSiblings(missing.length, missing.map((m) => m.pkg.replace(/^npm:/, "")).join(", ")), "warning");
+}
--- a/extensions/rpiv-pi/extensions/rpiv-core/setup-command.test.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/setup-command.test.ts
@@ -0,0 +1,200 @@
+import { createMockCtx, createMockPi } from "@juicesharp/rpiv-test-utils";
+import { beforeEach, describe, expect, it, vi } from "vitest";
+
+vi.mock("./pi-installer.js", () => ({ spawnPiInstall: vi.fn() }));
+vi.mock("./package-checks.js", () => ({ findMissingSiblings: vi.fn() }));
+vi.mock("./prune-legacy-siblings.js", () => ({
+	findLegacySiblings: vi.fn(),
+	pruneLegacySiblings: vi.fn(),
+}));
+
+import { findMissingSiblings } from "./package-checks.js";
+import { spawnPiInstall } from "./pi-installer.js";
+import { findLegacySiblings, pruneLegacySiblings } from "./prune-legacy-siblings.js";
+import { registerSetupCommand } from "./setup-command.js";
+
+beforeEach(() => {
+	vi.mocked(spawnPiInstall).mockReset();
+	vi.mocked(findMissingSiblings).mockReset();
+	vi.mocked(findLegacySiblings).mockReset();
+	vi.mocked(findLegacySiblings).mockReturnValue([]);
+	vi.mocked(pruneLegacySiblings).mockReset();
+	vi.mocked(pruneLegacySiblings).mockReturnValue({ pruned: [] });
+});
+
+describe("/rpiv-setup — command shape", () => {
+	it("registers under 'rpiv-setup'", () => {
+		const { pi, captured } = createMockPi();
+		registerSetupCommand(pi);
+		expect(captured.commands.has("rpiv-setup")).toBe(true);
+	});
+});
+
+describe("/rpiv-setup — !hasUI", () => {
+	it("notifies error and exits without inspecting siblings or settings", async () => {
+		const { pi, captured } = createMockPi();
+		registerSetupCommand(pi);
+		const ctx = createMockCtx({ hasUI: false });
+		await captured.commands.get("rpiv-setup")?.handler("", ctx as never);
+		expect(ctx.ui.notify).toHaveBeenCalledWith(expect.stringContaining("interactive"), "error");
+		expect(findMissingSiblings).not.toHaveBeenCalled();
+		expect(findLegacySiblings).not.toHaveBeenCalled();
+		expect(pruneLegacySiblings).not.toHaveBeenCalled();
+		expect(spawnPiInstall).not.toHaveBeenCalled();
+	});
+});
+
+describe("/rpiv-setup — nothing to do", () => {
+	it("notifies all-installed and skips confirmation when no missing siblings AND no legacy entries", async () => {
+		vi.mocked(findMissingSiblings).mockReturnValue([]);
+		vi.mocked(findLegacySiblings).mockReturnValue([]);
+		const { pi, captured } = createMockPi();
+		registerSetupCommand(pi);
+		const ctx = createMockCtx({ hasUI: true });
+		await captured.commands.get("rpiv-setup")?.handler("", ctx as never);
+		expect(ctx.ui.notify).toHaveBeenCalledWith(expect.stringContaining("already installed"), "info");
+		expect(ctx.ui.confirm).not.toHaveBeenCalled();
+		expect(pruneLegacySiblings).not.toHaveBeenCalled();
+	});
+});
+
+describe("/rpiv-setup — pre-confirm read-only contract", () => {
+	it("does NOT call pruneLegacySiblings before user confirmation", async () => {
+		vi.mocked(findMissingSiblings).mockReturnValue([]);
+		vi.mocked(findLegacySiblings).mockReturnValue(["npm:pi-subagents"]);
+		const { pi, captured } = createMockPi();
+		registerSetupCommand(pi);
+		const ctx = createMockCtx({ hasUI: true });
+		(ctx.ui.confirm as ReturnType<typeof vi.fn>).mockImplementation(async () => {
+			expect(pruneLegacySiblings).not.toHaveBeenCalled();
+			return false;
+		});
+		await captured.commands.get("rpiv-setup")?.handler("", ctx as never);
+		expect(ctx.ui.confirm).toHaveBeenCalledTimes(1);
+	});
+
+	it("includes legacy entries in the confirmation body so the user sees what will be removed", async () => {
+		vi.mocked(findMissingSiblings).mockReturnValue([]);
+		vi.mocked(findLegacySiblings).mockReturnValue(["npm:pi-subagents"]);
+		const { pi, captured } = createMockPi();
+		registerSetupCommand(pi);
+		const ctx = createMockCtx({ hasUI: true });
+		await captured.commands.get("rpiv-setup")?.handler("", ctx as never);
+		const confirmCall = (ctx.ui.confirm as ReturnType<typeof vi.fn>).mock.calls[0]!;
+		expect(confirmCall[1]).toContain("Remove from");
+		expect(confirmCall[1]).toContain("npm:pi-subagents");
+	});
+
+	it("includes pending installs in the confirmation body", async () => {
+		vi.mocked(findMissingSiblings).mockReturnValue([{ pkg: "npm:@x/a", matches: /./, provides: "A" }]);
+		vi.mocked(findLegacySiblings).mockReturnValue([]);
+		const { pi, captured } = createMockPi();
+		registerSetupCommand(pi);
+		const ctx = createMockCtx({ hasUI: true });
+		await captured.commands.get("rpiv-setup")?.handler("", ctx as never);
+		const confirmCall = (ctx.ui.confirm as ReturnType<typeof vi.fn>).mock.calls[0]!;
+		expect(confirmCall[1]).toContain("Install via `pi install`:");
+		expect(confirmCall[1]).toContain("npm:@x/a");
+	});
+});
+
+describe("/rpiv-setup — user cancels", () => {
+	it("notifies cancelled and skips both prune and install", async () => {
+		vi.mocked(findMissingSiblings).mockReturnValue([{ pkg: "npm:@x/y", matches: /./, provides: "p" }]);
+		vi.mocked(findLegacySiblings).mockReturnValue(["npm:pi-subagents"]);
+		const { pi, captured } = createMockPi();
+		registerSetupCommand(pi);
+		const ctx = createMockCtx({ hasUI: true });
+		(ctx.ui.confirm as ReturnType<typeof vi.fn>).mockResolvedValueOnce(false);
+		await captured.commands.get("rpiv-setup")?.handler("", ctx as never);
+		expect(ctx.ui.notify).toHaveBeenCalledWith(expect.stringContaining("cancelled"), "info");
+		expect(pruneLegacySiblings).not.toHaveBeenCalled();
+		expect(spawnPiInstall).not.toHaveBeenCalled();
+	});
+});
+
+describe("/rpiv-setup — post-confirm prune execution", () => {
+	it("runs pruneLegacySiblings after confirm and emits notify when entries removed", async () => {
+		vi.mocked(findMissingSiblings).mockReturnValue([]);
+		vi.mocked(findLegacySiblings).mockReturnValue(["npm:pi-subagents"]);
+		vi.mocked(pruneLegacySiblings).mockReturnValue({ pruned: ["npm:pi-subagents"] });
+		const { pi, captured } = createMockPi();
+		registerSetupCommand(pi);
+		const ctx = createMockCtx({ hasUI: true });
+		await captured.commands.get("rpiv-setup")?.handler("", ctx as never);
+		expect(pruneLegacySiblings).toHaveBeenCalledTimes(1);
+		const pruneNotify = (ctx.ui.notify as ReturnType<typeof vi.fn>).mock.calls.find(
+			(c) => typeof c[0] === "string" && c[0].startsWith("Removed legacy subagent library"),
+		);
+		expect(pruneNotify).toBeDefined();
+		expect(pruneNotify?.[0]).toContain("npm:pi-subagents");
+	});
+
+	it("skips pruneLegacySiblings when no legacy entries were detected pre-confirm", async () => {
+		vi.mocked(findMissingSiblings).mockReturnValue([{ pkg: "npm:@x/y", matches: /./, provides: "p" }]);
+		vi.mocked(findLegacySiblings).mockReturnValue([]);
+		const { pi, captured } = createMockPi();
+		registerSetupCommand(pi);
+		const ctx = createMockCtx({ hasUI: true });
+		await captured.commands.get("rpiv-setup")?.handler("", ctx as never);
+		expect(pruneLegacySiblings).not.toHaveBeenCalled();
+	});
+});
+
+describe("/rpiv-setup — mixed success/failure report", () => {
+	it("reports succeeded + failed with 300-char stderr snippets", async () => {
+		vi.mocked(findMissingSiblings).mockReturnValue([
+			{ pkg: "npm:@x/a", matches: /./, provides: "A" },
+			{ pkg: "npm:@x/b", matches: /./, provides: "B" },
+		]);
+		vi.mocked(spawnPiInstall)
+			.mockResolvedValueOnce({ code: 0, stdout: "ok", stderr: "" })
+			.mockResolvedValueOnce({ code: 1, stdout: "", stderr: "x".repeat(500) });
+		const { pi, captured } = createMockPi();
+		registerSetupCommand(pi);
+		const ctx = createMockCtx({ hasUI: true });
+		await captured.commands.get("rpiv-setup")?.handler("", ctx as never);
+		const reportCall = (ctx.ui.notify as ReturnType<typeof vi.fn>).mock.calls.at(-1);
+		const report: string = reportCall![0];
+		expect(report).toContain("npm:@x/a");
+		expect(report).toContain("npm:@x/b");
+		expect((report.match(/x+/g) ?? []).every((m) => m.length <= 300)).toBe(true);
+		expect(reportCall![1]).toBe("warning");
+	});
+
+	it("uses stdout fallback when stderr empty", async () => {
+		vi.mocked(findMissingSiblings).mockReturnValue([{ pkg: "npm:@x/a", matches: /./, provides: "A" }]);
+		vi.mocked(spawnPiInstall).mockResolvedValueOnce({ code: 1, stdout: "stdout-error", stderr: "" });
+		const { pi, captured } = createMockPi();
+		registerSetupCommand(pi);
+		const ctx = createMockCtx({ hasUI: true });
+		await captured.commands.get("rpiv-setup")?.handler("", ctx as never);
+		const report = (ctx.ui.notify as ReturnType<typeof vi.fn>).mock.calls.at(-1)![0];
+		expect(report).toContain("stdout-error");
+	});
+
+	it("all-failed report omits Restart line", async () => {
+		vi.mocked(findMissingSiblings).mockReturnValue([{ pkg: "npm:@x/a", matches: /./, provides: "A" }]);
+		vi.mocked(spawnPiInstall).mockResolvedValueOnce({ code: 1, stdout: "", stderr: "err" });
+		const { pi, captured } = createMockPi();
+		registerSetupCommand(pi);
+		const ctx = createMockCtx({ hasUI: true });
+		await captured.commands.get("rpiv-setup")?.handler("", ctx as never);
+		const report = (ctx.ui.notify as ReturnType<typeof vi.fn>).mock.calls.at(-1)![0];
+		expect(report).not.toContain("Restart");
+	});
+});
+
+describe("/rpiv-setup — prune-only flow (no missing siblings)", () => {
+	it("skips installMissing when only legacy entries exist", async () => {
+		vi.mocked(findMissingSiblings).mockReturnValue([]);
+		vi.mocked(findLegacySiblings).mockReturnValue(["npm:pi-subagents"]);
+		vi.mocked(pruneLegacySiblings).mockReturnValue({ pruned: ["npm:pi-subagents"] });
+		const { pi, captured } = createMockPi();
+		registerSetupCommand(pi);
+		const ctx = createMockCtx({ hasUI: true });
+		await captured.commands.get("rpiv-setup")?.handler("", ctx as never);
+		expect(pruneLegacySiblings).toHaveBeenCalledTimes(1);
+		expect(spawnPiInstall).not.toHaveBeenCalled();
+	});
+});
--- a/extensions/rpiv-pi/extensions/rpiv-core/setup-command.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/setup-command.ts
@@ -0,0 +1,128 @@
+/**
+ * /rpiv-setup — installs any SIBLINGS not present in ~/.pi/agent/settings.json
+ * and prunes deprecated entries (e.g. the unscoped `npm:pi-subagents` from
+ * the rpiv-pi 0.12.x → 0.14.0 line). Both mutations are previewed in the
+ * confirmation dialog and only executed after the user agrees.
+ *
+ * Serial `pi install <pkg>` loop via spawnPiInstall (Windows-safe).
+ * Reports succeeded/failed split; prompts the user to restart Pi on success.
+ */
+
+import type { ExtensionAPI } from "@mariozechner/pi-coding-agent";
+import { findMissingSiblings } from "./package-checks.js";
+import { spawnPiInstall } from "./pi-installer.js";
+import { findLegacySiblings, pruneLegacySiblings } from "./prune-legacy-siblings.js";
+import type { SiblingPlugin } from "./siblings.js";
+
+const INSTALL_TIMEOUT_MS = 120_000;
+const STDERR_SNIPPET_CHARS = 300;
+
+const MSG_INTERACTIVE_ONLY = "/rpiv-setup requires interactive mode";
+const MSG_NOTHING_TO_DO = "All rpiv-pi sibling dependencies already installed.";
+const MSG_CANCELLED = "/rpiv-setup cancelled";
+const MSG_CONFIRM_TITLE = "Apply rpiv-pi setup changes?";
+const MSG_RESTART = "Restart your Pi session to load the newly-installed extensions.";
+
+const msgInstalling = (pkg: string) => `Installing ${pkg}…`;
+const msgInstalledLine = (pkgs: string[]) => `✓ Installed: ${pkgs.join(", ")}`;
+const msgFailedHeader = () => `✗ Failed:`;
+const msgFailedLine = (pkg: string, err: string) => `  ${pkg}: ${err}`;
+const msgLegacyPruned = (entries: string[]) =>
+	`Removed legacy subagent library from settings.json: ${entries.join(", ")}. Run \`pi uninstall\` to free disk space, then restart Pi.`;
+
+type UI = {
+	notify: (msg: string, sev: "info" | "warning" | "error") => void;
+	confirm: (title: string, body: string) => Promise<boolean>;
+};
+
+function buildConfirmBody(missing: SiblingPlugin[], legacyEntries: string[]): string {
+	const lines: string[] = ["rpiv-pi will apply the following changes:", ""];
+	if (missing.length > 0) {
+		lines.push("Install via `pi install`:");
+		for (const m of missing) lines.push(`  • ${m.pkg}  (required — provides ${m.provides})`);
+		lines.push("");
+	}
+	if (legacyEntries.length > 0) {
+		lines.push("Remove from `~/.pi/agent/settings.json` (deprecated):");
+		for (const entry of legacyEntries) lines.push(`  • ${entry}`);
+		lines.push("");
+	}
+	lines.push("Your `~/.pi/agent/settings.json` will be updated. Proceed?");
+	return lines.join("\n");
+}
+
+export function registerSetupCommand(pi: ExtensionAPI): void {
+	pi.registerCommand("rpiv-setup", {
+		description: "Install rpiv-pi's sibling extension plugins",
+		handler: async (_args, ctx) => {
+			if (!ctx.hasUI) {
+				ctx.ui.notify(MSG_INTERACTIVE_ONLY, "error");
+				return;
+			}
+
+			const missing = findMissingSiblings();
+			const legacyEntries = findLegacySiblings();
+			if (missing.length === 0 && legacyEntries.length === 0) {
+				ctx.ui.notify(MSG_NOTHING_TO_DO, "info");
+				return;
+			}
+
+			const confirmed = await ctx.ui.confirm(MSG_CONFIRM_TITLE, buildConfirmBody(missing, legacyEntries));
+			if (!confirmed) {
+				ctx.ui.notify(MSG_CANCELLED, "info");
+				return;
+			}
+
+			if (legacyEntries.length > 0) {
+				const prune = pruneLegacySiblings();
+				if (prune.pruned.length > 0) {
+					ctx.ui.notify(msgLegacyPruned(prune.pruned), "info");
+				}
+			}
+
+			if (missing.length === 0) return;
+
+			const { succeeded, failed } = await installMissing(ctx.ui, missing);
+			ctx.ui.notify(buildReport(succeeded, failed), failed.length > 0 ? "warning" : "info");
+		},
+	});
+}
+
+async function installMissing(
+	ui: UI,
+	missing: SiblingPlugin[],
+): Promise<{ succeeded: string[]; failed: Array<{ pkg: string; error: string }> }> {
+	const succeeded: string[] = [];
+	const failed: Array<{ pkg: string; error: string }> = [];
+	for (const { pkg } of missing) {
+		ui.notify(msgInstalling(pkg), "info");
+		try {
+			const result = await spawnPiInstall(pkg, INSTALL_TIMEOUT_MS);
+			if (result.code === 0) {
+				succeeded.push(pkg);
+			} else {
+				failed.push({
+					pkg,
+					error: (result.stderr || result.stdout || `exit ${result.code}`).trim().slice(0, STDERR_SNIPPET_CHARS),
+				});
+			}
+		} catch (err) {
+			failed.push({ pkg, error: err instanceof Error ? err.message : String(err) });
+		}
+	}
+	return { succeeded, failed };
+}
+
+function buildReport(succeeded: string[], failed: Array<{ pkg: string; error: string }>): string {
+	const lines: string[] = [];
+	if (succeeded.length > 0) lines.push(msgInstalledLine(succeeded));
+	if (failed.length > 0) {
+		lines.push(msgFailedHeader());
+		for (const { pkg, error } of failed) lines.push(msgFailedLine(pkg, error));
+	}
+	if (succeeded.length > 0) {
+		lines.push("");
+		lines.push(MSG_RESTART);
+	}
+	return lines.join("\n");
+}
--- a/extensions/rpiv-pi/extensions/rpiv-core/siblings.test.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/siblings.test.ts
@@ -0,0 +1,64 @@
+import { describe, expect, it } from "vitest";
+import { LEGACY_SIBLINGS, SIBLINGS } from "./siblings.js";
+
+describe("SIBLINGS registry", () => {
+	it("contains 8 entries (pi-subagents at SIBLINGS[0] — tintinweb fork is the dispatch runtime)", () => {
+		expect(SIBLINGS).toHaveLength(8);
+	});
+
+	it("lists @tintinweb/pi-subagents at SIBLINGS[0]", () => {
+		expect(SIBLINGS[0]?.pkg).toBe("npm:@tintinweb/pi-subagents");
+	});
+
+	it("does NOT list nicobailon's unscoped pi-subagents (superseded in 0.14.0)", () => {
+		expect(SIBLINGS.find((s) => s.pkg === "npm:pi-subagents")).toBeUndefined();
+	});
+
+	for (const s of SIBLINGS) {
+		it(`${s.pkg} — self-match against settings.json line shape`, () => {
+			expect(s.matches.test(s.pkg.replace(/^npm:/, ""))).toBe(true);
+		});
+		it(`${s.pkg} — case-insensitive match`, () => {
+			expect(s.matches.test(s.pkg.toUpperCase().replace(/^NPM:/, ""))).toBe(true);
+		});
+	}
+
+	it("rpiv-args does NOT match rpiv-args-extended (word boundary)", () => {
+		const argsEntry = SIBLINGS.find((s) => s.pkg.endsWith("/rpiv-args"));
+		expect(argsEntry).toBeDefined();
+		expect(argsEntry?.matches.test("@juicesharp/rpiv-args-extended")).toBe(false);
+	});
+
+	it("rpiv-i18n does NOT match rpiv-i18n-utils (word boundary)", () => {
+		const i18nEntry = SIBLINGS.find((s) => s.pkg.endsWith("/rpiv-i18n"));
+		expect(i18nEntry).toBeDefined();
+		expect(i18nEntry?.matches.test("@juicesharp/rpiv-i18n-utils")).toBe(false);
+		expect(i18nEntry?.matches.test("@juicesharp/rpiv-i18n")).toBe(true);
+	});
+
+	it("every entry has non-empty pkg + provides", () => {
+		for (const s of SIBLINGS) {
+			expect(s.pkg.length).toBeGreaterThan(0);
+			expect(s.provides.length).toBeGreaterThan(0);
+		}
+	});
+});
+
+describe("LEGACY_SIBLINGS registry", () => {
+	it("lists nicobailon's pi-subagents for pruning (superseded by @tintinweb/pi-subagents in 0.14.0)", () => {
+		const entry = LEGACY_SIBLINGS.find((l) => l.label === "pi-subagents");
+		expect(entry).toBeDefined();
+		expect(entry?.matches.test("npm:pi-subagents")).toBe(true);
+		expect(entry?.matches.test("pi-subagents")).toBe(true);
+	});
+
+	it("pi-subagents legacy match does NOT catch @tintinweb/pi-subagents (active sibling)", () => {
+		const piSubagents = LEGACY_SIBLINGS.find((l) => l.label === "pi-subagents");
+		expect(piSubagents?.matches.test("@tintinweb/pi-subagents")).toBe(false);
+	});
+
+	it("pi-subagents legacy match does NOT catch pi-subagents-legacy (word boundary)", () => {
+		const piSubagents = LEGACY_SIBLINGS.find((l) => l.label === "pi-subagents");
+		expect(piSubagents?.matches.test("pi-subagents-legacy")).toBe(false);
+	});
+});
--- a/extensions/rpiv-pi/extensions/rpiv-core/siblings.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/siblings.ts
@@ -0,0 +1,90 @@
+/**
+ * Declarative registry of rpiv-pi's sibling Pi plugins.
+ *
+ * Single source of truth for: presence detection (package-checks.ts),
+ * session_start "missing plugins" warning (session-hooks.ts), and
+ * /rpiv-setup installer (setup-command.ts). Add a sibling here and every
+ * consumer picks it up automatically.
+ *
+ * Detection is filesystem-based via a regex over ~/.pi/agent/settings.json
+ * — no runtime import of sibling packages (keeps rpiv-core pure-orchestrator).
+ */
+
+export interface SiblingPlugin {
+	/** Install spec passed to `pi install`. Prefixed with `npm:` for Pi's installer. */
+	readonly pkg: string;
+	/** Case-insensitive regex that matches the package in ~/.pi/agent/settings.json. */
+	readonly matches: RegExp;
+	/** What the sibling provides — shown in /rpiv-setup confirmation and reports. */
+	readonly provides: string;
+}
+
+export const SIBLINGS: readonly SiblingPlugin[] = [
+	{
+		pkg: "npm:@tintinweb/pi-subagents",
+		matches: /@tintinweb\/pi-subagents/i,
+		provides: "Agent / get_subagent_result / steer_subagent tools",
+	},
+	{
+		pkg: "npm:@juicesharp/rpiv-ask-user-question",
+		matches: /rpiv-ask-user-question/i,
+		provides: "ask_user_question tool",
+	},
+	{
+		pkg: "npm:@juicesharp/rpiv-todo",
+		matches: /rpiv-todo/i,
+		provides: "todo tool + /todos command + overlay widget",
+	},
+	{
+		pkg: "npm:@juicesharp/rpiv-advisor",
+		matches: /rpiv-advisor/i,
+		provides: "advisor tool + /advisor command",
+	},
+	{
+		pkg: "npm:@juicesharp/rpiv-btw",
+		matches: /rpiv-btw/i,
+		provides: "/btw side-question command",
+	},
+	{
+		pkg: "npm:@juicesharp/rpiv-i18n",
+		matches: /rpiv-i18n(?![-\w])/i,
+		provides: "i18n SDK for Pi extensions — /languages command + --locale flag + registerStrings/scope/tr API",
+	},
+	{
+		pkg: "npm:@juicesharp/rpiv-web-tools",
+		matches: /rpiv-web-tools/i,
+		provides: "web_search + web_fetch tools + /web-search-config",
+	},
+	{
+		pkg: "npm:@juicesharp/rpiv-args",
+		matches: /rpiv-args(?![-\w])/i,
+		provides: "skill-argument resolver — substitutes $N/$ARGUMENTS in skill bodies",
+	},
+];
+
+/**
+ * Deprecated sibling packages that `/rpiv-setup` actively prunes from
+ * ~/.pi/agent/settings.json (so upgraders don't end up with superseded
+ * libraries loaded alongside their replacements). Single source of truth
+ * for `prune-legacy-siblings.ts`.
+ */
+export interface LegacyPackage {
+	/** Human-readable label used in the prune notify message. */
+	readonly label: string;
+	/** Case-insensitive regex matched against settings.json `packages[]` entries. */
+	readonly matches: RegExp;
+	/** Short reason — useful when debugging; not user-facing. */
+	readonly reason: string;
+}
+
+export const LEGACY_SIBLINGS: readonly LegacyPackage[] = [
+	{
+		// nicobailon's pi-subagents fork was the SIBLINGS[0] package between
+		// rpiv-pi 0.12.0 and 0.13.x. Reverted to @tintinweb/pi-subagents in
+		// rpiv-pi 1.0.0 once tintinweb resumed active maintenance and shipped
+		// 0.6.x against pi-coding-agent ^0.70.5.
+		label: "pi-subagents",
+		matches: /(^|[^\w/-])pi-subagents(?![-\w])/i,
+		reason: "superseded by @tintinweb/pi-subagents (resumed maintenance) in rpiv-pi 1.0.0",
+	},
+];
--- a/extensions/rpiv-pi/extensions/rpiv-core/update-agents-command.test.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/update-agents-command.test.ts
@@ -0,0 +1,74 @@
+import { createMockCtx, createMockPi } from "@juicesharp/rpiv-test-utils";
+import { beforeEach, describe, expect, it, vi } from "vitest";
+
+vi.mock("./agents.js", () => ({
+	syncBundledAgents: vi.fn(),
+}));
+
+import { syncBundledAgents } from "./agents.js";
+import { registerUpdateAgentsCommand } from "./update-agents-command.js";
+
+beforeEach(() => {
+	vi.mocked(syncBundledAgents).mockReset();
+});
+
+const empty = (overrides: Partial<ReturnType<typeof syncBundledAgents>> = {}) => ({
+	added: [],
+	updated: [],
+	unchanged: [],
+	removed: [],
+	pendingUpdate: [],
+	pendingRemove: [],
+	errors: [],
+	...overrides,
+});
+
+describe("/rpiv-update-agents", () => {
+	it("registers the command", () => {
+		const { pi, captured } = createMockPi();
+		registerUpdateAgentsCommand(pi);
+		expect(captured.commands.has("rpiv-update-agents")).toBe(true);
+	});
+
+	it("UP_TO_DATE when no changes, no errors", async () => {
+		vi.mocked(syncBundledAgents).mockReturnValue(empty());
+		const { pi, captured } = createMockPi();
+		registerUpdateAgentsCommand(pi);
+		const ctx = createMockCtx({ hasUI: true });
+		await captured.commands.get("rpiv-update-agents")?.handler("", ctx as never);
+		expect(ctx.ui.notify).toHaveBeenCalledWith(expect.stringContaining("up-to-date"), "info");
+	});
+
+	it("synced report when added+updated+removed > 0", async () => {
+		vi.mocked(syncBundledAgents).mockReturnValue(empty({ added: ["a.md"], updated: ["b.md"], removed: ["c.md"] }));
+		const { pi, captured } = createMockPi();
+		registerUpdateAgentsCommand(pi);
+		const ctx = createMockCtx({ hasUI: true });
+		await captured.commands.get("rpiv-update-agents")?.handler("", ctx as never);
+		const report = (ctx.ui.notify as ReturnType<typeof vi.fn>).mock.calls[0][0];
+		expect(report).toContain("1 added");
+		expect(report).toContain("1 updated");
+		expect(report).toContain("1 removed");
+	});
+
+	it("errors-only report uses 'warning' severity", async () => {
+		vi.mocked(syncBundledAgents).mockReturnValue(
+			empty({ errors: [{ op: "copy", message: "EACCES", file: "a.md" }] }),
+		);
+		const { pi, captured } = createMockPi();
+		registerUpdateAgentsCommand(pi);
+		const ctx = createMockCtx({ hasUI: true });
+		await captured.commands.get("rpiv-update-agents")?.handler("", ctx as never);
+		const [, severity] = (ctx.ui.notify as ReturnType<typeof vi.fn>).mock.calls[0];
+		expect(severity).toBe("warning");
+	});
+
+	it("stays silent when !hasUI", async () => {
+		vi.mocked(syncBundledAgents).mockReturnValue(empty({ added: ["x.md"] }));
+		const { pi, captured } = createMockPi();
+		registerUpdateAgentsCommand(pi);
+		const ctx = createMockCtx({ hasUI: false });
+		await captured.commands.get("rpiv-update-agents")?.handler("", ctx as never);
+		expect(ctx.ui.notify).not.toHaveBeenCalled();
+	});
+});
--- a/extensions/rpiv-pi/extensions/rpiv-core/update-agents-command.ts
+++ b/extensions/rpiv-pi/extensions/rpiv-core/update-agents-command.ts
@@ -0,0 +1,44 @@
+/**
+ * /rpiv-update-agents — apply-mode sync of bundled agents into <cwd>/.pi/agents/.
+ * Adds new, overwrites changed managed files, removes stale managed files.
+ */
+
+import type { ExtensionAPI } from "@mariozechner/pi-coding-agent";
+import { type SyncResult, syncBundledAgents } from "./agents.js";
+
+const MSG_UP_TO_DATE = "All agents already up-to-date.";
+const MSG_NO_CHANGES = "No changes needed.";
+
+const msgSynced = (parts: string[]) => `Synced agents: ${parts.join(", ")}.`;
+const msgSyncedWithErrors = (summary: string, errors: string[]) =>
+	`${summary} ${errors.length} error(s): ${errors.join("; ")}`;
+
+export function registerUpdateAgentsCommand(pi: ExtensionAPI): void {
+	pi.registerCommand("rpiv-update-agents", {
+		description: "Sync rpiv-pi bundled agents into .pi/agents/: add new, update changed, remove stale",
+		handler: async (_args, ctx) => {
+			const result = syncBundledAgents(ctx.cwd, true);
+			if (!ctx.hasUI) return;
+			ctx.ui.notify(formatSyncReport(result), result.errors.length > 0 ? "warning" : "info");
+		},
+	});
+}
+
+function formatSyncReport(result: SyncResult): string {
+	const totalSynced = result.added.length + result.updated.length + result.removed.length;
+	if (totalSynced === 0 && result.errors.length === 0) return MSG_UP_TO_DATE;
+
+	const parts: string[] = [];
+	if (result.added.length > 0) parts.push(`${result.added.length} added`);
+	if (result.updated.length > 0) parts.push(`${result.updated.length} updated`);
+	if (result.removed.length > 0) parts.push(`${result.removed.length} removed`);
+
+	const summary = parts.length > 0 ? msgSynced(parts) : MSG_NO_CHANGES;
+	if (result.errors.length > 0) {
+		return msgSyncedWithErrors(
+			summary,
+			result.errors.map((e) => e.message),
+		);
+	}
+	return summary;
+}
--- a/extensions/rpiv-pi/package.json
+++ b/extensions/rpiv-pi/package.json
@@ -0,0 +1,58 @@
+{
+  "name": "@juicesharp/rpiv-pi",
+  "version": "1.2.1",
+  "description": "A skill-based development workflow for Pi Agent. Five skills (research, design, plan, implement, validate) and the shared subagents that compose its ship-loop.",
+  "keywords": [
+    "pi-package",
+    "pi-extension",
+    "rpiv",
+    "skills",
+    "workflow"
+  ],
+  "license": "MIT",
+  "author": "juicesharp",
+  "type": "module",
+  "repository": {
+    "type": "git",
+    "url": "git+https://github.com/juicesharp/rpiv-mono.git",
+    "directory": "packages/rpiv-pi"
+  },
+  "homepage": "https://github.com/juicesharp/rpiv-mono/tree/main/packages/rpiv-pi#readme",
+  "bugs": {
+    "url": "https://github.com/juicesharp/rpiv-mono/issues"
+  },
+  "publishConfig": {
+    "access": "public"
+  },
+  "scripts": {
+    "test": "vitest run"
+  },
+  "files": [
+    "extensions/",
+    "skills/",
+    "agents/",
+    "scripts/",
+    "README.md",
+    "LICENSE"
+  ],
+  "pi": {
+    "extensions": [
+      "./extensions"
+    ],
+    "skills": [
+      "./skills"
+    ]
+  },
+  "peerDependencies": {
+    "@mariozechner/pi-coding-agent": "*",
+    "@tintinweb/pi-subagents": "*",
+    "@juicesharp/rpiv-ask-user-question": "*",
+    "@juicesharp/rpiv-todo": "*",
+    "@juicesharp/rpiv-advisor": "*",
+    "@juicesharp/rpiv-btw": "*",
+    "@juicesharp/rpiv-i18n": "*",
+    "@juicesharp/rpiv-web-tools": "*",
+    "@juicesharp/rpiv-args": "*",
+    "yaml": "*"
+  }
+}
--- a/extensions/rpiv-pi/scripts/migrate.js
+++ b/extensions/rpiv-pi/scripts/migrate.js
@@ -0,0 +1,245 @@
+import { execSync } from "child_process";
+import { existsSync, mkdirSync, readdirSync, readFileSync, statSync, unlinkSync, writeFileSync } from "fs";
+import { dirname, join, relative, sep } from "path";
+
+// --- CLI Argument Parsing ---
+function parseArgs(argv) {
+	let projectDir = process.cwd();
+	let deleteOriginals = false;
+	let dryRun = false;
+	let force = false;
+	for (let i = 2; i < argv.length; i++) {
+		if (argv[i] === "--project-dir" && argv[i + 1]) {
+			projectDir = argv[++i];
+		} else if (argv[i] === "--delete-originals") {
+			deleteOriginals = true;
+		} else if (argv[i] === "--dry-run") {
+			dryRun = true;
+		} else if (argv[i] === "--force") {
+			force = true;
+		}
+	}
+	return { projectDir, deleteOriginals, dryRun, force };
+}
+// --- Discovery ---
+const HARDCODED_EXCLUDES = new Set([
+	"node_modules",
+	"dist",
+	"build",
+	".git",
+	"vendor",
+	".rpiv",
+	".next",
+	".nuxt",
+	".output",
+	"coverage",
+	"__pycache__",
+	".venv",
+]);
+function discoverClaudeMdFiles(projectDir) {
+	const gitDir = join(projectDir, ".git");
+	if (existsSync(gitDir)) {
+		return discoverViaGit(projectDir);
+	}
+	return discoverViaWalk(projectDir);
+}
+function discoverViaGit(projectDir) {
+	try {
+		const output = execSync("git ls-files --cached --others --exclude-standard", {
+			cwd: projectDir,
+			encoding: "utf-8",
+			maxBuffer: 10 * 1024 * 1024,
+		});
+		return output
+			.split("\n")
+			.filter((f) => f.endsWith("/CLAUDE.md") || f === "CLAUDE.md")
+			.filter((f) => !f.startsWith(".rpiv/"));
+	} catch {
+		// git command failed — fall back to walk
+		return discoverViaWalk(projectDir);
+	}
+}
+function discoverViaWalk(projectDir) {
+	const results = [];
+	function walk(dir) {
+		let entries;
+		try {
+			entries = readdirSync(dir);
+		} catch {
+			return; // permission error, skip
+		}
+		for (const entry of entries) {
+			if (HARDCODED_EXCLUDES.has(entry)) continue;
+			const fullPath = join(dir, entry);
+			let stat;
+			try {
+				stat = statSync(fullPath);
+			} catch {
+				continue;
+			}
+			if (stat.isDirectory()) {
+				walk(fullPath);
+			} else if (entry === "CLAUDE.md") {
+				const rel = relative(projectDir, fullPath).split(sep).join("/");
+				if (!rel.startsWith(".rpiv/")) {
+					results.push(rel);
+				}
+			}
+		}
+	}
+	walk(projectDir);
+	return results;
+}
+// --- Path Mapping ---
+function computeTargetPath(claudeMdRelative) {
+	const dir = dirname(claudeMdRelative);
+	if (dir === ".") {
+		return ".rpiv/guidance/architecture.md";
+	}
+	return join(".rpiv", "guidance", dir, "architecture.md").split(sep).join("/");
+}
+function transformContent(content, targetPath) {
+	let refsTransformed = 0;
+	const warnings = [];
+	// Pattern 1: Backtick-wrapped path references like `src/core/CLAUDE.md`
+	let transformed = content.replace(/`((?:[\w][\w./-]*\/)?CLAUDE\.md)`/g, (_match, claudePath) => {
+		const replacement = claudePathToGuidancePath(claudePath);
+		refsTransformed++;
+		return `\`${replacement}\``;
+	});
+	// Pattern 2: Bare path references (with directory prefix) not inside backticks
+	// Match things like "src/core/CLAUDE.md" but not already-backtick-wrapped
+	transformed = transformed.replace(/(?<!`)([\w][\w./-]*\/CLAUDE\.md)(?!`)/g, (_match, claudePath) => {
+		const replacement = claudePathToGuidancePath(claudePath);
+		refsTransformed++;
+		return replacement;
+	});
+	// Pattern 3: Standalone "CLAUDE.md" that references the root file
+	// Only match when it looks like a file reference (not part of a longer word)
+	// Avoid matching inside paths already transformed above
+	transformed = transformed.replace(/(?<![/\w`])CLAUDE\.md(?![/\w`])/g, () => {
+		refsTransformed++;
+		return ".rpiv/guidance/architecture.md";
+	});
+	// Scan for remaining prose references that might need manual attention
+	const lines = transformed.split("\n");
+	for (let i = 0; i < lines.length; i++) {
+		// Look for prose patterns like "see X CLAUDE.md" or "X layer CLAUDE.md"
+		if (
+			/\b\w+\s+CLAUDE\.md\b/i.test(content.split("\n")[i] ?? "") &&
+			!/(src|lib|app|packages|apps)\//.test(content.split("\n")[i] ?? "")
+		) {
+			// Check if this line still has an untransformed prose reference
+			if (/CLAUDE\.md/i.test(lines[i])) {
+				warnings.push({
+					file: targetPath,
+					line: i + 1,
+					message: `Prose reference to CLAUDE.md may need manual update: "${lines[i].trim()}"`,
+				});
+			}
+		}
+	}
+	return { content: transformed, refsTransformed, warnings };
+}
+function claudePathToGuidancePath(claudePath) {
+	const dir = dirname(claudePath);
+	if (dir === ".") {
+		return ".rpiv/guidance/architecture.md";
+	}
+	return `.rpiv/guidance/${dir}/architecture.md`;
+}
+// --- Main ---
+function main() {
+	const { projectDir, deleteOriginals, dryRun, force } = parseArgs(process.argv);
+	process.stderr.write(`[rpiv:migrate] scanning ${projectDir} for CLAUDE.md files\n`);
+	const claudeFiles = discoverClaudeMdFiles(projectDir);
+	if (claudeFiles.length === 0) {
+		const report = {
+			migrated: [],
+			conflicts: [],
+			warnings: [],
+			originalsDeleted: false,
+			dryRun,
+		};
+		process.stdout.write(JSON.stringify(report, null, 2));
+		return;
+	}
+	process.stderr.write(`[rpiv:migrate] found ${claudeFiles.length} CLAUDE.md file(s)\n`);
+	const migrated = [];
+	const conflicts = [];
+	const allWarnings = [];
+	const writtenFiles = [];
+	for (const source of claudeFiles) {
+		const target = computeTargetPath(source);
+		const targetAbs = join(projectDir, target);
+		// Check for conflicts
+		if (existsSync(targetAbs) && !force) {
+			conflicts.push(target);
+			continue;
+		}
+		// Read source content
+		const sourceAbs = join(projectDir, source);
+		let content;
+		try {
+			content = readFileSync(sourceAbs, "utf-8");
+		} catch (err) {
+			allWarnings.push({
+				file: source,
+				line: 0,
+				message: `Failed to read: ${err instanceof Error ? err.message : String(err)}`,
+			});
+			continue;
+		}
+		if (content.trim().length === 0) {
+			allWarnings.push({
+				file: source,
+				line: 0,
+				message: "Empty file, skipped",
+			});
+			continue;
+		}
+		// Transform content
+		const { content: transformed, refsTransformed, warnings } = transformContent(content, target);
+		const lines = transformed.split("\n").length;
+		migrated.push({ source, target, lines, refsTransformed });
+		allWarnings.push(...warnings);
+		if (!dryRun) {
+			writtenFiles.push({ targetAbs, content: transformed });
+		}
+	}
+	// Write all files (all-or-nothing approach for safety)
+	if (!dryRun) {
+		for (const { targetAbs, content } of writtenFiles) {
+			mkdirSync(dirname(targetAbs), { recursive: true });
+			writeFileSync(targetAbs, content, "utf-8");
+		}
+		process.stderr.write(`[rpiv:migrate] wrote ${writtenFiles.length} file(s)\n`);
+	}
+	// Delete originals only after all writes succeed
+	let originalsDeleted = false;
+	if (!dryRun && deleteOriginals && writtenFiles.length > 0) {
+		for (const entry of migrated) {
+			const sourceAbs = join(projectDir, entry.source);
+			try {
+				unlinkSync(sourceAbs);
+			} catch (err) {
+				allWarnings.push({
+					file: entry.source,
+					line: 0,
+					message: `Failed to delete original: ${err instanceof Error ? err.message : String(err)}`,
+				});
+			}
+		}
+		originalsDeleted = true;
+		process.stderr.write(`[rpiv:migrate] deleted ${migrated.length} original CLAUDE.md file(s)\n`);
+	}
+	const report = {
+		migrated,
+		conflicts,
+		warnings: allWarnings,
+		originalsDeleted,
+		dryRun,
+	};
+	process.stdout.write(JSON.stringify(report, null, 2));
+}
+main();
--- a/extensions/rpiv-pi/skills/annotate-guidance/SKILL.md
+++ b/extensions/rpiv-pi/skills/annotate-guidance/SKILL.md
@@ -0,0 +1,304 @@
+---
+name: annotate-guidance
+description: Generate architecture.md guidance files under .rpiv/guidance/ that document a project's architecture and patterns for AI assistants, written to a shadow tree alongside the source. Use when the user wants to onboard Claude, Cursor, or an AI agent to a codebase via the guidance system, document architecture, or asks to "annotate guidance". Prefer this over annotate-inline when the project uses the .rpiv/guidance/ shadow tree instead of inline CLAUDE.md files.
+argument-hint: [target-directory]
+allowed-tools: Agent, Read, Write, Glob, Grep
+---
+
+# Annotate Project
+
+You are tasked with generating architecture guidance files for a brownfield project. You will map the project structure, auto-detect its architecture, analyze each architectural layer, and batch-write compact architecture.md files under `.rpiv/guidance/` mirroring the project's directory structure.
+
+## Initial Setup:
+
+Use the current working directory as the target project by default. If the user provides a specific directory path as an argument, use that instead.
+
+## Steps to follow:
+
+1. **Read any directly mentioned files first:**
+   - If the user mentions specific files (existing architecture.md, CLAUDE.md, architecture docs, READMEs), read them FULLY first
+   - **IMPORTANT**: Use the Read tool WITHOUT limit/offset parameters to read entire files
+   - **CRITICAL**: Read these files yourself in the main context before invoking any skills
+   - This ensures you have full context before decomposing the work
+
+2. **Pass 1 — Map the project (parallel agents):**
+   - Spawn the following agents in parallel using the Agent tool:
+
+   **Agent A — Project tree mapping:**
+   - subagent_type: `codebase-locator`
+   - Prompt: "Map the full project tree structure for {target directory}. List all directories and their contents, respecting .gitignore. Focus on source code directories, configuration files, and build artifacts. Return a complete tree view."
+
+   **Agent B — Architecture and conventions:**
+   - subagent_type: `codebase-locator`
+   - Prompt: "Identify the architectural layout of {target directory} from path shape and manifest files — NO content analysis. Detect: (1) Architecture pattern inferred from folder shape — clean-arch via Domain/Application/Infrastructure dirs; MVC via Controllers/Models/Views; monorepo via packages/* + workspaces; microservices via services/* with individual manifests; hexagonal via ports/adapters. (2) Main layers/modules — top-level source directories + their names. (3) Frameworks and languages from manifest files (package.json dependencies, *.csproj TargetFramework, pyproject.toml, go.mod, Cargo.toml) and file extensions. (4) Build system from build-config filenames (vite/webpack/tsup/esbuild configs, Makefile, nx.json, turbo.json, dotnet .sln). For each main layer/module, check sub-directory composition. If sub-directories with distinct names/roles exist, flag each as a guidance target candidate with: (a) path, (b) role inferred from folder name (controllers/, services/, entities/, components/, stores/, etc.), (c) file count via ls, (d) how its sub-directory composition differs from sibling layers. Use grep/find/ls only. Do not read file contents. Pass 2 runs codebase-analyzer + codebase-pattern-finder per target folder for deep analysis."
+
+   - While agents run, read .gitignore yourself to understand exclusion rules
+
+3. **Wait for Pass 1 and determine guidance targets:**
+   - IMPORTANT: Wait for ALL agents from Pass 1 to complete before proceeding
+   - Synthesize the tree structure and architecture findings
+   - Auto-detect the architecture pattern (clean architecture, MVC, monorepo, microservices, etc.)
+   - Determine guidance targets using a two-pass process:
+
+     **Initial pass — identify top-level targets:**
+     - Apply the Guidance Depth Rules (see below) to top-level architectural layers
+     - This produces the initial target list (one per distinct layer/project)
+
+     **Decomposition pass — expand composite targets (ADD, never REPLACE):**
+     - For EACH initial target, review Agent B's sub-layer candidates
+     - If Agent B flagged sub-layers with distinct roles and file counts >10, ADD them as separate guidance targets alongside the parent — the parent stays in the list as an overview, sub-layers are added beneath it
+     - NEVER remove the parent when promoting sub-layers — decomposition expands the target list, it does not substitute entries
+     - Do NOT apply a blanket "sub-folders same as parent" skip — evaluate each sub-layer Agent B flagged individually against the Depth Rules
+     - Common decompositions: Angular/React/Vue apps → components/, services/, shared/; monorepo packages → per-package; large shared libraries → per-concern
+
+   - Present the proposed guidance locations to the user:
+     ```
+     ## Proposed Guidance Locations
+
+     Architecture detected: {pattern name}
+
+     Files will be written to `.rpiv/guidance/` mirroring the project structure.
+
+     ### Folders that need architectural guidance:
+     - `/` (root) — Project overview (compact)
+     - `src/core/` — Core domain layer
+     - `src/services/` — Service layer
+     - {etc.}
+
+     ### Folders to skip:
+     - `src/core/entities/` — Entity grouping, same pattern as parent
+     - {etc.}
+
+     Does this look right? Should I add or remove any locations?
+     ```
+   - Use the `ask_user_question` tool with the following question: "{N} guidance targets across {M} layers. Proceed with analysis?". Options: "Proceed (Recommended)" (Analyze all proposed folders and write architecture.md files); "Add folders" (I want to add more folders to the target list); "Remove folders" (Some proposed folders should be skipped).
+   - Adjust the target list based on user feedback
+
+4. **Pass 2 — Analyze each layer (parallel analyzer agents):**
+   - For each confirmed target folder, spawn agents in parallel using the Agent tool:
+
+   **For each target folder, spawn TWO agents:**
+
+   **Analyzer agent:**
+   - subagent_type: `codebase-analyzer`
+   - Prompt: "Analyze {folder path} in detail. Determine: 1) What is this layer's responsibility? 2) What are its dependencies (what does it import/use)? 3) Who consumes it (what imports/uses it)? 4) What are the key architectural boundaries and constraints? 5) What is the module structure — list DIRECTORIES with their roles, base types, and naming conventions. Use architectural annotations (e.g., 'one repo per entity', 'one controller per resource') instead of listing individual filenames. The structure should remain valid when non-architectural files are added. 6) What naming conventions are used (prefixes, suffixes, base classes)?"
+
+   **Pattern finder agent:**
+   - subagent_type: `codebase-pattern-finder`
+   - Prompt: "Find all distinct code patterns used in {folder path}. For each pattern found: 1) Name the pattern with a descriptive heading (e.g., 'Repository Boundary (CRITICAL: Plain Types, NOT Result<T>)'). 2) Provide an IDIOMATIC code example — a generalized, representative version that shows the pattern's essential shape (constructor, key method signatures, return types, error handling). Do NOT copy-paste a single file verbatim; instead synthesize the typical usage across the layer. 3) Add inline comments highlighting important conventions (e.g., '// DB int → boolean', '// throws on error — service wraps in Result'). 4) If the pattern involves a boundary between layers, show both sides. 5) Identify any repeatable workflows for adding new elements to this layer — backend entities (repositories, services, controllers) AND frontend elements (components, services, pages/routes, directives). For example: creating a new repository requires extending BaseRepository + registering in factory; adding a new Angular component requires extending BaseComponent + adding to routes + creating the template. Return these as step-by-step checklists. Return patterns with full code block examples."
+
+   - Emit 1 analyzer + 1 pattern finder per folder as separate `Agent(...)` calls in the same tool-use batch
+   - For the root architecture.md, use findings from ALL folders to create the overview
+
+5. **Wait for Pass 2 and synthesize:**
+   - IMPORTANT: Wait for ALL agents from Pass 2 to complete before proceeding
+   - Compile all agent findings per folder
+   - **Do NOT draft architecture.md content yet** — proceed to developer checkpoint first (Step 6)
+
+6. **Developer checkpoint — validate findings before drafting:**
+
+   Present a per-folder findings summary, then ask grounded questions. This pulls domain knowledge that agents can't discover from code alone — deprecated patterns, undocumented conventions, migration-in-progress situations, or cross-layer rules that only the developer knows.
+
+   **Findings summary** — one block per target folder, 2-3 lines each:
+   ```
+   ## Findings Summary
+
+   ### src/core/
+   Patterns: Repository base class, Entity base with soft-delete, Value Objects
+   Dependencies: Database layer (outbound), Services layer (inbound)
+   Workflows detected: "Add new entity" (5 steps), "Add new value object" (2 steps)
+
+   ### src/services/
+   Patterns: Result<T> wrapping, Transaction scope per operation
+   Dependencies: Core (outbound), Controllers (inbound)
+   Workflows detected: "Add new service" (4 steps)
+
+   {etc.}
+   ```
+
+   Then ask grounded questions — **one at a time**, waiting for the developer's answer before asking the next. Ask as many as the findings warrant — one per significant ambiguity or discovery gap. Use a **❓ Question:** prefix. Each question must reference real findings and pull NEW information from the developer — not confirm what you already found, and not ask about cosmetic issues (typos, formatting) or absences the developer can't add context to.
+
+   Only ask questions whose answer would change what gets written in an architecture.md file. Focus on:
+   - Competing patterns that need a canonical vs. legacy designation (which style should new code follow?)
+   - Cross-layer dependencies that look like violations but might be design decisions
+   - Undocumented architectural constraints not visible in code (ordering, idempotency, invariants)
+
+   Example grounded questions:
+   - "Found two different mapping approaches in `src/services/`: manual mapping in `OrderService` and AutoMapper in `UserService`. Which is the current convention, or is there a migration in progress I should document?"
+   - "The analyzer found no event/message patterns in `src/core/`. Is domain event publishing handled elsewhere, or is it not used in this project?"
+   - "Detected 3 different error-handling styles across layers. Is there a canonical approach, or are these intentional per-layer differences?"
+
+   **CRITICAL**: Ask ONE question at a time. Wait for the answer before asking the next. Lead with your most significant finding. The developer will redirect you if needed.
+
+   **Choosing question format:**
+
+   - **`ask_user_question` tool** — when your question has 2-4 concrete options from code analysis (pattern conflicts, integration choices, scope boundaries, priority overrides). The user can always pick "Other" for free-text. Example: Use the `ask_user_question` tool with the question "Found 2 mapping approaches — which should new code follow?". Options: "Manual mapping (Recommended)" (Used in OrderService (src/services/OrderService.ts:45) — 8 occurrences); "AutoMapper" (Used in UserService (src/services/UserService.ts:12) — 2 occurrences).
+
+   - **Free-text with ❓ Question: prefix** — when the question is open-ended and options can't be predicted (discovery, "what am I missing?", corrections). Example:
+     "❓ Question: Integration scanner found no background job registration for this area. Is that expected, or is there async processing I'm not seeing?"
+
+   **Batching**: When you have 2-4 independent questions (answers don't depend on each other), you MAY batch them in a single `ask_user_question` call. Keep dependent questions sequential.
+
+   **Incorporate developer input:**
+
+   **Corrections** ("that pattern is deprecated", "wrong — we use X"):
+   - Update synthesis. If the correction reveals a pattern that needs fresh analysis, re-prompt a targeted **codebase-analyzer** or **codebase-pattern-finder** (max 2 agents).
+
+   **Missing conventions** ("you missed the soft-delete convention", "all handlers must be idempotent"):
+   - Add directly to synthesis for the relevant folder.
+
+   **Migration context** ("we're moving from X to Y", "old pattern in these files, new pattern in those"):
+   - Record both old and new approaches in synthesis — architecture.md should document the canonical (new) way with a note about the legacy approach still present in specific areas.
+
+   **Scope adjustments** ("skip that layer, it's being rewritten", "add src/shared/"):
+   - Update target list. For new targets, run a targeted Pass 2 (analyzer + pattern-finder, max 2 agents), then fold results into synthesis.
+
+   **Confirmations** ("looks right", "yes that's correct"):
+   - Proceed to drafting.
+
+   After incorporating all input, proceed to Step 7.
+
+7. **Draft architecture.md content:**
+   - Draft architecture.md content in this order — **subfolder files first, root last**:
+     - Subfolder: Use the **Subfolder Architecture Template** (detailed, max 100 lines)
+     - Root folder (LAST): Use the **Root Architecture Template** (compact overview). Draft root only after all subfolder files are finalized — this ensures the deduplication rule can be applied and cross-layer checklists can accurately reference subfolder content
+   - **Output directory convention:** All architecture.md files are written under `.rpiv/guidance/` at the project root, mirroring the project's directory structure. For a target folder at `src/core/`, the output path is `.rpiv/guidance/src/core/architecture.md`. For the root target, the output path is `.rpiv/guidance/architecture.md`. Create intermediate directories as needed.
+   - Enforce the 100-line limit on subfolder files — code examples are essential but keep them concise
+   - If the pattern-finder identified repeatable "add new entity" workflows, include them as `<important if="you are adding a new {entity} to this layer">` conditional sections
+   - If testing patterns were detected, include them as `<important if="you are writing or modifying tests for this layer">` conditional sections
+   - Conditional sections are optional — only include when the pattern-finder found clear evidence of a repeatable workflow
+   - Conditions must be narrow and action-specific (NOT "you are writing code" — too broad)
+   - Do NOT include conventions enforceable by linters, formatters, or pre-commit hooks (e.g., naming conventions, import ordering, indentation) — these add noise without value
+   - Do NOT include patterns easily discoverable from existing code — LLMs are in-context learners and will follow patterns after a few file reads. Only document conventions that are surprising, non-obvious, or span multiple layers
+   - If a pattern section would contain only prose or comments with no code example, either expand it with a real idiomatic example or omit it and reference the source file (e.g., "see `BaseModalComponent` for the modal pattern")
+   - Before writing, verify: no root conditional block duplicates content from a subfolder architecture.md. If a layer has its own subfolder file, remove its summary from root
+   - For cross-layer vertical-slice checklists in root, each step should reference the relevant subfolder architecture.md (e.g., "see `.rpiv/guidance/src/data/architecture.md`") rather than inlining the full procedure
+   - If an existing root architecture.md or CLAUDE.md was found:
+     - Review its content
+     - Redistribute any detailed layer-specific content to the appropriate subfolder architecture.md files
+     - Rewrite the root as a compact overview
+
+8. **Self-review pass — verify every drafted file before writing:**
+   Walk through each drafted architecture.md and check every item below. Fix violations in-place before proceeding to writing.
+
+   **Dependencies** — for each listed dependency, ask: "does this library impose patterns, constraints, or conventions on the code?" If the answer is no (utility libraries like lodash, moment, xlsx, FontAwesome), remove it. Only frameworks and libraries that shape how you write code survive.
+
+   **Module Structure** — count top-level entries. If more than 7, group related directories on one line (e.g., `guards/, interceptors/, pipes/ — cross-cutting plumbing`). Target 4-7 entries.
+
+   **Pattern sections** — every pattern H2 must contain a fenced code block with an idiomatic example. If a section is prose-only or comment-only, either expand it with a real code example or replace the section with a one-line file reference (e.g., "see `TradeDeskMapping.cs` for the mapping pattern").
+
+   **Root deduplication** — for each root conditional block, verify it is NOT summarizing a layer that has its own subfolder architecture.md. If it is, remove the block. For cross-layer vertical-slice checklists, verify each step references the relevant subfolder file (e.g., "see `.rpiv/guidance/X/architecture.md`") rather than inlining the procedure.
+
+   **Frontend/UI conditional coverage** — for each frontend/UI layer, list every repeatable workflow the pattern-finder reported (components, services, pages/routes, directives, pipes, hooks, stores — whatever was detected). Then compare that list against the drafted `<important if>` conditional sections. Any workflow on the list without a matching conditional is a gap — draft and add the missing section before proceeding.
+
+   After fixing all violations, re-scan the corrected drafts to confirm every check passes. Only proceed to writing when all checks are clean. Present a brief summary of what was fixed:
+   ```
+   ## Self-review results
+   - {file}: removed 2 utility deps (moment, xlsx-js-style)
+   - {file}: grouped Module Structure from 11 → 6 entries
+   - {file}: added "Adding a New Service" conditional
+   - Root: no violations found
+   ```
+
+9. **Pass 3 — Write all architecture.md files:**
+   - Write each file to `.rpiv/guidance/{relative_path}/architecture.md`. For the root file, write to `.rpiv/guidance/architecture.md`. Create any intermediate directories that do not exist.
+   - Write ALL files at once using the Write tool
+   - Do NOT ask for confirmation before each file — batch mode
+   - After writing, present a summary:
+     ```
+     ## Architecture Guidance Files Created
+
+     | File | Lines | Description |
+     |------|-------|-------------|
+     | .rpiv/guidance/architecture.md | 45 | Root project overview |
+     | .rpiv/guidance/src/core/architecture.md | 78 | Core domain layer |
+     | .rpiv/guidance/src/services/architecture.md | 65 | Service layer |
+     | {etc.} | | |
+
+     Total: {N} files created/updated
+
+     Please review the files and let me know if you'd like any adjustments.
+     ```
+
+10. **Handle Follow-ups:**
+    - **Edit in-place.** If the user requests changes to specific files, edit them directly using the Edit tool — annotation files are pure markdown, no frontmatter to bump.
+    - **Re-dispatch narrowly.** If the user wants additional folders annotated, run a targeted Pass 2 (analyzer + pattern finder) for those folders, then write.
+    - **Removals.** If the user wants a file removed, note that they can delete it themselves — annotate does not delete.
+    - **When to re-invoke instead.** Re-run `/skill:annotate-guidance` for project-wide refresh after major architectural changes; for single-folder updates, prefer in-place edits.
+
+## Root Architecture Template (compact):
+
+Read the full template at `templates/root-architecture.md`.
+
+Key principles:
+- Bare sections (Overview, Architecture, Commands, Business Context) are foundational — always included
+- Cross-cutting patterns go in `<important if>` blocks with narrow conditions
+- Deduplication rule: if a layer has a subfolder architecture.md, don't summarize it in root
+- Root MAY include cross-layer vertical-slice checklists referencing subfolder files
+
+### Root Architecture Reference Examples
+
+See `examples/root-nodejs-monorepo.md` (Node.js monorepo) and `examples/root-dotnet-clean-arch.md` (.NET Clean Architecture) for well-formed root architecture.md examples.
+
+What makes these examples good:
+- **Bare sections** (Overview, Project map, Commands) are relevant to nearly every task — no wrapper needed
+- **Each `<important if>` has a narrow trigger** — "adding a new API endpoint" not "writing backend code"
+- **No linter territory** — formatting rules left to tooling
+- **No code snippets** — uses file path references since patterns are better shown in subfolder architecture.md files
+- **Same structure, different ecosystems** — the pattern works identically for Node.js and .NET
+
+## Subfolder Architecture Template (max 100 lines):
+
+Read the full template at `templates/subfolder-architecture.md`.
+
+Key principles:
+- Each distinct pattern gets its own H2 section with a fenced code block
+- Module Structure: aim for 4-7 top-level entries, use architectural annotations
+- Conditional sections (`<important if>`) are optional — only for detected repeatable workflows
+- Conditional sections do NOT count toward the 100-line budget
+
+### Reference Examples
+
+See the following for well-formed subfolder architecture.md examples:
+- `examples/subfolder-database-layer.md` — Database layer (~80 lines)
+- `examples/subfolder-schemas-layer.md` — Schemas layer (~70 lines)
+- `examples/subfolder-dotnet-application.md` — .NET Application layer (~65 lines)
+
+### What makes these examples good:
+- **Module Structure**: Compact, uses architectural annotations, groups related files on one line
+- **Patterns as H2 sections**: Each pattern has a descriptive name, NOT a generic umbrella
+- **Code examples are idiomatic**: Generalized to show the pattern's shape
+- **Cross-boundary patterns**: Shows both sides of layer boundaries
+- **Concise**: All fit well within 100 lines
+- **Conditional blocks**: Wrap scenario-specific recipes with narrow conditions
+
+## Guidance Depth Rules:
+
+**CREATE architecture.md when:**
+- Folder represents a distinct **architectural layer** (core, services, database, redis, ipc)
+- Folder contains **unique organizational logic** not captured by parent
+- Subfolder has **different patterns/constraints** than parent (e.g., `database/repositories/` vs `database/`)
+- Folder has **its own responsibility** (e.g., `database/migrations/`)
+- Folder is a **composite application root** (e.g., SPA, monorepo package) whose children represent distinct sub-layers with different patterns — apply Depth Rules recursively to its children
+
+**SKIP architecture.md when:**
+- Folder only groups entities/DTOs by domain boundary following the same pattern
+- Folder content is fully described by parent architecture.md
+- Folder is a simple grouping without unique constraints
+
+## Important notes:
+- Parallel Agent dispatch — every `Agent(...)` call in the same assistant message (multiple tool_use blocks in one response), never one per turn. Call shape: `Agent({ subagent_type: "<agent-name>", description: "<3-5 word task label>", prompt: "<task>" })`.
+- **File reading**: Always read mentioned files FULLY (no limit/offset) before invoking skills
+- **Critical ordering**: Follow the numbered steps exactly
+  - ALWAYS read mentioned files first before invoking skills (step 1)
+  - ALWAYS wait for all skills in a pass to complete before proceeding to the next step
+  - NEVER write architecture.md files with placeholder values — all content must come from skill findings
+  - NEVER proceed to Pass 2 without user confirmation of target locations
+  - NEVER skip the developer checkpoint (step 6) — developer input is the highest-value signal for architecture.md quality
+  - NEVER draft architecture.md content before completing the developer checkpoint
+- **.gitignore compliance**: Skip directories excluded by .gitignore (node_modules, dist, build, .git, vendor, etc.)
+- **Batch output mode**: Write all architecture.md files at once in Pass 3, do not ask for per-file confirmation
+- **Existing file handling**: If an architecture.md already exists at any target location in `.rpiv/guidance/`, replace it entirely using the Write tool
+- **Line budget**: Subfolder architecture.md files must not exceed 100 lines — code examples in Key Patterns are mandatory, keep them idiomatic and concise
+- **No frontmatter**: architecture.md files are pure markdown, no YAML frontmatter
+- Keep the main agent focused on synthesis, not deep file reading — delegate analysis to sub-agents
--- a/extensions/rpiv-pi/skills/annotate-guidance/examples/root-dotnet-clean-arch.md
+++ b/extensions/rpiv-pi/skills/annotate-guidance/examples/root-dotnet-clean-arch.md
@@ -0,0 +1,38 @@
+# Project Overview
+
+ASP.NET Core 8 Web API with Clean Architecture (CQRS + MediatR).
+
+## Project map
+
+- `src/Api/` - ASP.NET Core controllers, middleware, DI setup
+- `src/Application/` - MediatR handlers, validators, DTOs
+- `src/Domain/` - Entities, value objects, domain events
+- `src/Infrastructure/` - EF Core, external services, file storage
+- `tests/` - Unit and integration tests
+
+## Commands
+
+| Command | What it does |
+|---|---|
+| `dotnet build` | Build solution |
+| `dotnet test` | Run all tests |
+| `dotnet run --project src/Api` | Start API locally |
+| `dotnet ef migrations add <Name> -p src/Infrastructure` | Create EF migration |
+| `dotnet ef database update -p src/Infrastructure` | Apply migrations |
+
+<important if="you are adding a new API endpoint">
+- Add controller in `Api/Controllers/` inheriting `BaseApiController`
+- Add command/query + handler + validator in `Application/Features/`
+- See `Application/Features/Orders/Commands/CreateOrder/` for the pattern
+</important>
+
+<important if="you are adding or modifying EF Core migrations or database schema">
+- Entities configured via `IEntityTypeConfiguration<T>` in `Infrastructure/Persistence/Configurations/`
+- Always create a migration after schema changes — never modify existing migrations
+</important>
+
+<important if="you are writing or modifying tests">
+- Unit tests: xUnit + NSubstitute, one test class per handler
+- Integration tests: `WebApplicationFactory<Program>` with test database
+- See `tests/Application.IntegrationTests/TestBase.cs` for setup
+</important>
--- a/extensions/rpiv-pi/skills/annotate-guidance/examples/root-nodejs-monorepo.md
+++ b/extensions/rpiv-pi/skills/annotate-guidance/examples/root-nodejs-monorepo.md
@@ -0,0 +1,42 @@
+# Project Overview
+
+Express API + React frontend in a Turborepo monorepo.
+
+## Project map
+
+- `apps/api/` - Express REST API
+- `apps/web/` - React SPA
+- `packages/db/` - Prisma schema and client
+- `packages/ui/` - Shared component library
+- `packages/config/` - Shared configuration
+
+## Commands
+
+| Command | What it does |
+|---|---|
+| `turbo build` | Build all packages |
+| `turbo test` | Run all tests |
+| `turbo lint` | Lint all packages |
+| `turbo dev` | Start dev server |
+| `turbo db:generate` | Regenerate Prisma client after schema changes |
+| `turbo db:migrate` | Run database migrations |
+
+<important if="you are adding or modifying API routes">
+- All routes go in `apps/api/src/routes/`
+- Use Zod for request validation — see `apps/api/src/routes/connections.ts` for the pattern
+- Error responses follow RFC 7807 format
+- Authentication via JWT middleware
+</important>
+
+<important if="you are writing or modifying tests">
+- API: Jest + Supertest, Frontend: Vitest + Testing Library
+- Test fixtures in `__fixtures__/` directories
+- Use `createTestClient()` helper for API integration tests
+- Mock database with `prismaMock` from `packages/db/test`
+</important>
+
+<important if="you are working with client-side state, stores, or data fetching">
+- Zustand for global client state
+- React Query for server state
+- URL state via `nuqs`
+</important>
--- a/extensions/rpiv-pi/skills/annotate-guidance/examples/subfolder-database-layer.md
+++ b/extensions/rpiv-pi/skills/annotate-guidance/examples/subfolder-database-layer.md
@@ -0,0 +1,81 @@
+# Database Layer Architecture
+
+## Responsibility
+SQLite persistence with better-sqlite3, repository pattern (plain types), QueryQueue concurrency, type transformations.
+
+## Dependencies
+- **better-sqlite3**: Native SQLite (requires rebuild for Electron)
+- **@redis-ui/core**: Domain types
+- **p-queue**: Query serialization
+
+## Consumers
+- **@redis-ui/services**: Repositories via RepositoryFactory
+- **Main process**: DatabaseManager initialization
+
+## Module Structure
+```
+src/
+├── DatabaseManager.ts, QueryQueue.ts   # Singleton, concurrency
+├── BaseRepository.ts, RepositoryFactory.ts
+├── schema.ts
+└── repositories/                        # One repo per entity
+```
+
+## Repository Boundary (CRITICAL: Plain Types, NOT Result<T>)
+
+```typescript
+export class ConnectionRepository extends BaseRepository<ConnectionDB, Connection, ConnectionId> {
+  protected toApplication(db: ConnectionDB): Connection {
+    return {
+      id: ConnectionId.create(db.id),
+      host: db.host,
+      port: db.port,
+      sslEnabled: Boolean(db.ssl_enabled),     // DB int → boolean
+      createdAt: new Date(db.created_at),      // timestamp → Date
+    };
+  }
+
+  async findById(id: ConnectionId): Promise<Connection | null> {
+    return this.queue.enqueueRead((db) => {
+      const row = db.prepare('SELECT * FROM connections WHERE id = ?').get(id);
+      return row ? this.toApplication(row) : null;
+    });
+  }
+}
+
+// Service: Wraps repository in Result<T>
+async createConnection(input: CreateInput): Promise<Result<Connection>> {
+  try {
+    const connection = await this.connectionRepo.create(input);
+    return Result.ok(connection);
+  } catch (error) {
+    return Result.fail(new InfrastructureError(error.message));
+  }
+}
+```
+
+## QueryQueue Pattern (Write Serialization)
+
+```typescript
+export class QueryQueue {
+  private writeQueue = new PQueue({ concurrency: 1 }) // Single writer
+  private readQueue = new PQueue({ concurrency: 5 })  // Multiple readers
+
+  async enqueueWrite<T>(op: (db: Database) => T): Promise<T> {
+    return this.writeQueue.add(() => op(this.db))
+  }
+}
+```
+
+## Architectural Boundaries
+- **NO Result<T> in repos**: Services wrap with Result
+- **NO unqueued DB ops**: Always use QueryQueue
+- **NO raw SQL in services**: Use repositories
+
+<important if="you are adding a new repository to this layer">
+## Adding a New Repository
+1. Create `XRepository.ts` extending `BaseRepository<XDB, X, XId>`
+2. Implement `toApplication()` and `toDatabase()` type mappers
+3. Register in `RepositoryFactory`
+4. Add table schema in `schema.ts`
+</important>
--- a/extensions/rpiv-pi/skills/annotate-guidance/examples/subfolder-dotnet-application.md
+++ b/extensions/rpiv-pi/skills/annotate-guidance/examples/subfolder-dotnet-application.md
@@ -0,0 +1,64 @@
+# Application Layer (CQRS + MediatR)
+
+## Responsibility
+Command/query handlers orchestrating domain logic via MediatR pipeline. Sits between API controllers and Domain layer.
+
+## Dependencies
+- **MediatR**: Command/query dispatch
+- **FluentValidation**: Request validation via pipeline behavior
+- **AutoMapper**: Domain ↔ DTO mapping
+
+## Consumers
+- **API Controllers**: Send commands/queries via `IMediator`
+- **Integration tests**: Direct handler invocation
+
+## Module Structure
+```
+Application/
+├── Common/
+│   ├── Behaviors/          # MediatR pipeline (validation, logging)
+│   └── Mappings/           # AutoMapper profiles
+├── Features/               # One folder per aggregate
+│   └── Orders/
+│       ├── Commands/       # CreateOrder/, UpdateOrder/ (handler + validator + DTO)
+│       └── Queries/        # GetOrder/, ListOrders/
+└── DependencyInjection.cs  # Service registration
+```
+
+## Handler Pattern (Command with Validation)
+
+```csharp
+public record CreateOrderCommand(string CustomerId, List<LineItemDto> Items)
+    : IRequest<Result<OrderDto>>;
+
+public class CreateOrderValidator : AbstractValidator<CreateOrderCommand> {
+    public CreateOrderValidator(IOrderRepository repo) {
+        RuleFor(x => x.CustomerId).NotEmpty();
+        RuleFor(x => x.Items).NotEmpty();
+    }
+}
+
+public class CreateOrderHandler : IRequestHandler<CreateOrderCommand, Result<OrderDto>> {
+    public async Task<Result<OrderDto>> Handle(
+        CreateOrderCommand request, CancellationToken ct) {
+        var order = Order.Create(request.CustomerId, request.Items); // Domain factory
+        await _repo.AddAsync(order, ct);
+        await _unitOfWork.SaveChangesAsync(ct);
+        return Result.Ok(_mapper.Map<OrderDto>(order));
+    }
+}
+```
+
+## Architectural Boundaries
+- **NO domain logic in handlers**: Handlers orchestrate, domain objects contain logic
+- **NO direct DbContext access**: Use repository abstractions
+- **NO cross-feature references**: Features are independent vertical slices
+
+<important if="you are adding a new feature or command/query handler">
+## Adding a New Feature
+1. Create folder under `Features/{Aggregate}/{Commands|Queries}/`
+2. Add `Command`/`Query` record implementing `IRequest<Result<TDto>>`
+3. Add `Validator` extending `AbstractValidator<TCommand>`
+4. Add `Handler` implementing `IRequestHandler<TCommand, Result<TDto>>`
+5. Add AutoMapper profile in `Common/Mappings/` if new DTO
+</important>
--- a/extensions/rpiv-pi/skills/annotate-guidance/examples/subfolder-schemas-layer.md
+++ b/extensions/rpiv-pi/skills/annotate-guidance/examples/subfolder-schemas-layer.md
@@ -0,0 +1,50 @@
+# Schemas Layer Architecture
+
+## Responsibility
+Zod validation schemas for dual-layer validation (preload UX + main security), type inference via z.infer<>.
+
+## Dependencies
+- **zod**: Runtime validation
+
+## Consumers
+- **@redis-ui/ipc**: Main process validation (security)
+- **Preload**: Fail-fast validation (UX)
+- **TypeScript**: Type inference
+
+## Module Structure
+```
+src/
+├── connection.ts, backup.ts    # Domain schemas
+└── __tests__/                  # Validation tests
+```
+
+## Complete Schema Pattern (Types + Validation + Composition)
+
+```typescript
+export const createConnectionSchema = z.object({
+  name: z.string().min(1).max(255),
+  host: z.string().min(1),
+  port: z.number().int().min(1).max(65535),
+  password: z.string().optional(),
+  database: z.number().int().min(0).max(15).default(0),
+})
+
+// Type inference
+export type CreateConnectionInput = z.infer<typeof createConnectionSchema>
+
+// Update schema (partial + ID required)
+export const updateConnectionSchema = createConnectionSchema.partial().extend({
+  id: z.string().min(1)
+})
+```
+
+## Dual-Validation Flow
+
+```
+Renderer input → Preload (Zod parse, fail fast) → IPC → Main (Zod parse again, security)
+```
+
+## Architectural Boundaries
+- **NO any types**: Use z.unknown()
+- **NO skipping validation**: Always validate at boundaries
+- **NO business logic**: Structure validation only
--- a/extensions/rpiv-pi/skills/annotate-guidance/templates/root-architecture.md
+++ b/extensions/rpiv-pi/skills/annotate-guidance/templates/root-architecture.md
@@ -0,0 +1,46 @@
+```markdown
+# Project Overview
+{1-2 sentences: what it is, tech stack}
+
+# Architecture
+{monorepo structure tree + dependency flow diagram}
+{process architecture if applicable}
+
+# Commands
+{key commands table — always bare, never wrapped in <important if>}
+
+# Business Context
+{1-2 sentences if applicable}
+```
+
+The sections above (Overview, Architecture, Commands, Business Context) are foundational — they stay bare because they're relevant to virtually every task.
+
+Cross-cutting patterns and domain-specific conventions go in `<important if>` blocks with narrow, action-specific conditions. Do NOT group unrelated rules under a single broad condition like "you are writing or modifying code". Instead, shard by trigger.
+
+Root conditional blocks are for **cross-cutting conventions that don't belong to any single layer**. Layer-specific recipes (like "adding a new controller" or "adding a new repository") belong in the subfolder architecture.md, not the root.
+
+**Deduplication rule:** If a layer has its own subfolder architecture.md, do NOT add a root conditional block summarizing that layer's conventions. The subfolder file is the authoritative guide — it provides detailed layer-specific documentation in `.rpiv/guidance/`. Root conditionals that mirror subfolder content waste attention budget and create staleness risk.
+
+Root MAY include cross-layer vertical-slice checklists (e.g., "adding a new domain entity end-to-end") that reference multiple subfolder architecture.md files — but each step should point to the relevant subfolder for details, not inline them.
+
+Good root conditions — things that span multiple layers:
+
+```markdown
+<important if="you are writing or modifying tests">
+- Unit: xUnit + NSubstitute / Jest + Testing Library
+- Integration: WebApplicationFactory / Supertest
+- Test fixtures in `__fixtures__/` or `tests/Fixtures/`
+</important>
+
+<important if="you are adding or modifying database migrations">
+- Never modify existing migrations — always create new ones
+- Run `dotnet ef migrations add` / `turbo db:migrate` after schema changes
+</important>
+
+<important if="you are adding or modifying environment configuration">
+- All config via `IOptions<T>` pattern / environment variables
+- Secrets in user-secrets locally, Key Vault in production
+</important>
+```
+
+Each block should contain only rules that share the same trigger condition. If a codebase has 3 distinct convention areas, that's 3 blocks — not 1 block with a broad condition. Layer-specific checklists (adding a controller, adding a repository) go in the subfolder architecture.md using `<important if="you are adding a new {entity} to this layer">`.
--- a/extensions/rpiv-pi/skills/annotate-guidance/templates/subfolder-architecture.md
+++ b/extensions/rpiv-pi/skills/annotate-guidance/templates/subfolder-architecture.md
@@ -0,0 +1,57 @@
+```markdown
+# {Layer/Component Name}
+
+## Responsibility
+{1-2 sentences: what this layer does, where it sits in architecture}
+
+## Dependencies
+{List only architectural dependencies — frameworks and libraries that shape how you write code in this layer.
+Do NOT list utility libraries discoverable from package.json/imports (e.g., lodash, moment, xlsx).
+A dependency is worth listing if it imposes patterns, constraints, or conventions on the code.}
+- **{dep}**: Why it's used
+
+## Consumers
+- **{consumer}**: How it uses this layer
+
+## Module Structure
+{Compact directory tree — aim for 4-7 top-level entries, not 15.
+Group related files on one line (e.g., "Service.ts, Handler.ts").
+Use architectural annotations for directories (e.g., "# One repo per entity", "# Domain schemas").
+DO NOT enumerate individual files inside directories — describe the convention.
+When a layer has many directories (10+), group related concerns on one line
+(e.g., "guards/, interceptors/, pipes/ — infrastructure plumbing") rather than listing each separately.
+The structure must stay valid when non-architectural files are added.}
+
+## {Pattern Name} ({Key Constraint or Characterization})
+{Each distinct pattern gets its own H2 section — NOT a generic "## Key Patterns" umbrella.
+Include a fenced code block with an idiomatic, generalized example showing:
+- Constructor / dependencies
+- Key method signatures and return types
+- Error handling / wrapping conventions
+- Inline comments for important conventions (e.g., "// throws on error — service wraps in Result")
+If a pattern spans a layer boundary, show both sides briefly.
+Multiple patterns = multiple H2 sections.}
+
+## {Additional Pattern Name}
+{Second pattern with code block if applicable}
+
+## Architectural Boundaries
+- **NO {X}**: {Why}
+- **NO {Y}**: {Why}
+
+<important if="you are adding a new {entity type} to this layer">
+## Adding a New {Entity Type}
+{Step-by-step checklist inferred from existing code:
+1. Create file following naming convention
+2. Extend/implement base class or interface
+3. Register in factory/container/index
+4. Add related artifacts (schema, test, migration)}
+</important>
+
+<important if="you are writing or modifying tests for this layer">
+## Testing Conventions
+{Test patterns, helpers, fixture locations, mocking approach — if detectable from code}
+</important>
+```
+
+Conditional sections are OPTIONAL — only include them if the pattern-finder skill detects testable patterns or clear "add new entity" workflows. Conditions must be narrow and action-specific. These sections contain checklists/recipes, not code examples (those stay in the unconditional pattern sections). Conditional sections do NOT count toward the 100-line budget for unconditional content.
--- a/extensions/rpiv-pi/skills/annotate-inline/SKILL.md
+++ b/extensions/rpiv-pi/skills/annotate-inline/SKILL.md
@@ -0,0 +1,300 @@
+---
+name: annotate-inline
+description: Generate CLAUDE.md files placed inline next to source code across a project, documenting architecture and patterns for AI assistants. Use when the user wants to onboard Claude to a codebase via inline CLAUDE.md files, generate per-directory guidance, document architecture in-place, or asks to "annotate inline". Prefer this over annotate-guidance when CLAUDE.md should live alongside the code rather than in a shadow tree.
+argument-hint: [target-directory]
+allowed-tools: Agent, Read, Write, Glob, Grep
+---
+
+# Annotate Project
+
+You are tasked with generating CLAUDE.md files across a brownfield project. You will map the project structure, auto-detect its architecture, analyze each architectural layer, and batch-write compact CLAUDE.md files at the root and relevant subdirectories.
+
+## Initial Setup:
+
+Use the current working directory as the target project by default. If the user provides a specific directory path as an argument, use that instead.
+
+## Steps to follow:
+
+1. **Read any directly mentioned files first:**
+   - If the user mentions specific files (existing CLAUDE.md, architecture docs, READMEs), read them FULLY first
+   - **IMPORTANT**: Use the Read tool WITHOUT limit/offset parameters to read entire files
+   - **CRITICAL**: Read these files yourself in the main context before invoking any skills
+   - This ensures you have full context before decomposing the work
+
+2. **Pass 1 — Map the project (parallel agents):**
+   - Spawn the following agents in parallel using the Agent tool:
+
+   **Agent A — Project tree mapping:**
+   - subagent_type: `codebase-locator`
+   - Prompt: "Map the full project tree structure for {target directory}. List all directories and their contents, respecting .gitignore. Focus on source code directories, configuration files, and build artifacts. Return a complete tree view."
+
+   **Agent B — Architecture and conventions:**
+   - subagent_type: `codebase-locator`
+   - Prompt: "Identify the architectural layout of {target directory} from path shape and manifest files — NO content analysis. Detect: (1) Architecture pattern inferred from folder shape — clean-arch via Domain/Application/Infrastructure dirs; MVC via Controllers/Models/Views; monorepo via packages/* + workspaces; microservices via services/* with individual manifests; hexagonal via ports/adapters. (2) Main layers/modules — top-level source directories + their names. (3) Frameworks and languages from manifest files (package.json dependencies, *.csproj TargetFramework, pyproject.toml, go.mod, Cargo.toml) and file extensions. (4) Build system from build-config filenames (vite/webpack/tsup/esbuild configs, Makefile, nx.json, turbo.json, dotnet .sln). For each main layer/module, check sub-directory composition. If sub-directories with distinct names/roles exist, flag each as a CLAUDE.md target candidate with: (a) path, (b) role inferred from folder name (controllers/, services/, entities/, components/, stores/, etc.), (c) file count via ls, (d) how its sub-directory composition differs from sibling layers. Use grep/find/ls only. Do not read file contents. Pass 2 runs codebase-analyzer + codebase-pattern-finder per target folder for deep analysis."
+
+   - While agents run, read .gitignore yourself to understand exclusion rules
+
+3. **Wait for Pass 1 and determine CLAUDE.md targets:**
+   - IMPORTANT: Wait for ALL agents from Pass 1 to complete before proceeding
+   - Synthesize the tree structure and architecture findings
+   - Auto-detect the architecture pattern (clean architecture, MVC, monorepo, microservices, etc.)
+   - Determine CLAUDE.md targets using a two-pass process:
+
+     **Initial pass — identify top-level targets:**
+     - Apply the CLAUDE.md Depth Rules (see below) to top-level architectural layers
+     - This produces the initial target list (one per distinct layer/project)
+
+     **Decomposition pass — expand composite targets (ADD, never REPLACE):**
+     - For EACH initial target, review Agent B's sub-layer candidates
+     - If Agent B flagged sub-layers with distinct roles and file counts >10, ADD them as separate CLAUDE.md targets alongside the parent — the parent stays in the list as an overview, sub-layers are added beneath it
+     - NEVER remove the parent when promoting sub-layers — decomposition expands the target list, it does not substitute entries
+     - Do NOT apply a blanket "sub-folders same as parent" skip — evaluate each sub-layer Agent B flagged individually against the Depth Rules
+     - Common decompositions: Angular/React/Vue apps → components/, services/, shared/; monorepo packages → per-package; large shared libraries → per-concern
+
+   - Present the proposed CLAUDE.md locations to the user:
+     ```
+     ## Proposed CLAUDE.md Locations
+
+     Architecture detected: {pattern name}
+
+     ### Will create CLAUDE.md in:
+     - `/` (root) — Project overview (compact)
+     - `/src/core/` — Core domain layer
+     - `/src/services/` — Service layer
+     - {etc.}
+
+     ### Will skip:
+     - `/src/core/entities/` — Entity grouping, same pattern as parent
+     - {etc.}
+
+     Does this look right? Should I add or remove any locations?
+     ```
+   - Use the `ask_user_question` tool with the following question: "{N} CLAUDE.md targets across {M} layers. Proceed with analysis?". Options: "Proceed (Recommended)" (Analyze all proposed folders and write CLAUDE.md files); "Add folders" (I want to add more folders to the target list); "Remove folders" (Some proposed folders should be skipped).
+   - Adjust the target list based on user feedback
+
+4. **Pass 2 — Analyze each layer (parallel analyzer agents):**
+   - For each confirmed target folder, spawn agents in parallel using the Agent tool:
+
+   **For each target folder, spawn TWO agents:**
+
+   **Analyzer agent:**
+   - subagent_type: `codebase-analyzer`
+   - Prompt: "Analyze {folder path} in detail. Determine: 1) What is this layer's responsibility? 2) What are its dependencies (what does it import/use)? 3) Who consumes it (what imports/uses it)? 4) What are the key architectural boundaries and constraints? 5) What is the module structure — list DIRECTORIES with their roles, base types, and naming conventions. Use architectural annotations (e.g., 'one repo per entity', 'one controller per resource') instead of listing individual filenames. The structure should remain valid when non-architectural files are added. 6) What naming conventions are used (prefixes, suffixes, base classes)?"
+
+   **Pattern finder agent:**
+   - subagent_type: `codebase-pattern-finder`
+   - Prompt: "Find all distinct code patterns used in {folder path}. For each pattern found: 1) Name the pattern with a descriptive heading (e.g., 'Repository Boundary (CRITICAL: Plain Types, NOT Result<T>)'). 2) Provide an IDIOMATIC code example — a generalized, representative version that shows the pattern's essential shape (constructor, key method signatures, return types, error handling). Do NOT copy-paste a single file verbatim; instead synthesize the typical usage across the layer. 3) Add inline comments highlighting important conventions (e.g., '// DB int → boolean', '// throws on error — service wraps in Result'). 4) If the pattern involves a boundary between layers, show both sides. 5) Identify any repeatable workflows for adding new elements to this layer — backend entities (repositories, services, controllers) AND frontend elements (components, services, pages/routes, directives). For example: creating a new repository requires extending BaseRepository + registering in factory; adding a new Angular component requires extending BaseComponent + adding to routes + creating the template. Return these as step-by-step checklists. Return patterns with full code block examples."
+
+   - Emit 1 analyzer + 1 pattern finder per folder as separate `Agent(...)` calls in the same tool-use batch
+   - For the root CLAUDE.md, use findings from ALL folders to create the overview
+
+5. **Wait for Pass 2 and synthesize:**
+   - IMPORTANT: Wait for ALL agents from Pass 2 to complete before proceeding
+   - Compile all agent findings per folder
+   - **Do NOT draft CLAUDE.md content yet** — proceed to developer checkpoint first (Step 6)
+
+6. **Developer checkpoint — validate findings before drafting:**
+
+   Present a per-folder findings summary, then ask grounded questions. This pulls domain knowledge that agents can't discover from code alone — deprecated patterns, undocumented conventions, migration-in-progress situations, or cross-layer rules that only the developer knows.
+
+   **Findings summary** — one block per target folder, 2-3 lines each:
+   ```
+   ## Findings Summary
+
+   ### src/core/
+   Patterns: Repository base class, Entity base with soft-delete, Value Objects
+   Dependencies: Database layer (outbound), Services layer (inbound)
+   Workflows detected: "Add new entity" (5 steps), "Add new value object" (2 steps)
+
+   ### src/services/
+   Patterns: Result<T> wrapping, Transaction scope per operation
+   Dependencies: Core (outbound), Controllers (inbound)
+   Workflows detected: "Add new service" (4 steps)
+
+   {etc.}
+   ```
+
+   Then ask grounded questions — **one at a time**, waiting for the developer's answer before asking the next. Ask as many as the findings warrant — one per significant ambiguity or discovery gap. Use a **❓ Question:** prefix. Each question must reference real findings and pull NEW information from the developer — not confirm what you already found, and not ask about cosmetic issues (typos, formatting) or absences the developer can't add context to.
+
+   Only ask questions whose answer would change what gets written in a CLAUDE.md file. Focus on:
+   - Competing patterns that need a canonical vs. legacy designation (which style should new code follow?)
+   - Cross-layer dependencies that look like violations but might be design decisions
+   - Undocumented architectural constraints not visible in code (ordering, idempotency, invariants)
+
+   Example grounded questions:
+   - "Found two different mapping approaches in `src/services/`: manual mapping in `OrderService` and AutoMapper in `UserService`. Which is the current convention, or is there a migration in progress I should document?"
+   - "The analyzer found no event/message patterns in `src/core/`. Is domain event publishing handled elsewhere, or is it not used in this project?"
+   - "Detected 3 different error-handling styles across layers. Is there a canonical approach, or are these intentional per-layer differences?"
+
+   **CRITICAL**: Ask ONE question at a time. Wait for the answer before asking the next. Lead with your most significant finding. The developer will redirect you if needed.
+
+   **Choosing question format:**
+
+   - **`ask_user_question` tool** — when your question has 2-4 concrete options from code analysis (pattern conflicts, integration choices, scope boundaries, priority overrides). The user can always pick "Other" for free-text. Example: Use the `ask_user_question` tool with the question "Found 2 mapping approaches — which should new code follow?". Options: "Manual mapping (Recommended)" (Used in OrderService (src/services/OrderService.ts:45) — 8 occurrences); "AutoMapper" (Used in UserService (src/services/UserService.ts:12) — 2 occurrences).
+
+   - **Free-text with ❓ Question: prefix** — when the question is open-ended and options can't be predicted (discovery, "what am I missing?", corrections). Example:
+     "❓ Question: Integration scanner found no background job registration for this area. Is that expected, or is there async processing I'm not seeing?"
+
+   **Batching**: When you have 2-4 independent questions (answers don't depend on each other), you MAY batch them in a single `ask_user_question` call. Keep dependent questions sequential.
+
+   **Incorporate developer input:**
+
+   **Corrections** ("that pattern is deprecated", "wrong — we use X"):
+   - Update synthesis. If the correction reveals a pattern that needs fresh analysis, re-prompt a targeted **codebase-analyzer** or **codebase-pattern-finder** (max 2 agents).
+
+   **Missing conventions** ("you missed the soft-delete convention", "all handlers must be idempotent"):
+   - Add directly to synthesis for the relevant folder.
+
+   **Migration context** ("we're moving from X to Y", "old pattern in these files, new pattern in those"):
+   - Record both old and new approaches in synthesis — CLAUDE.md should document the canonical (new) way with a note about the legacy approach still present in specific areas.
+
+   **Scope adjustments** ("skip that layer, it's being rewritten", "add src/shared/"):
+   - Update target list. For new targets, run a targeted Pass 2 (analyzer + pattern-finder, max 2 agents), then fold results into synthesis.
+
+   **Confirmations** ("looks right", "yes that's correct"):
+   - Proceed to drafting.
+
+   After incorporating all input, proceed to Step 7.
+
+7. **Draft CLAUDE.md content:**
+   - Draft CLAUDE.md content in this order — **subfolder files first, root last**:
+     - Subfolder: Use the **Subfolder CLAUDE.md Template** (detailed, max 100 lines)
+     - Root folder (LAST): Use the **Root CLAUDE.md Template** (compact overview). Draft root only after all subfolder files are finalized — this ensures the deduplication rule can be applied and cross-layer checklists can accurately reference subfolder content
+   - Enforce the 100-line limit on subfolder files — code examples are essential but keep them concise
+   - If the pattern-finder identified repeatable "add new entity" workflows, include them as `<important if="you are adding a new {entity} to this layer">` conditional sections
+   - If testing patterns were detected, include them as `<important if="you are writing or modifying tests for this layer">` conditional sections
+   - Conditional sections are optional — only include when the pattern-finder found clear evidence of a repeatable workflow
+   - Conditions must be narrow and action-specific (NOT "you are writing code" — too broad)
+   - Do NOT include conventions enforceable by linters, formatters, or pre-commit hooks (e.g., naming conventions, import ordering, indentation) — these add noise without value
+   - Do NOT include patterns easily discoverable from existing code — LLMs are in-context learners and will follow patterns after a few file reads. Only document conventions that are surprising, non-obvious, or span multiple layers
+   - If a pattern section would contain only prose or comments with no code example, either expand it with a real idiomatic example or omit it and reference the source file (e.g., "see `BaseModalComponent` for the modal pattern")
+   - Before writing, verify: no root conditional block duplicates content from a subfolder CLAUDE.md. If a layer has its own subfolder file, remove its summary from root
+   - For cross-layer vertical-slice checklists in root, each step should reference the relevant subfolder CLAUDE.md ("see Data layer CLAUDE.md") rather than inlining the full procedure
+   - If an existing root CLAUDE.md was found:
+     - Review its content
+     - Redistribute any detailed layer-specific content to the appropriate subfolder CLAUDE.md files
+     - Rewrite the root as a compact overview
+
+8. **Self-review pass — verify every drafted file before writing:**
+   Walk through each drafted CLAUDE.md and check every item below. Fix violations in-place before proceeding to writing.
+
+   **Dependencies** — for each listed dependency, ask: "does this library impose patterns, constraints, or conventions on the code?" If the answer is no (utility libraries like lodash, moment, xlsx, FontAwesome), remove it. Only frameworks and libraries that shape how you write code survive.
+
+   **Module Structure** — count top-level entries. If more than 7, group related directories on one line (e.g., `guards/, interceptors/, pipes/ — cross-cutting plumbing`). Target 4-7 entries.
+
+   **Pattern sections** — every pattern H2 must contain a fenced code block with an idiomatic example. If a section is prose-only or comment-only, either expand it with a real code example or replace the section with a one-line file reference (e.g., "see `TradeDeskMapping.cs` for the mapping pattern").
+
+   **Root deduplication** — for each root conditional block, verify it is NOT summarizing a layer that has its own subfolder CLAUDE.md. If it is, remove the block. For cross-layer vertical-slice checklists, verify each step references the relevant subfolder file ("see X CLAUDE.md") rather than inlining the procedure.
+
+   **Frontend/UI conditional coverage** — for each frontend/UI layer, list every repeatable workflow the pattern-finder reported (components, services, pages/routes, directives, pipes, hooks, stores — whatever was detected). Then compare that list against the drafted `<important if>` conditional sections. Any workflow on the list without a matching conditional is a gap — draft and add the missing section before proceeding.
+
+   After fixing all violations, re-scan the corrected drafts to confirm every check passes. Only proceed to writing when all checks are clean. Present a brief summary of what was fixed:
+   ```
+   ## Self-review results
+   - {file}: removed 2 utility deps (moment, xlsx-js-style)
+   - {file}: grouped Module Structure from 11 → 6 entries
+   - {file}: added "Adding a New Service" conditional
+   - Root: no violations found
+   ```
+
+9. **Pass 3 — Write all CLAUDE.md files:**
+   - Write ALL files at once using the Write tool
+   - Do NOT ask for confirmation before each file — batch mode
+   - After writing, present a summary:
+     ```
+     ## CLAUDE.md Files Created
+
+     | File | Lines | Description |
+     |------|-------|-------------|
+     | CLAUDE.md | 45 | Root project overview |
+     | src/core/CLAUDE.md | 78 | Core domain layer |
+     | src/services/CLAUDE.md | 65 | Service layer |
+     | {etc.} | | |
+
+     Total: {N} files created/updated
+
+     Please review the files and let me know if you'd like any adjustments.
+     ```
+
+10. **Handle Follow-ups:**
+    - **Edit in-place.** If the user requests changes to specific files, edit them directly using the Edit tool — CLAUDE.md files are pure markdown, no frontmatter to bump.
+    - **Re-dispatch narrowly.** If the user wants additional folders annotated, run a targeted Pass 2 (analyzer + pattern finder) for those folders, then write.
+    - **Removals.** If the user wants a file removed, note that they can delete it themselves — annotate does not delete.
+    - **When to re-invoke instead.** Re-run `/skill:annotate-inline` for project-wide refresh after major architectural changes; for single-folder updates, prefer in-place edits.
+
+## Root CLAUDE.md Template (compact):
+
+Read the full template at `templates/root-claude-md.md`.
+
+Key principles:
+- Bare sections (Overview, Architecture, Commands, Business Context) are foundational — always included
+- Cross-cutting patterns go in `<important if>` blocks with narrow conditions
+- Deduplication rule: if a layer has a subfolder CLAUDE.md, don't summarize it in root
+- Root MAY include cross-layer vertical-slice checklists referencing subfolder files
+
+### Root CLAUDE.md Reference Examples
+
+See `examples/root-nodejs-monorepo.md` (Node.js monorepo) and `examples/root-dotnet-clean-arch.md` (.NET Clean Architecture) for well-formed root CLAUDE.md examples.
+
+What makes these examples good:
+- **Bare sections** (Overview, Project map, Commands) are relevant to nearly every task — no wrapper needed
+- **Each `<important if>` has a narrow trigger** — "adding a new API endpoint" not "writing backend code"
+- **No linter territory** — formatting rules left to tooling
+- **No code snippets** — uses file path references since patterns are better shown in subfolder CLAUDE.md files
+- **Same structure, different ecosystems** — the pattern works identically for Node.js and .NET
+
+## Subfolder CLAUDE.md Template (max 100 lines):
+
+Read the full template at `templates/subfolder-claude-md.md`.
+
+Key principles:
+- Each distinct pattern gets its own H2 section with a fenced code block
+- Module Structure: aim for 4-7 top-level entries, use architectural annotations
+- Conditional sections (`<important if>`) are optional — only for detected repeatable workflows
+- Conditional sections do NOT count toward the 100-line budget
+
+### Reference Examples
+
+See the following for well-formed subfolder CLAUDE.md examples:
+- `examples/subfolder-database-layer.md` — Database layer (~80 lines)
+- `examples/subfolder-schemas-layer.md` — Schemas layer (~70 lines)
+- `examples/subfolder-dotnet-application.md` — .NET Application layer (~65 lines)
+
+### What makes these examples good:
+- **Module Structure**: Compact, uses architectural annotations, groups related files on one line
+- **Patterns as H2 sections**: Each pattern has a descriptive name, NOT a generic umbrella
+- **Code examples are idiomatic**: Generalized to show the pattern's shape
+- **Cross-boundary patterns**: Shows both sides of layer boundaries
+- **Concise**: All fit well within 100 lines
+- **Conditional blocks**: Wrap scenario-specific recipes with narrow conditions
+
+## CLAUDE.md Depth Rules:
+
+**CREATE CLAUDE.md when:**
+- Folder represents a distinct **architectural layer** (core, services, database, redis, ipc)
+- Folder contains **unique organizational logic** not captured by parent
+- Subfolder has **different patterns/constraints** than parent (e.g., `database/repositories/` vs `database/`)
+- Folder has **its own responsibility** (e.g., `database/migrations/`)
+- Folder is a **composite application root** (e.g., SPA, monorepo package) whose children represent distinct sub-layers with different patterns — apply Depth Rules recursively to its children
+
+**SKIP CLAUDE.md when:**
+- Folder only groups entities/DTOs by domain boundary following the same pattern
+- Folder content is fully described by parent CLAUDE.md
+- Folder is a simple grouping without unique constraints
+
+## Important notes:
+- Parallel Agent dispatch — every `Agent(...)` call in the same assistant message (multiple tool_use blocks in one response), never one per turn. Call shape: `Agent({ subagent_type: "<agent-name>", description: "<3-5 word task label>", prompt: "<task>" })`.
+- **File reading**: Always read mentioned files FULLY (no limit/offset) before invoking skills
+- **Critical ordering**: Follow the numbered steps exactly
+  - ALWAYS read mentioned files first before invoking skills (step 1)
+  - ALWAYS wait for all skills in a pass to complete before proceeding to the next step
+  - NEVER write CLAUDE.md files with placeholder values — all content must come from skill findings
+  - NEVER proceed to Pass 2 without user confirmation of target locations
+  - NEVER skip the developer checkpoint (step 6) — developer input is the highest-value signal for CLAUDE.md quality
+  - NEVER draft CLAUDE.md content before completing the developer checkpoint
+- **.gitignore compliance**: Skip directories excluded by .gitignore (node_modules, dist, build, .git, vendor, etc.)
+- **Batch output mode**: Write all CLAUDE.md files at once in Pass 3, do not ask for per-file confirmation
+- **Existing CLAUDE.md handling**: If a CLAUDE.md already exists at any target location, replace it entirely using the Write tool
+- **Line budget**: Subfolder CLAUDE.md files must not exceed 100 lines — code examples in Key Patterns are mandatory, keep them idiomatic and concise
+- **No frontmatter**: CLAUDE.md files are pure markdown, no YAML frontmatter
+- Keep the main agent focused on synthesis, not deep file reading — delegate analysis to sub-agents
--- a/extensions/rpiv-pi/skills/annotate-inline/examples/root-dotnet-clean-arch.md
+++ b/extensions/rpiv-pi/skills/annotate-inline/examples/root-dotnet-clean-arch.md
@@ -0,0 +1,38 @@
+# CLAUDE.md
+
+ASP.NET Core 8 Web API with Clean Architecture (CQRS + MediatR).
+
+## Project map
+
+- `src/Api/` - ASP.NET Core controllers, middleware, DI setup
+- `src/Application/` - MediatR handlers, validators, DTOs
+- `src/Domain/` - Entities, value objects, domain events
+- `src/Infrastructure/` - EF Core, external services, file storage
+- `tests/` - Unit and integration tests
+
+## Commands
+
+| Command | What it does |
+|---|---|
+| `dotnet build` | Build solution |
+| `dotnet test` | Run all tests |
+| `dotnet run --project src/Api` | Start API locally |
+| `dotnet ef migrations add <Name> -p src/Infrastructure` | Create EF migration |
+| `dotnet ef database update -p src/Infrastructure` | Apply migrations |
+
+<important if="you are adding a new API endpoint">
+- Add controller in `Api/Controllers/` inheriting `BaseApiController`
+- Add command/query + handler + validator in `Application/Features/`
+- See `Application/Features/Orders/Commands/CreateOrder/` for the pattern
+</important>
+
+<important if="you are adding or modifying EF Core migrations or database schema">
+- Entities configured via `IEntityTypeConfiguration<T>` in `Infrastructure/Persistence/Configurations/`
+- Always create a migration after schema changes — never modify existing migrations
+</important>
+
+<important if="you are writing or modifying tests">
+- Unit tests: xUnit + NSubstitute, one test class per handler
+- Integration tests: `WebApplicationFactory<Program>` with test database
+- See `tests/Application.IntegrationTests/TestBase.cs` for setup
+</important>
--- a/extensions/rpiv-pi/skills/annotate-inline/examples/root-nodejs-monorepo.md
+++ b/extensions/rpiv-pi/skills/annotate-inline/examples/root-nodejs-monorepo.md
@@ -0,0 +1,42 @@
+# CLAUDE.md
+
+Express API + React frontend in a Turborepo monorepo.
+
+## Project map
+
+- `apps/api/` - Express REST API
+- `apps/web/` - React SPA
+- `packages/db/` - Prisma schema and client
+- `packages/ui/` - Shared component library
+- `packages/config/` - Shared configuration
+
+## Commands
+
+| Command | What it does |
+|---|---|
+| `turbo build` | Build all packages |
+| `turbo test` | Run all tests |
+| `turbo lint` | Lint all packages |
+| `turbo dev` | Start dev server |
+| `turbo db:generate` | Regenerate Prisma client after schema changes |
+| `turbo db:migrate` | Run database migrations |
+
+<important if="you are adding or modifying API routes">
+- All routes go in `apps/api/src/routes/`
+- Use Zod for request validation — see `apps/api/src/routes/connections.ts` for the pattern
+- Error responses follow RFC 7807 format
+- Authentication via JWT middleware
+</important>
+
+<important if="you are writing or modifying tests">
+- API: Jest + Supertest, Frontend: Vitest + Testing Library
+- Test fixtures in `__fixtures__/` directories
+- Use `createTestClient()` helper for API integration tests
+- Mock database with `prismaMock` from `packages/db/test`
+</important>
+
+<important if="you are working with client-side state, stores, or data fetching">
+- Zustand for global client state
+- React Query for server state
+- URL state via `nuqs`
+</important>
--- a/extensions/rpiv-pi/skills/annotate-inline/examples/subfolder-database-layer.md
+++ b/extensions/rpiv-pi/skills/annotate-inline/examples/subfolder-database-layer.md
@@ -0,0 +1,81 @@
+# Database Layer Architecture
+
+## Responsibility
+SQLite persistence with better-sqlite3, repository pattern (plain types), QueryQueue concurrency, type transformations.
+
+## Dependencies
+- **better-sqlite3**: Native SQLite (requires rebuild for Electron)
+- **@redis-ui/core**: Domain types
+- **p-queue**: Query serialization
+
+## Consumers
+- **@redis-ui/services**: Repositories via RepositoryFactory
+- **Main process**: DatabaseManager initialization
+
+## Module Structure
+```
+src/
+├── DatabaseManager.ts, QueryQueue.ts   # Singleton, concurrency
+├── BaseRepository.ts, RepositoryFactory.ts
+├── schema.ts
+└── repositories/                        # One repo per entity
+```
+
+## Repository Boundary (CRITICAL: Plain Types, NOT Result<T>)
+
+```typescript
+export class ConnectionRepository extends BaseRepository<ConnectionDB, Connection, ConnectionId> {
+  protected toApplication(db: ConnectionDB): Connection {
+    return {
+      id: ConnectionId.create(db.id),
+      host: db.host,
+      port: db.port,
+      sslEnabled: Boolean(db.ssl_enabled),     // DB int → boolean
+      createdAt: new Date(db.created_at),      // timestamp → Date
+    };
+  }
+
+  async findById(id: ConnectionId): Promise<Connection | null> {
+    return this.queue.enqueueRead((db) => {
+      const row = db.prepare('SELECT * FROM connections WHERE id = ?').get(id);
+      return row ? this.toApplication(row) : null;
+    });
+  }
+}
+
+// Service: Wraps repository in Result<T>
+async createConnection(input: CreateInput): Promise<Result<Connection>> {
+  try {
+    const connection = await this.connectionRepo.create(input);
+    return Result.ok(connection);
+  } catch (error) {
+    return Result.fail(new InfrastructureError(error.message));
+  }
+}
+```
+
+## QueryQueue Pattern (Write Serialization)
+
+```typescript
+export class QueryQueue {
+  private writeQueue = new PQueue({ concurrency: 1 }) // Single writer
+  private readQueue = new PQueue({ concurrency: 5 })  // Multiple readers
+
+  async enqueueWrite<T>(op: (db: Database) => T): Promise<T> {
+    return this.writeQueue.add(() => op(this.db))
+  }
+}
+```
+
+## Architectural Boundaries
+- **NO Result<T> in repos**: Services wrap with Result
+- **NO unqueued DB ops**: Always use QueryQueue
+- **NO raw SQL in services**: Use repositories
+
+<important if="you are adding a new repository to this layer">
+## Adding a New Repository
+1. Create `XRepository.ts` extending `BaseRepository<XDB, X, XId>`
+2. Implement `toApplication()` and `toDatabase()` type mappers
+3. Register in `RepositoryFactory`
+4. Add table schema in `schema.ts`
+</important>
--- a/extensions/rpiv-pi/skills/annotate-inline/examples/subfolder-dotnet-application.md
+++ b/extensions/rpiv-pi/skills/annotate-inline/examples/subfolder-dotnet-application.md
@@ -0,0 +1,64 @@
+# Application Layer (CQRS + MediatR)
+
+## Responsibility
+Command/query handlers orchestrating domain logic via MediatR pipeline. Sits between API controllers and Domain layer.
+
+## Dependencies
+- **MediatR**: Command/query dispatch
+- **FluentValidation**: Request validation via pipeline behavior
+- **AutoMapper**: Domain ↔ DTO mapping
+
+## Consumers
+- **API Controllers**: Send commands/queries via `IMediator`
+- **Integration tests**: Direct handler invocation
+
+## Module Structure
+```
+Application/
+├── Common/
+│   ├── Behaviors/          # MediatR pipeline (validation, logging)
+│   └── Mappings/           # AutoMapper profiles
+├── Features/               # One folder per aggregate
+│   └── Orders/
+│       ├── Commands/       # CreateOrder/, UpdateOrder/ (handler + validator + DTO)
+│       └── Queries/        # GetOrder/, ListOrders/
+└── DependencyInjection.cs  # Service registration
+```
+
+## Handler Pattern (Command with Validation)
+
+```csharp
+public record CreateOrderCommand(string CustomerId, List<LineItemDto> Items)
+    : IRequest<Result<OrderDto>>;
+
+public class CreateOrderValidator : AbstractValidator<CreateOrderCommand> {
+    public CreateOrderValidator(IOrderRepository repo) {
+        RuleFor(x => x.CustomerId).NotEmpty();
+        RuleFor(x => x.Items).NotEmpty();
+    }
+}
+
+public class CreateOrderHandler : IRequestHandler<CreateOrderCommand, Result<OrderDto>> {
+    public async Task<Result<OrderDto>> Handle(
+        CreateOrderCommand request, CancellationToken ct) {
+        var order = Order.Create(request.CustomerId, request.Items); // Domain factory
+        await _repo.AddAsync(order, ct);
+        await _unitOfWork.SaveChangesAsync(ct);
+        return Result.Ok(_mapper.Map<OrderDto>(order));
+    }
+}
+```
+
+## Architectural Boundaries
+- **NO domain logic in handlers**: Handlers orchestrate, domain objects contain logic
+- **NO direct DbContext access**: Use repository abstractions
+- **NO cross-feature references**: Features are independent vertical slices
+
+<important if="you are adding a new feature or command/query handler">
+## Adding a New Feature
+1. Create folder under `Features/{Aggregate}/{Commands|Queries}/`
+2. Add `Command`/`Query` record implementing `IRequest<Result<TDto>>`
+3. Add `Validator` extending `AbstractValidator<TCommand>`
+4. Add `Handler` implementing `IRequestHandler<TCommand, Result<TDto>>`
+5. Add AutoMapper profile in `Common/Mappings/` if new DTO
+</important>
--- a/extensions/rpiv-pi/skills/annotate-inline/examples/subfolder-schemas-layer.md
+++ b/extensions/rpiv-pi/skills/annotate-inline/examples/subfolder-schemas-layer.md
@@ -0,0 +1,50 @@
+# Schemas Layer Architecture
+
+## Responsibility
+Zod validation schemas for dual-layer validation (preload UX + main security), type inference via z.infer<>.
+
+## Dependencies
+- **zod**: Runtime validation
+
+## Consumers
+- **@redis-ui/ipc**: Main process validation (security)
+- **Preload**: Fail-fast validation (UX)
+- **TypeScript**: Type inference
+
+## Module Structure
+```
+src/
+├── connection.ts, backup.ts    # Domain schemas
+└── __tests__/                  # Validation tests
+```
+
+## Complete Schema Pattern (Types + Validation + Composition)
+
+```typescript
+export const createConnectionSchema = z.object({
+  name: z.string().min(1).max(255),
+  host: z.string().min(1),
+  port: z.number().int().min(1).max(65535),
+  password: z.string().optional(),
+  database: z.number().int().min(0).max(15).default(0),
+})
+
+// Type inference
+export type CreateConnectionInput = z.infer<typeof createConnectionSchema>
+
+// Update schema (partial + ID required)
+export const updateConnectionSchema = createConnectionSchema.partial().extend({
+  id: z.string().min(1)
+})
+```
+
+## Dual-Validation Flow
+
+```
+Renderer input → Preload (Zod parse, fail fast) → IPC → Main (Zod parse again, security)
+```
+
+## Architectural Boundaries
+- **NO any types**: Use z.unknown()
+- **NO skipping validation**: Always validate at boundaries
+- **NO business logic**: Structure validation only
--- a/extensions/rpiv-pi/skills/annotate-inline/templates/root-claude-md.md
+++ b/extensions/rpiv-pi/skills/annotate-inline/templates/root-claude-md.md
@@ -0,0 +1,46 @@
+```markdown
+# Project Overview
+{1-2 sentences: what it is, tech stack}
+
+# Architecture
+{monorepo structure tree + dependency flow diagram}
+{process architecture if applicable}
+
+# Commands
+{key commands table — always bare, never wrapped in <important if>}
+
+# Business Context
+{1-2 sentences if applicable}
+```
+
+The sections above (Overview, Architecture, Commands, Business Context) are foundational — they stay bare because they're relevant to virtually every task.
+
+Cross-cutting patterns and domain-specific conventions go in `<important if>` blocks with narrow, action-specific conditions. Do NOT group unrelated rules under a single broad condition like "you are writing or modifying code". Instead, shard by trigger.
+
+Root conditional blocks are for **cross-cutting conventions that don't belong to any single layer**. Layer-specific recipes (like "adding a new controller" or "adding a new repository") belong in the subfolder CLAUDE.md, not the root.
+
+**Deduplication rule:** If a layer has its own subfolder CLAUDE.md, do NOT add a root conditional block summarizing that layer's conventions. The subfolder file is the authoritative guide — the agent will see it when working in that directory. Root conditionals that mirror subfolder content waste attention budget and create staleness risk.
+
+Root MAY include cross-layer vertical-slice checklists (e.g., "adding a new domain entity end-to-end") that reference multiple subfolder CLAUDE.md files — but each step should point to the relevant subfolder for details, not inline them.
+
+Good root conditions — things that span multiple layers:
+
+```markdown
+<important if="you are writing or modifying tests">
+- Unit: xUnit + NSubstitute / Jest + Testing Library
+- Integration: WebApplicationFactory / Supertest
+- Test fixtures in `__fixtures__/` or `tests/Fixtures/`
+</important>
+
+<important if="you are adding or modifying database migrations">
+- Never modify existing migrations — always create new ones
+- Run `dotnet ef migrations add` / `turbo db:migrate` after schema changes
+</important>
+
+<important if="you are adding or modifying environment configuration">
+- All config via `IOptions<T>` pattern / environment variables
+- Secrets in user-secrets locally, Key Vault in production
+</important>
+```
+
+Each block should contain only rules that share the same trigger condition. If a codebase has 3 distinct convention areas, that's 3 blocks — not 1 block with a broad condition. Layer-specific checklists (adding a controller, adding a repository) go in the subfolder CLAUDE.md using `<important if="you are adding a new {entity} to this layer">`.
--- a/extensions/rpiv-pi/skills/annotate-inline/templates/subfolder-claude-md.md
+++ b/extensions/rpiv-pi/skills/annotate-inline/templates/subfolder-claude-md.md
@@ -0,0 +1,57 @@
+```markdown
+# {Layer/Component Name}
+
+## Responsibility
+{1-2 sentences: what this layer does, where it sits in architecture}
+
+## Dependencies
+{List only architectural dependencies — frameworks and libraries that shape how you write code in this layer.
+Do NOT list utility libraries discoverable from package.json/imports (e.g., lodash, moment, xlsx).
+A dependency is worth listing if it imposes patterns, constraints, or conventions on the code.}
+- **{dep}**: Why it's used
+
+## Consumers
+- **{consumer}**: How it uses this layer
+
+## Module Structure
+{Compact directory tree — aim for 4-7 top-level entries, not 15.
+Group related files on one line (e.g., "Service.ts, Handler.ts").
+Use architectural annotations for directories (e.g., "# One repo per entity", "# Domain schemas").
+DO NOT enumerate individual files inside directories — describe the convention.
+When a layer has many directories (10+), group related concerns on one line
+(e.g., "guards/, interceptors/, pipes/ — infrastructure plumbing") rather than listing each separately.
+The structure must stay valid when non-architectural files are added.}
+
+## {Pattern Name} ({Key Constraint or Characterization})
+{Each distinct pattern gets its own H2 section — NOT a generic "## Key Patterns" umbrella.
+Include a fenced code block with an idiomatic, generalized example showing:
+- Constructor / dependencies
+- Key method signatures and return types
+- Error handling / wrapping conventions
+- Inline comments for important conventions (e.g., "// throws on error — service wraps in Result")
+If a pattern spans a layer boundary, show both sides briefly.
+Multiple patterns = multiple H2 sections.}
+
+## {Additional Pattern Name}
+{Second pattern with code block if applicable}
+
+## Architectural Boundaries
+- **NO {X}**: {Why}
+- **NO {Y}**: {Why}
+
+<important if="you are adding a new {entity type} to this layer">
+## Adding a New {Entity Type}
+{Step-by-step checklist inferred from existing code:
+1. Create file following naming convention
+2. Extend/implement base class or interface
+3. Register in factory/container/index
+4. Add related artifacts (schema, test, migration)}
+</important>
+
+<important if="you are writing or modifying tests for this layer">
+## Testing Conventions
+{Test patterns, helpers, fixture locations, mocking approach — if detectable from code}
+</important>
+```
+
+Conditional sections are OPTIONAL — only include them if the pattern-finder skill detects testable patterns or clear "add new entity" workflows. Conditions must be narrow and action-specific. These sections contain checklists/recipes, not code examples (those stay in the unconditional pattern sections). Conditional sections do NOT count toward the 100-line budget for unconditional content.
--- a/extensions/rpiv-pi/skills/blueprint/SKILL.md
+++ b/extensions/rpiv-pi/skills/blueprint/SKILL.md
@@ -0,0 +1,447 @@
+---
+name: blueprint
+description: Plan complex features by decomposing them into vertical slices (one slice equals one phase) with developer micro-checkpoints between phases, producing an implement-ready phased plan in thoughts/shared/plans/. Use for complex multi-component features touching 6+ files across multiple layers when iterative review between slices is valuable. Requires a research artifact or a solutions artifact (from explore). Prefer blueprint over plan when mid-flight micro-checkpoints matter, and prefer plan when a straightforward phased breakdown is enough.
+argument-hint: [research artifact path]
+---
+
+# Plan
+
+You are tasked with planning how code will be shaped for a feature or change AND emitting an implement-ready phased plan. Decompose the feature into vertical slices (one slice = one phase), generate code slice-by-slice with developer micro-checkpoints between slices, and write the final artifact directly into `thoughts/shared/plans/` for `/skill:implement` to consume.
+
+**How it works**:
+- Read input and key source files into context (Step 1)
+- Spawn targeted research agents for depth analysis (Step 2)
+- Identify ambiguities — triage into simple decisions and genuine ambiguities (Step 3)
+- Holistic self-critique — review the combined design for gaps and contradictions (Step 4)
+- Developer checkpoint — resolve genuine ambiguities one at a time (Step 5)
+- Decompose into vertical slices holistically before generating code (Step 6)
+- Generate code slice-by-slice with developer micro-checkpoints (Step 7)
+- Verify cross-slice integration consistency (Step 8)
+- Finalize the design artifact (Step 9)
+- Review and iterate with the developer (Step 10)
+
+The final artifact is implement-ready.
+
+## Step 1: Input Handling
+
+When this command is invoked:
+
+1. **Read research artifact**:
+
+   **Research artifact provided** (argument contains a path to a `.md` file in `thoughts/`):
+   - Read the research artifact FULLY using the Read tool WITHOUT limit/offset
+   - Extract: Summary, Code References, Integration Points, Architecture Insights, Precedents & Lessons, Developer Context, Open Questions
+   - **Read the key source files from Code References** into the main context — especially hooks, shared utilities, and integration points the design will depend on. Read them FULLY. This ensures you have complete understanding before proceeding.
+   - These become starting context — no need to re-discover what exists
+   - Research Developer Context Q/As = inherited decisions (record in Decisions, never re-ask); Open Questions = starting ambiguity queue, filtered by dimension in Step 3
+
+   **No arguments provided**:
+   ```
+   I'll plan a feature iteratively from a research artifact. Please provide:
+
+   `/skill:blueprint [research artifact] [task description]`
+
+   Research artifact is required. Task description is optional.
+   ```
+   Then wait for input.
+
+2. **Read any additional files mentioned** — tickets, related designs, existing implementations. Read them FULLY before proceeding.
+
+## Step 2: Targeted Research
+
+This is NOT a discovery sweep. Focus on DEPTH (how things work, what patterns to follow) not BREADTH (where things are).
+
+1. **Spawn parallel research agents** using the Agent tool:
+
+   - Use **codebase-pattern-finder** to find existing implementations to model after — the primary template for code shape
+
+   For integration wiring (inbound refs, outbound deps, config/DI/event registration), use the `## Integration Points` section already extracted from research in Step 1. For precedent context (similar past changes, blast radius, follow-up fixes, lessons), use the `## Precedents & Lessons` section already extracted from research in Step 1. Do NOT dispatch a fresh agent to re-map either surface.
+
+   **Novel work** (new libraries, first-time patterns, no existing codebase precedent):
+   - Add **web-search-researcher** for external documentation, API references, and community patterns
+   - Instruct it to return LINKS with findings — include those links in the final design artifact
+
+   Agent prompts should focus on (labeled by target agent):
+   - **codebase-pattern-finder**: "Find the implementation pattern I should model after for {feature type}"
+
+   NOT: "Find all files related to X" — that's discovery's job, upstream of this skill. NOT: "Analyze {component} integration" — the integration surface is in research's `## Integration Points`; if a specific anchor needs deeper inspection, defer to the on-demand `codebase-analyzer` dispatch in Step 5 (correction path) or Step 7a (mid-generation gap).
+
+2. **Read all key files identified by agents** into the main context — especially the pattern templates you'll model after.
+
+3. **Wait for ALL agents to complete** before proceeding.
+
+4. **Analyze and verify understanding**:
+   - Cross-reference research findings with actual code read in Step 1
+   - Identify any discrepancies or misunderstandings
+   - Note assumptions that need verification
+   - Determine true scope based on codebase reality
+
+## Step 3: Identify Ambiguities — Dimension Sweep
+
+Walk Step 2 findings, inherited research Q/As, and carried Open Questions through six architectural dimensions that map 1:1 to the plan artifact's section coverage — the sweep guarantees downstream completeness. Add **migration** as a seventh dimension only if the feature changes persisted schema.
+
+- **Data model** — types, schemas, entities
+- **API surface** — signatures, exports, routes
+- **Integration wiring** — mount points, DI, events, config
+- **Scope** — in / explicitly deferred
+- **Verification** — tests, assertions, risk-bearing behaviors
+- **Performance** — load paths, caching, N+1 risks
+
+For each dimension, classify findings as **simple decisions** (one valid option, obvious from codebase — record in Decisions with `file:line` evidence, do not ask) or **genuine ambiguities** (multiple valid options, conflicting patterns, scope questions, novel choices — queue for Step 5). Inherited research Q/As land as simple; Open Questions filter by dimension — architectural survives, implementation-detail defers.
+
+**Pre-validate every option** before queuing it against research constraints and runtime code behavior. Eliminate or caveat options that contradict Steps 1-2 evidence. **Coverage check**: every Step 2 file read appears in at least one decision or ambiguity; every dimension is addressed (silently-resolved valid, skipped-unchecked not).
+
+## Step 4: Holistic Self-Critique
+
+Before presenting ambiguities to the developer, review the combined design picture holistically. Step 3 triages findings individually — this step checks whether they fit together as a coherent whole.
+
+**Prompt yourself:**
+- What's inconsistent, missing, or contradictory across the research findings, resolved decisions, and identified ambiguities?
+- What edge cases or failure modes aren't covered by any ambiguity or decision?
+- Do any patterns from different agents conflict when combined?
+
+**Areas to consider** (suggestive, not a checklist):
+- Requirement coverage — is every requirement from Step 1 addressed by at least one decision or ambiguity?
+- Cross-cutting concerns — do error handling, state management, or performance span multiple ambiguities without being owned by any?
+- Pattern coherence — do the simple decisions from Step 3 still hold when viewed together, or does a combination reveal a conflict?
+- Ambiguity completeness — did Step 3 miss a genuine ambiguity by treating a multi-faceted issue as simple?
+
+**Remediation:**
+- Issues you can resolve with evidence: fix in-place — reclassify simple decisions as genuine ambiguities, or resolve a genuine ambiguity as simple if holistic review provides clarity. Note what changed.
+- Issues that need developer input: add as new genuine ambiguities to the Step 5 checkpoint queue.
+- If no issues found: proceed to Step 5 with the existing ambiguity set.
+
+## Step 5: Developer Checkpoint
+
+Use the grounded-questions-one-at-a-time pattern. Use a **❓ Question:** prefix so the developer knows their input is needed. Each question must:
+- Reference real findings with `file:line` evidence
+- Present concrete options (not abstract choices)
+- Pull a DECISION from the developer, not confirm what you already found
+
+**Question patterns by ambiguity type:**
+
+- **Pattern conflict**: "Found 2 patterns for {X}: {pattern A} at `file:line` and {pattern B} at `file:line`. They differ in {specific way}. Which should the new {feature} follow?"
+- **Missing pattern**: "No existing {pattern type} in the codebase. Options: (A) {approach} modeled after {external reference}, (B) {approach} extending {existing code at file:line}. Which fits the project's direction?"
+- **Scope boundary**: "The {research/description} mentions both {feature A} and {feature B}. Should this design cover both, or just {feature A} with {feature B} deferred?"
+- **Integration choice**: "{Feature} can wire into {point A} at `file:line` or {point B} at `file:line`. {Point A} matches the {existing pattern} pattern. Agree, or prefer {point B}?"
+- **Novel approach**: "No existing {X} in the project. Options: (A) {library/pattern} — {evidence/rationale}, (B) {library/pattern} — {evidence/rationale}. Which fits?"
+
+**Critical rules:**
+- Ask ONE question at a time. Wait for the answer before asking the next.
+- Lead with the most architecturally significant ambiguity.
+- Every answer becomes a FIXED decision — no revisiting unless the developer explicitly asks.
+
+**Choosing question format:**
+
+- **`ask_user_question` tool** — when your question has 2-4 concrete options from code analysis (pattern conflicts, integration choices, scope boundaries, priority overrides). The user can always pick "Other" for free-text. Example:
+
+  > Use the `ask_user_question` tool with the following question: "Found 2 mapping approaches — which should new code follow?". Header: "Pattern". Options: "Manual mapping (Recommended)" (Used in OrderService (src/services/OrderService.ts:45) — 8 occurrences); "AutoMapper" (Used in UserService (src/services/UserService.ts:12) — 2 occurrences).
+
+- **Free-text with ❓ Question: prefix** — when the question is open-ended and options can't be predicted (discovery, "what am I missing?", corrections). Example:
+  "❓ Question: Research's `## Integration Points` shows no background job registration for this area. Is that expected, or is there async processing not surfaced in the inbound/outbound sweep?"
+
+**Batching**: When you have 2-4 independent questions (answers don't depend on each other), you MAY batch them in a single `ask_user_question` call. Keep dependent questions sequential.
+
+**Classify each response:**
+
+**Decision** (e.g., "use pattern A", "yes, follow that approach"):
+- Record in Developer Context. Fix in Decisions section.
+
+**Correction** (e.g., "no, there's a third option you missed", "check the events module"):
+- Spawn targeted rescan: **codebase-analyzer** on the new area (max 1-2 agents).
+- Merge results. Update ambiguity assessment.
+
+**Scope adjustment** (e.g., "skip the UI, backend only", "include tests"):
+- Record in Developer Context. Adjust scope.
+
+**After all ambiguities are resolved**, present a brief design summary (under 15 lines):
+
+```
+Design: {feature name}
+Approach: {1-2 sentence summary of chosen architecture}
+
+Decisions:
+- {Decision 1}: {choice} — modeled after `file:line`
+- {Decision 2}: {choice}
+- {Decision 3}: {choice}
+
+Scope: {what's in} | Not building: {what's out}
+Files: {N} new, {M} modified
+```
+
+Use the `ask_user_question` tool to confirm before proceeding. Question: "{Summary from design brief above}. Ready to proceed to decomposition?". Header: "Design". Options: "Proceed (Recommended)" (Decompose into vertical slices, then generate code slice-by-slice); "Adjust decisions" (Revisit one or more architectural decisions above); "Change scope" (Add or remove items from the building/not-building lists).
+
+## Step 6: Feature Decomposition
+
+After the design summary is confirmed, decompose the feature into vertical slices. Each slice is a self-contained unit: types + implementation + wiring for one concern.
+
+1. **Decompose holistically** — define ALL slices, dependencies, and ordering before generating any code:
+
+   ```
+   Feature Breakdown: {feature name}
+
+   Slice 1: {name} — {what this slice delivers}
+     Files: path/to/file.ext (NEW), path/to/file.ext (MODIFY)
+     Depends on: nothing (foundation)
+
+   Slice 2: {name} — {what this slice delivers}
+     Files: path/to/file.ext (NEW), path/to/file.ext (MODIFY)
+     Depends on: Slice 1
+
+   Slice 3: {name} — {what this slice delivers}
+     Files: path/to/file.ext (NEW)
+     Depends on: Slice 2
+   ```
+
+2. **Slice properties**:
+   - End-to-end vertical: each slice is a complete cross-section of one concern (types + impl + wiring)
+   - ~512-1024 tokens per slice (maps to individual file blocks)
+   - Sequential: each builds on the previous (never parallel)
+   - Foundation first: types/interfaces always Slice 1
+
+3. **Confirm decomposition** using the `ask_user_question` tool. Question: "{N} slices for {feature}. Slice 1: {name} (foundation). Slices 2-N: {brief}. Approve decomposition?". Header: "Slices". Options: "Approve (Recommended)" (Proceed to slice-by-slice code generation); "Adjust slices" (Reorder, merge, or split slices before generating); "Change scope" (Add or remove files from the decomposition).
+
+4. **Create skeleton artifact** — immediately after decomposition is approved:
+   - Determine metadata: filename `thoughts/shared/plans/YYYY-MM-DD_HH-MM-SS_topic.md`, repository name from git root, branch and commit from the git context injected at the start of the session (fallbacks: "no-branch" / "no-commit"), author from the injected User (fallback: "unknown")
+   - Timestamp: run `date +"%Y-%m-%dT%H:%M:%S%z"` — raw for `date:` and `last_updated:`, first 19 chars (`T`→`_`, `:`→`-`) for filename slug.
+   - Write skeleton using the Write tool with `status: in-progress` in frontmatter
+   - **Include all prose sections filled** from Steps 1-5: Overview, Requirements, Current State Analysis, Desired End State, What We're NOT Doing, Decisions, Ordering Constraints, Verification Notes, Performance Considerations, Migration Notes, Pattern References, Developer Context, References
+   - **Phase sections**: one `## Phase N: {slice name}` heading per slice from the decomposition (in slice order), each with `### Overview`, `### Changes Required:` (one `#### N. path/to/file.ext` subsection per file with empty code fence + NEW/MODIFY label), and `### Success Criteria:` (Automated + Manual placeholders — filled in Step 9)
+   - **Plan History section**: list all phases with `— pending` status
+   - This is the living artifact — all subsequent writes use the Edit tool
+
+   **Artifact template sections** (all required in skeleton):
+
+   - **Frontmatter**: date, author, commit, branch, repository, topic, tags, `status: in-progress`, parent, phase_count, unresolved_phase_count (initialized to phase_count, decrements as each phase's code is approved in Step 7d), last_updated, last_updated_by
+   - **# {Feature Name} Implementation Plan**
+   - **## Overview**: 2-3 sentences — what we're building and the chosen architectural approach. Settled decision, not a discussion.
+   - **## Requirements**: Bullet list from ticket, research, or developer input.
+   - **## Current State Analysis**: What exists now, what's missing, key constraints. Include `### Key Discoveries` with `file:line` references, patterns to follow, constraints to work within.
+   - **## Desired End State**: Usage examples showing the feature in use from a consumer's perspective — concrete code, not prose.
+   - **## What We're NOT Doing**: Developer-stated exclusions AND likely scope-creep vectors (alternative architectures not chosen, nearby code that looks related but shouldn't be touched).
+   - **## Decisions**: `###` per decision. Complex: Ambiguity → Explored (Option A/B with `file:line` + pro/con) → Decision. Simple: just state decision with evidence.
+   - **## Phase N: {slice name}** (one per slice, in slice order):
+     - `### Overview`: one sentence describing what this phase delivers + parallelism note from `Depends on:` (e.g., "Depends on Phase 1; can run in parallel with Phase 3.").
+     - `### Changes Required:` — one `#### N. path/to/file.ext` subsection per file in this slice. Each subsection has `**File**: path`, `**Changes**: {NEW | MODIFY — summary}`, and an empty code fence (filled in Step 7d). NEW files get full implementation. MODIFY files get only modified/added code — no "Current" block, the original is on disk.
+     - `### Success Criteria:` with `#### Automated Verification:` and `#### Manual Verification:` subsections, each containing `- [ ] TBD` placeholder bullets (filled in Step 9 from Verification Notes).
+   - **## Ordering Constraints**: What must come before what. What can run in parallel. (Carries the cross-phase view; per-phase parallelism note also lives in each Phase Overview.)
+   - **## Verification Notes**: Carry forward from research — known risks, build/test warnings, precedent lessons. Format as verifiable checks (commands, grep patterns, visual inspection). Step 9 converts these to per-phase Success Criteria.
+   - **## Performance Considerations**: Any performance implications or optimizations.
+   - **## Migration Notes**: If applicable — existing data, schema changes, rollback strategy, backwards compatibility. Empty if not applicable.
+   - **## Pattern References**: `path/to/similar.ext:line-range` — what pattern to follow and why.
+   - **## Developer Context**: Record questions exactly as asked during checkpoint, including `file:line` evidence. Also record micro-checkpoint interactions from Step 7c.
+   - **## Plan History**: Phase approval/revision log. `- Phase N: {name} — pending/approved as generated/revised: {what changed}`. implement ignores this section.
+   - **## References**: Research artifacts, tickets, similar implementations.
+
+   **Phase Changes Required format in skeleton**:
+   - **NEW files**: `#### N. path/to/file.ext` + `**File**: path` + `**Changes**: NEW — {purpose}` + empty code fence (filled with full implementation in Step 7d)
+   - **MODIFY files**: `#### N. path/to/file.ext:line-range` + `**File**: path` + `**Changes**: MODIFY — {summary}` + empty code fence (filled with only the modified code in Step 7d — no "Current" block, the original is on disk)
+
+## Step 7: Generate Slices (Iterative)
+
+Generate code one slice at a time. Each slice sees the fixed code from all previous slices.
+
+**Before slice 1**: look at the decomposition. For slices whose code shape isn't already covered by Step 2's pattern-finder result (different layer, different file kind, different concern), dispatch additional **codebase-pattern-finder** calls in parallel — one assistant message, one tool call per slice that needs its own template. Slices whose shape matches a sibling reuse that sibling's result. Hold the returned templates in context for 7a; do not re-dispatch per slice during generation.
+
+**For each slice in the decomposition (sequential order):**
+
+### 7a. Generate slice code (internal)
+
+Generate complete, copy-pasteable code for every file in this slice — but **hold it for the artifact, do NOT present full code to the developer**. The developer sees a condensed review in 7c; the full code goes into the artifact in 7d.
+
+- **New files**: complete code — imports, types, implementation, exports. Follow the pattern template from Step 2.
+- **Modified files**: read current file FULLY, generate only the modified/added code scoped to changed sections (no full "Current" block — the original is on disk)
+- **Test files**: complete test suites following project patterns
+- **Wiring**: show where new code hooks into existing code
+
+If additional context is needed, spawn a targeted **codebase-analyzer** agent.
+
+No pseudocode, no TODOs, no placeholders — the code must be copy-pasteable by implement.
+
+**Context grounding** (after slice 2): Before generating, re-read the artifact's prior `## Phase N` sections for files this slice touches (a file may appear in earlier phases; if so, this phase extends or revisits it). The artifact is the source of truth — generate code that extends what's already emitted, not what you remember from conversation.
+
+### 7b. Self-verify slice
+
+Before presenting to the developer, cross-check this slice and produce a structured summary:
+
+```
+Self-verify Slice N:
+- Decisions: {OK / VIOLATION: decision X — fix applied}
+- Cross-slice: {OK / CONFLICT: file X has inconsistent types — fix applied}
+- Research: {OK / WARNING: constraint Y not satisfied — fix applied}
+```
+
+If violations found: fix in-place before presenting. Include the self-verify summary in the 7c checkpoint presentation.
+
+### 7c. Developer micro-checkpoint
+
+Present a **condensed review** of the slice — NOT the full generated code. The developer reviews the design shape, not every line. For each file in the slice, show:
+
+1. **Summary** (1-2 sentences): what changed, what pattern used, what it connects to
+2. **Signatures**: type/interface definitions, exported function signatures with parameter and return types
+3. **Key code blocks**: factory calls, wiring, non-obvious logic — the interesting parts that show the design decision in action
+
+**Omit**: boilerplate, import lists, full function bodies, obvious implementations.
+**MODIFY files**: focused diff (`- old` / `+ new`) with ~3 lines context. **Test files**: test case names only.
+
+**If the developer asks to see full code**, show it inline — exception, not default.
+
+Use the `ask_user_question` tool to confirm. Question: "Slice {N/M}: {slice name} — {files affected}. {1-line summary}. Approve?". Header: "Slice {N}". Options: "Approve (Recommended)" (Lock this slice, write to artifact, proceed to slice {N+1}); "Revise this slice" (Adjust code before proceeding — describe what to change); "Rethink remaining slices" (This slice reveals a design issue — revisit decomposition).
+
+**Checkpoint cadence**: One slice per checkpoint. Present each slice individually, regardless of slice count.
+
+### 7d. Incorporate feedback
+
+**Approve**: Lock this slice's code and **Edit the artifact immediately**:
+1. For each file in this slice, Edit the skeleton artifact to replace the empty code fence under that file's `#### N. path/...` subsection inside this slice's `## Phase N: {slice name}` section with the full generated code from 7a
+2. If a later slice contributes to a file already filled by an earlier phase: emit a NEW `#### N. path/to/file.ext` subsection inside the later phase with only that phase's incremental changes (do NOT mutate the earlier phase's code fence — implement runs phases sequentially and the codebase state evolves between them). Each phase's code fence is the change set for that phase, applied on top of the codebase state after the previous phase.
+3. After fill, verify within this phase: no duplicate function definitions inside the same code fence, imports deduplicated, exports list complete
+4. Update the Plan History section: `- Phase N: {name} — approved as generated`
+5. Decrement frontmatter `unresolved_phase_count` by 1
+- Proceed to next slice
+
+**Revise**: Update code per developer feedback. Re-run self-verify (7b). Re-present the same slice (7c). The artifact is NOT touched — only "Approve" writes to the artifact.
+
+**Rethink**: Developer spotted a design issue. If a previously approved slice is affected, flag the conflict and offer cascade revision — developer decides whether to reopen (if yes, Edit the affected `## Phase N` entry).
+Update decomposition (add/remove/reorder remaining slices) and confirm before continuing.
+
+## Step 8: Integration Verification
+
+After all phases are complete, review cross-phase consistency:
+
+1. **Present integration summary** (under 15 lines):
+   ```
+   Integration: {feature name} — {N} phases complete
+
+   Phases: {brief list of phase names and file counts}
+   Cross-phase: {types consistent / imports valid / wiring complete}
+   Research constraints: {all satisfied / N violations noted}
+   ```
+
+2. **Verify research constraints**: Check each Precedent & Lesson and Verification Note from the research artifact against the generated code. List satisfaction status.
+
+3. **Confirm using the `ask_user_question` tool**. Question: "{N} phases complete, {M} files total. Cross-phase consistency verified. Proceed to finalize?". Header: "Verify". Options: "Proceed (Recommended)" (Finalize the plan artifact (fill Success Criteria, update status)); "Revisit phase" (Reopen a specific phase for revision — Edit the artifact after); "Add missing" (A file or integration point is missing — add to artifact).
+
+## Step 9: Finalize Plan Artifact
+
+The artifact was created as a skeleton in Step 6 and filled progressively in Step 7d. This step fills per-phase Success Criteria and finalizes.
+
+1. **Verify all Phase code fences are filled**: Every `#### N. path/...` subsection inside every `## Phase N` must have a non-empty code block. If any are still empty (e.g., a slice was skipped), generate and fill them now.
+
+2. **Fill per-phase Success Criteria from Verification Notes**. For each `## Phase N` section, replace the placeholder bullets in `### Success Criteria:` with concrete checks derived from this phase's scope and the artifact's `## Verification Notes`:
+
+   - `#### Automated Verification:` — start with project-standard baseline (`npm run check`, `npm test`) and add phase-specific automated checks: file existence (`test -f path`), grep patterns from Verification Notes (`grep -r "pattern" packages/ | wc -l` returns expected count), test names that should now pass, type-check / lint scoped to changed files.
+   - `#### Manual Verification:` — observable behaviors that can't be automated: UI/UX checks, performance under real load, edge cases requiring human judgment, precedent-lesson manual checks. Pull from Verification Notes that are visual or behavioral, scoped to what this phase delivers.
+
+   Convert prose Verification Notes by phase ownership: a constraint that lands inside Phase N's scope becomes a Phase N criterion. Cross-phase constraints (e.g., "production build still succeeds") repeat across the relevant terminal phases.
+
+   **Format** — each entry is a `- [ ]` markdown checkbox; commands wrapped in backticks. `implement` flips `- [ ]` to `- [x]` as it completes each criterion; `validate` extracts and runs each command listed under `#### Automated Verification:`. The example below illustrates the format only — actual per-phase content and bullet counts come from the guidance above (phase scope + `## Verification Notes`).
+
+   ```markdown
+   ### Success Criteria:
+
+   #### Automated Verification:
+   - [ ] Type checking passes: `npm run check`
+   - [ ] Tests pass: `npm test`
+   - [ ] Grep pattern from Verification Note: `grep -r "newApi" packages/ | wc -l` returns >= 3
+
+   #### Manual Verification:
+   - [ ] New widget renders correctly above the editor
+   - [ ] Performance acceptable with 1000+ todo items
+   ```
+
+3. **Verify frontmatter counters**:
+   - `unresolved_phase_count == 0` (every phase approved in Step 7d)
+   - `phase_count` matches the number of `## Phase N` sections
+
+   If any check fails, return to Step 7 for the unresolved phase. Do NOT flip status to ready.
+
+4. **Update frontmatter** via Edit: set `status: ready`. `last_updated` and `last_updated_by` were set at skeleton creation — leave as-is.
+
+5. **Verify template completeness**: Ensure all sections from the template reference in Step 6 are present and filled. Edit to fix any gaps.
+
+6. **Phase Changes Required format reminder**:
+   - **NEW files**: `#### N. path/to/file.ext` + `**File**` + `**Changes**: NEW — {purpose}` + full implementation code block
+   - **MODIFY files**: `#### N. path/to/file.ext:line-range` + `**File**` + `**Changes**: MODIFY — {summary}` + code block with only the modified/added code (no "Current" block — the original is on disk, implement reads it)
+
+## Step 10: Review & Iterate
+
+1. **Present the plan artifact location**:
+   ```
+   Implementation plan written to:
+   `thoughts/shared/plans/{filename}.md`
+
+   {N} architectural decisions fixed, {P} phases generated, {M} new files, {K} existing files modified.
+   {R} revisions during generation.
+
+   Please review and let me know:
+   - Are the architectural decisions correct?
+   - Does the code match what you envision?
+   - Any missing integration points or edge cases?
+
+   ---
+
+   💬 Follow-up: describe the change in chat to append a timestamped Follow-up section to this artifact. Re-run `/skill:blueprint` for a fresh artifact.
+
+   **Next step:** `/skill:implement thoughts/shared/plans/{filename}.md Phase 1` — start execution at Phase 1 (omit `Phase 1` to run all phases sequentially).
+
+   > 🆕 Tip: start a fresh session with `/new` first — chained skills work best with a clean context window.
+   ```
+
+## Step 11: Handle Follow-ups
+
+- **Edit in-place.** Use the Edit tool to update the plan artifact directly — phase code stays one source of truth.
+- **Bump frontmatter.** Update `last_updated` + `last_updated_by`; set `last_updated_note: "Updated <brief description>"`.
+- **Sync decisions ↔ code.** If the change affects decisions, update both the Decisions section AND the affected `## Phase N` code. Code is source of truth — if they conflict, the code wins, update the prose.
+- **Return to checkpoint on new ambiguities.** If new ambiguities surface, return to Step 5 (developer checkpoint) before regenerating phases.
+- **When to re-invoke instead.** For surgical edits to a specific phase, prefer `/skill:revise <plan-path>` over re-running the full skill. Re-run `/skill:blueprint` only when the underlying research changed materially. The previous block's `Next step:` stays valid for the existing plan.
+
+## Guidelines
+
+1. **Be Architectural**: Design shapes code; plans sequence work. Every decision must be grounded in `file:line` evidence from the actual codebase.
+
+2. **Be Interactive**: Don't produce the full design in one shot. Resolve ambiguities through the checkpoint first, get buy-in on the approach, THEN decompose and generate slice-by-slice.
+
+3. **Be Complete**: Code in every `## Phase N` `### Changes Required:` block must be copy-pasteable by implement. No pseudocode, no TODOs, no "implement here" placeholders. If you can't write complete code, an ambiguity wasn't resolved.
+
+4. **Be Skeptical**: Question vague requirements. If an existing pattern doesn't fit the new feature, say so and propose alternatives. Don't force a pattern where it doesn't belong.
+
+5. **Resolve Everything**: No unresolved questions in the final artifact. If something is ambiguous, ask during the checkpoint or micro-checkpoint. The plan must be complete enough that implement can execute each phase end-to-end without re-asking.
+
+6. **Present Condensed, Persist Complete**: Micro-checkpoints show the developer summaries, signatures, and key code blocks. The artifact always contains full copy-pasteable code. If the developer asks to see full code, show it — but never default to walls of code in checkpoints.
+
+## Subagent Usage
+
+| Context | Agents Spawned |
+|---|---|
+| Default (research artifact provided) | codebase-pattern-finder |
+| Novel work (new library/pattern) | + web-search-researcher |
+| Step 5 correction path (developer flags missed area) | targeted codebase-analyzer (max 1-2) |
+| Step 7a mid-generation gap (specific anchor unclear) | targeted codebase-analyzer (max 1) |
+
+Spawn multiple agents in parallel when they're searching for different things. Each agent runs in isolation — provide complete context in the prompt, including specific directory paths when the feature targets a known module. Don't write detailed prompts about HOW to search — just tell it what you're looking for and where.
+
+## Important Notes
+
+- **Always chained**: This skill requires a research artifact produced by the research skill.
+- **File reading**: Always read research artifacts and referenced files FULLY (no limit/offset) before spawning agents
+- **Critical ordering**: Follow the numbered steps exactly
+  - ALWAYS read input files first (Step 1) before spawning agents (Step 2)
+  - ALWAYS wait for all agents to complete before identifying ambiguities (Step 3)
+  - ALWAYS resolve all ambiguities (Step 5) before decomposing into slices (Step 6)
+  - ALWAYS complete holistic decomposition before generating any slice code (Step 7)
+  - ALWAYS create the skeleton artifact immediately after decomposition approval (Step 6)
+  - NEVER leave Phase code fences empty after their slice is approved — fill via Edit in Step 7d
+- NEVER skip the developer checkpoint — developer input on architectural decisions is the highest-value signal in the planning process
+- NEVER edit source files — all code goes into the plan document, not the codebase. This skill produces a document, not implementation. Source file editing is implement's job.
+- **Code is source of truth** — if a `## Phase N` code block conflicts with the Decisions prose, the code wins. Update the prose.
+- **Checkpoint recordings**: Record micro-checkpoint interactions in Developer Context with `file:line` references, same as Step 5 questions.
+- **Frontmatter consistency**: Always include frontmatter, use snake_case for multi-word fields, keep tags relevant
+
+## Common Planning Patterns
+
+- **New Features**: types first → backend logic → API surface → UI last. Research existing patterns first. Include tests alongside each implementation.
+- **Modifications**: Read current file FULLY. Show only the modified/added code scoped to changed sections. Check integration points for side effects.
+- **Database Changes**: schema/migration → store/repository → business logic → API → client. Include rollback strategy.
+- **Refactoring**: Document current behavior first. Plan incremental backwards-compatible changes. Verify existing behavior preserved.
+- **Novel Work**: Include approach comparison in Decisions. Ground in codebase evidence OR web research. Get explicit developer sign-off BEFORE writing code.
--- a/extensions/rpiv-pi/skills/changelog/SKILL.md
+++ b/extensions/rpiv-pi/skills/changelog/SKILL.md
@@ -0,0 +1,164 @@
+---
+name: changelog
+description: Regenerate the [Unreleased] section of every affected CHANGELOG.md in Keep a Changelog style. Reads commits since the last release tag plus any uncommitted or staged changes, classifies them by Conventional Commit prefix, and rewrites each [Unreleased] block. Works in single-package repos and monorepos (one CHANGELOG.md per package). Use when preparing a release or drafting changelog entries. Idempotent — safe to re-run as work lands.
+argument-hint: [--since <ref>]
+allowed-tools: Bash(git *), Read, Edit
+---
+
+# Generate CHANGELOG entries
+
+You are tasked with regenerating the `## [Unreleased]` section of every affected `CHANGELOG.md` in the repository so it reflects all change since the last release tag — committed and uncommitted alike.
+
+## Range hint
+
+`$ARGUMENTS` (empty/literal → range starts at the last release tag from `git describe --tags --abbrev=0`)
+
+## Workflow
+
+1. Bail-out checks
+2. Determine the change range
+3. Determine each CHANGELOG's scope and collect commits + uncommitted hunks
+4. Classify and draft entries
+5. Preview and confirm
+6. Apply
+
+## Step 1: Bail-out checks
+
+1. Run `git rev-parse --is-inside-work-tree`. If not a git repo, tell the user "This directory is not a git repository." and stop.
+2. Run `git ls-files 'CHANGELOG.md' '**/CHANGELOG.md'` to discover every tracked changelog. If zero results, tell the user "No `CHANGELOG.md` found in the repository — create one (root or per-package) before running this skill." and stop.
+3. Run `git describe --tags --abbrev=0` to confirm at least one release tag exists. If none, ask the user to supply `--since <ref>` and stop until they do.
+
+## Step 2: Determine the change range
+
+1. Parse `$ARGUMENTS` for a `--since <ref>` flag. If absent, set `SINCE=$(git describe --tags --abbrev=0)`.
+2. The range is `$SINCE..HEAD` for committed changes, plus the current uncommitted+staged working tree.
+
+## Step 3: Determine each CHANGELOG's scope, then collect commits + uncommitted hunks
+
+Each `CHANGELOG.md` discovered in Step 1.2 owns a path scope:
+
+- **Nested CHANGELOG** (e.g. `packages/foo/CHANGELOG.md`, `apps/web/CHANGELOG.md`): scope is its parent directory — `packages/foo/`, `apps/web/`.
+- **Root CHANGELOG** (`CHANGELOG.md` at repo root):
+  - If no nested CHANGELOGs exist: scope is the entire repository.
+  - If nested CHANGELOGs also exist: scope is the repository **excluding** every directory that owns a nested CHANGELOG. The root file captures repo-wide change (CI, build config, root README) that no per-package file would claim.
+
+For each scope:
+
+1. Committed: `git log $SINCE..HEAD --pretty=format:"%H%x09%s%x09%b%x1e" -- <scope>`. For root-with-exclusions, pass `:(exclude)<dir>` pathspecs for every nested-CHANGELOG directory. Records are `\x1e`-delimited; parse subject (`%s`) and body (`%b`).
+2. Uncommitted: `git diff HEAD -- <scope>` and `git diff --cached -- <scope>` with the same pathspec rules. Treat the union as a single virtual "pending" change set with no commit message — the model classifies it from the diff itself.
+3. Skip CHANGELOGs whose scope has no committed and no uncommitted changes in range.
+
+## Step 4: Classify and draft entries
+
+For each affected CHANGELOG, produce entries grouped under the Keep a Changelog 1.1.0 sections, in this order: `Added`, `Changed`, `Deprecated`, `Removed`, `Fixed`, `Security`, `Performance`. Append a `Breaking / Upgrade Notes` section only when a breaking change exists.
+
+### Conventional Commit → section mapping
+
+- `feat:` → **Added**
+- `fix:` → **Fixed**
+- `perf:` → **Performance**
+- `refactor:`, `style:`, `build:`, `ci:`, `chore:` → **Changed**
+- `docs:` → **Changed** (only if user-facing docs; skip internal `thoughts/` or research notes)
+- `test:` → omit (not user-visible)
+- `revert:` → **Changed** (note what was reverted)
+
+### Always-skip commits
+
+Skip any commit whose subject matches one of these — they are release pipeline housekeeping, not user-visible change:
+
+- `Release v<x.y.z>` or `chore(release): v<x.y.z>` (common release-bot patterns)
+- `Add [Unreleased] section for next cycle`
+- Version-only bumps with no other content (`<x.y.z>` as the entire subject)
+- Merge commits with no diff content of their own
+
+### Breaking change detection
+
+Flag a commit as breaking if any of these are true:
+
+- The type has a `!` suffix (`feat!:`, `refactor!:`, etc.)
+- The commit body contains a `BREAKING CHANGE:` footer
+- The diff removes or renames an exported symbol, removes a CLI flag, or removes a public file
+
+For each breaking change, add an entry to **Breaking / Upgrade Notes** in addition to the regular section, written as a one-line upgrade instruction.
+
+### Style rules — match Keep a Changelog 1.1.0 prose
+
+- One short user-facing sentence per entry. Imperative mood ("Add", "Fix", "Remove").
+- Write for the plugin's **users**, not its maintainers. No internal symbol names, file paths, regex literals, or precedent commit hashes inside entries.
+- If a feature has a user-visible name (a slash command, a CLI flag, a skill name), name it in backticks. Example: `` Added `--locale` flag for per-invocation language override. ``
+- Group entries by category, not by commit. Merge duplicate-topic commits into one entry.
+- If a commit reverses something earlier in the same `[Unreleased]` window (e.g. add → remove → add-back), reflect only the net effect.
+- Skip entries that have zero user-visible impact: dependency bumps with no behavior change, internal refactors invisible to users, test additions, type-only changes.
+
+### Worked example
+
+Input commits in `packages/api/`:
+
+```
+abc1234 feat(api): add /v2/search endpoint with cursor pagination
+def5678 feat(api): support webhook retries with exponential backoff
+ghi9abc fix(api): rotate session secret on every JWT refresh
+jkl0def docs(api): document rate-limit headers in OpenAPI spec
+mno1234 chore(deps): bump @types/node to 20.11
+pqr5678 test(api): coverage for cursor edge cases
+stu9abc refactor(api): inline httpClient factory (no behavior change)
+```
+
+Output `[Unreleased]`:
+
+```markdown
+## [Unreleased]
+
+### Added
+- `/v2/search` endpoint with cursor-based pagination.
+- Webhook delivery retries with exponential backoff.
+
+### Changed
+- OpenAPI spec documents rate-limit response headers.
+
+### Fixed
+- JWT refresh rotates the session secret on every renewal.
+```
+
+What this example demonstrates:
+
+- Two `feat:` commits → two **Added** entries (one per user-visible feature).
+- `docs:` for a user-facing API spec → **Changed** (skip if the docs touched were internal notes).
+- `fix:` → **Fixed**, written as the corrected behavior in imperative mood, not as the bug.
+- `chore(deps):` with no behavior change → omitted.
+- `test:` → omitted (not user-visible).
+- `refactor:` flagged "no behavior change" → omitted (the rule is user-visible impact, not commit type).
+- Commit hashes never appear in entries.
+
+## Step 5: Preview and confirm
+
+1. Print a per-CHANGELOG summary: file path, count by section, breaking-change flag.
+2. Print the proposed `[Unreleased]` body for each affected CHANGELOG, in full.
+3. Call `ask_user_question`:
+   - Question: "Apply regenerated `[Unreleased]` to {N} CHANGELOG(s)?"
+   - Header: "Changelog"
+   - Options:
+     - "Apply (Recommended)" — Write the regenerated sections to disk. Refinement, if needed, happens afterward in normal chat (`Edit` tool) or via `git restore` to roll back.
+     - "Show Preview" — For each affected CHANGELOG, render a unified diff between the **current** `[Unreleased]` body on disk and the **proposed** regenerated body. Lines marked `-` are about to be removed; lines marked `+` are about to be added. After printing, re-ask this same question.
+
+## Step 6: Apply
+
+For each affected CHANGELOG:
+
+1. Read the file.
+2. Locate the `## [Unreleased]` heading. The block runs from that heading up to (but not including) the next `## [` heading — or end of file if no later version exists. If no `## [Unreleased]` heading exists, insert one above the first `## [` heading (or after the file's intro prose if no version sections exist yet).
+3. Use `Edit` to replace the entire block with `## [Unreleased]\n\n` followed by the regenerated sections.
+4. **Never** touch any heading below `[Unreleased]`. Released version sections are immutable.
+
+After all writes complete, print the list of modified files and remind the user to commit them before invoking their release pipeline — most release scripts require a clean working tree.
+
+## Important Notes
+
+- ALWAYS preview before writing. Never apply without the user's `ask_user_question` confirmation.
+- ALWAYS replace the full `[Unreleased]` body, not append. The skill is idempotent regeneration, not accumulation.
+- NEVER modify released version sections (anything below the first `## [x.y.z]` heading).
+- NEVER write Conventional Commit prefixes (`feat:`, `fix:`, etc.) into the changelog body. They classify the entry; they don't appear in the prose.
+- NEVER include commit hashes, PR numbers, or author names in entries. The audience is end users, not git archaeologists.
+- NEVER pick or suggest a version number. The release pipeline owns the bump.
+- NEVER invoke a release script from this skill. Authoring is a separate step from releasing.
+- If a CHANGELOG has changes in the range but every commit is omit-worthy by the style rules (test-only, type-only, internal refactor), leave its `[Unreleased]` body empty — do not invent entries.
--- a/extensions/rpiv-pi/skills/code-review/SKILL.md
+++ b/extensions/rpiv-pi/skills/code-review/SKILL.md
@@ -0,0 +1,504 @@
+---
+name: code-review
+description: "Conduct comprehensive code reviews of pending changes, a branch, or a PR using parallel specialist agents that audit the diff, compare against peer code, and verify claims. Use when the user asks to 'review this', wants pending changes, a PR, a branch, or a diff reviewed, or asks for a code review. Produces review documents in thoughts/shared/reviews/. Internal mechanics like row-only agent contracts and Gap-Finder set arithmetic are documented in the skill body."
+argument-hint: "[scope]"
+---
+
+# Code Review
+
+Scope: $ARGUMENTS
+
+Review changes across **Quality**, **Security**, **Dependencies** lenses with optional advisor adjudication. Valid scopes: `commit` | `staged` | `working` | hash | `A..B` | PR branch. **Empty scope defaults to feature-branch-vs-default-branch first-parent review** (default branch auto-detected; see Step 1).
+
+**How it works**:
+- Step 1 — resolve scope, read diff (with `-U30` context), derive flags, build semantic file map
+- Step 2 — dispatch Wave-1: integration + precedents + (deps/CVE) + (peer-mirror); integration & peer-mirror gate Wave-2, precedents gates Step 5
+- Step 3 — dispatch Wave-2: Quality + Security lenses, file-oriented
+- Step 4 — dispatch Wave-3: Predicate-Trace + Interaction Sweep + Gap-Finder, all gated
+- Step 5 — reconcile via advisor or inline dimension-sweep (blocks on precedents)
+- Step 6 — verify findings: re-read each cited file:line; drop/demote unverified
+- Step 7 — write artifact
+- Steps 8–9 — present and handle follow-ups
+
+**File-orientation contract**: agents reason about *files* as coherent units. Hunks are evidence *within* a file's analysis, never the unit of analysis. The `-U30` patch (Step 1) inlines function-level context so agents rarely need extra `Read` calls.
+
+Every Wave-2 agent prompt contains EXACTLY: (a) `Known Context:` followed by the Discovery Map verbatim, and (b) the literal string `.git/code-review-patch.diff` as the patch path. Nothing else from Wave-1 outputs — NOT the raw integration-scanner dump, NOT precedent-locator output, NOT Dependencies/CVE output. See "Wave-2 context isolation" in Step 3 for the failure mode when this is violated. Wave-1 agents that do not consume the Discovery Map (precedents, dependencies, CVE) get `ChangedFiles` / manifest-diff only.
+
+## Step 1: Resolve Scope and Assemble the Diff
+
+1. **Detect default branch**: `DEFAULT_BRANCH=$(git symbolic-ref --quiet --short refs/remotes/origin/HEAD 2>/dev/null | sed 's@^origin/@@')`. Fallback: probe `main` then `master` (`git rev-parse --verify --quiet <name>`); if neither resolves, ask the user which branch is the integration target before proceeding. Use `$DEFAULT_BRANCH` wherever the parser below says `<default>`.
+
+2. **Interpret the Scope line** (from the header) and identify `OLDEST` + `NEWEST` commits (user-inclusive endpoints). Each branch carries a **strategy tag** used later by Assembly and Step 6:
+   - Empty → `OLDEST=$(git merge-base "$DEFAULT_BRANCH" HEAD)`, `NEWEST=HEAD`, strategy=`first-parent` (also skips the `BASE` computation below — use `BASE=OLDEST` directly)
+   - `commit` → `OLDEST=NEWEST=HEAD`, strategy=`working-tree`
+   - `staged` / `working` → no commits; see working-tree branch below, strategy=`working-tree`
+   - Single hash `h` → `OLDEST=NEWEST=h`, strategy=`explicit-range`
+   - Range `A..B` → verify A is ancestor of B (`git merge-base --is-ancestor A B`; swap if reversed); `OLDEST=A`, `NEWEST=B`, strategy=`explicit-range`
+   - Commit list (`h1,h2,h3` or whitespace-separated) → find endpoints via `git rev-list --topo-order`; `OLDEST` = farthest-from-HEAD, `NEWEST` = nearest. Reject if not on a single linear ancestry (ask user to clarify). Strategy=`first-parent` (ancestry invariant preserves named commits under first-parent traversal).
+   - PR branch name → `OLDEST=$(git merge-base "$DEFAULT_BRANCH" HEAD)`, `NEWEST=HEAD`, strategy=`first-parent` — note: `OLDEST` is already the parent-of-first-PR-commit, so skip the `BASE` computation below and use `BASE=OLDEST` directly.
+   - Anything unrecognised (prose, branch name that fails to resolve, mixed list) → ask the user to clarify via `ask_user_question`: (A) "review current branch vs `$DEFAULT_BRANCH` (first-parent)", (B) "review uncommitted changes", (C) "restate scope". Do NOT silently guess.
+
+2. **Compute the range once**: `BASE=$(git rev-parse "$OLDEST^")`, `TIP=$NEWEST`, `RANGE="$BASE..$TIP"`. This gives a range that INCLUDES `OLDEST`'s own changes (standard `A..B` excludes `A`). Every subsequent git command uses `$RANGE` — do NOT inline a `^` character in templates; orchestrators sometimes drop it. Also set `FP_FLAG="--first-parent"` when strategy=`first-parent`, else `FP_FLAG=""`.
+
+3. **Assemble the UNION of changes** (not the net endpoint-diff — so reverted intermediate work stays visible). Save the patch to a tempfile once with generous context; do NOT re-run `git log --patch` to slice windows later:
+   - `git log "$RANGE" $FP_FLAG --name-only --pretty=format: | sort -u` → `ChangedFiles`
+   - `git log "$RANGE" $FP_FLAG --stat --reverse` → per-commit size summary
+   - `git log "$RANGE" $FP_FLAG --patch --reverse --no-merges -U30 > .git/code-review-patch.diff` → union patches with **30 lines of surrounding context per hunk** (function-level context inline). `$FP_FLAG` is orthogonal to `--no-merges`: first-parent prunes second-parent subtrees from reachability, `--no-merges` drops the merge commit itself from the log.
+   - `git log "$RANGE" --reverse --format="%H %s%n%n%b%n---"` → commit-message context
+   - **Working-tree branch** (`staged` / `working`, no `$RANGE`): `git diff --cached --name-only` / `git diff --name-only`; `git diff --cached --stat` / `git diff --stat`; `git diff --cached -U30` / `git diff -U30`. Commit-message context is N/A. `FP_FLAG` is not applicable.
+   - **Patch-size fallback**: `-U30` produces ~2–3× the size of `-U0`. If the resulting patch exceeds ~1MB, drop to `-U10` for this run; never use `-U0` — it defeats the skill's design.
+
+3. **Bail-out**: if `ChangedFiles` is empty, print `No changes in scope {scope}. Exiting.` and STOP. Do not write an artifact.
+
+4. **Derive scope + flags** (orchestrator-side, used in later steps):
+   - `InScopeFiles` — used by the Step 6 pre-filter. `ChangedFiles` reflects *tree-reachability* (inflated on branches that back-merged the default branch — each post-merge first-parent commit inherits the merge's tree, so `--name-only` includes every file the merge resolved); `InScopeFiles` reflects *commits' own diffs* and is what the developer actually authored. Derivation:
+     - strategy=`first-parent` (empty / PR branch / commit-list inputs) → `InScopeFiles = ⋃ git diff-tree --no-commit-id --name-only -r <h>` over `git log "$RANGE" --first-parent --no-merges --pretty=%H` (each feature commit's own file delta; back-merge sidecars drop out even when the merge is on the first-parent line). For commit-list input, iterate over the user-named hashes instead of the first-parent walk to preserve non-contiguous-list intent.
+     - strategy=`explicit-range` → `InScopeFiles = ChangedFiles` (user explicitly asked for range semantics; merges in the range are part of the intent).
+     - strategy=`working-tree` → `InScopeFiles = ChangedFiles` (no merge surface).
+   - **Invariant**: `InScopeFiles ⊆ ChangedFiles`. On back-merged feature branches, `InScopeFiles ⊊ ChangedFiles` is the primary mechanism by which sidecar findings get dropped at Step 6.
+   - `ManifestChanged` = ChangedFiles intersects any dependency manifest or lockfile (e.g. `package.json`/lockfile, `Cargo.toml`/`Cargo.lock`, `go.mod`/`go.sum`, `pyproject.toml`/`requirements*.txt`/`poetry.lock`, `Gemfile*`, `*.csproj`, `pom.xml`/`build.gradle*`, `composer.json`, …) OR a peer/optional/dev-dependency field was touched.
+   - `LockstepSelfReview` = repository root contains `scripts/sync-versions.js` AND every `packages/*/package.json` shares the same `version:` AND the diff touches `packages/*/package.json`.
+   - `HasGatingPredicate` = diff adds or modifies a **status/enum-comparison predicate** (`Status == X`, `Status is X or Y`, `X.Contains(Status)`, pattern-match on a discriminator) OR introduces a new value into an enum referenced by existing gating predicates. NOT merely the presence of `if (!x) return`.
+   - `ReviewType` = one of `commit | pr | staged | working`.
+   - `PeerPairs` = `(new_file, peer_file)` tuples. `new_file` is in `git log "$RANGE" --diff-filter=A --name-only` (working-tree: `git diff --diff-filter=A --name-only [--cached]`). `peer_file` exists at HEAD (`git ls-tree HEAD`) and matches one heuristic:
+     - **Stem similarity ≥ 60%** of the longer stem (e.g. `PhysicalProductSubscription` ↔ `Subscription`).
+     - **Interface/impl pair**: `I<Name>` ↔ `<Name>`, `<Name>` ↔ `<Name>.impl`, `<Name>{Abstract,Base,Protocol}` ↔ `<Name>`.
+     - **Shared suffix** from `{Handler, Service, Repository, Aggregate, Reducer, Controller, Resolver, Command, Query, Job, Processor, Strategy, Policy, Event, Listener, Subscriber, Publisher, Exception, Eligibility, Ability, QueryParam, Specification, Factory, Builder}`.
+
+     Drop a pair only when the peer doesn't exist at HEAD, no heuristic matches, or both files were added in this diff. Empty list ⇒ skip the peer-mirror agent. Co-modified peers are KEPT — the agent Reads them at HEAD (post-diff tree state), so any invariant present at HEAD counts as peer evidence regardless of whether the peer was edited in this diff.
+
+## Step 2: Dispatch Wave-1 — Integration + Precedents + Deps/CVE + Peer-Mirror
+
+Spawn ALL of the following in parallel at T=0 in a **single message with multiple Agent tool calls**. Do NOT wait for integration-scanner before dispatching precedents / dependencies / CVE — they do not consume Discovery-Map output, only `ChangedFiles` and the manifest diff (both orchestrator-produced in Step 1).
+
+**Agent — Integration map:**
+- subagent_type: `integration-scanner`
+- Prompt: "Map inbound references, outbound dependencies, and infrastructure wiring for the following changed files: {ChangedFiles, one per line}. Flag any auth-boundary crossings (middleware, guards, interceptors, authorize-style decorators) and config/DI/event registration touching these paths. Do NOT analyse code quality — connections only, in your standard output format."
+
+**Agent — Precedents** (always): use the `precedent-locator` prompt defined in Step 3 below — dispatch it here, not in Wave-2. Input it needs: `ChangedFiles` only.
+
+**Agent — Dependencies** (only when `ManifestChanged`): use the `codebase-analyzer` Dependencies prompt defined in Step 3 below — dispatch here. Input it needs: touched manifest paths + `LockstepSelfReview` flag.
+
+**Agent — CVE / advisory** (only when `ManifestChanged`): use the `web-search-researcher` prompt defined in Step 3 below — dispatch here. Input it needs: parsed `name@version` list from the manifest diff (orchestrator extracts and hands over directly).
+
+**Agent — Peer-Mirror** (only when `len(PeerPairs) > 0`): `subagent_type: peer-comparator`. Input: the `PeerPairs` list verbatim, nothing else — no Discovery Map (it isn't built yet and the agent doesn't need it), no patch path (the work is peer-vs-new entity comparison, not diff analysis). Prompt:
+  ```
+  Peer-mirror check.
+
+  PeerPairs (orchestrator-computed):
+  {list of (new_file, peer_file) tuples}
+
+  For each pair, Read BOTH files in full. Enumerate the peer's PUBLIC surface as rows:
+  - every public method / exported function
+  - every domain event / notification / message fired (language-agnostic: method calls named `fire*`, `emit*`, `publish*`, `dispatch*`, `raise*`, `notify*`, `AddDomainEvent`, or idiomatic equivalents)
+  - every state transition (name + precondition guard + side-effects)
+  - every constructor-injected / DI-supplied collaborator
+  - every persisted field / column / serialised property
+  - every registration this file contributes to a switch/map/table/route/handler registry elsewhere (match by type name appearing in a `switch`/`match`/`when`/dispatch table)
+
+  For each row, check the new file. Emit ONE row per peer invariant:
+
+    peer_site (file:line — `<verbatim line>`) | new_site (file:line — `<verbatim line>` OR `<absent>`) | status | one-sentence delta
+
+  status ∈ {Mirrored, Missing, Diverged, Intentionally-absent}.
+
+  "Intentionally-absent" requires an explicit cite — a comment in the new file, a commit-message line mentioning the omission, or a type-system constraint that makes the invariant inapplicable (e.g. the peer's `Trial*` methods are absent because the new entity's type says it doesn't support trials). Suspicion is not sufficient; when in doubt, emit Missing.
+
+  Output format: markdown table per pair, heading `### Peer pair: <new_file> ↔ <peer_file>`. No prose outside the tables. No severity. No recommendations. Citation contract applies to every cell.
+  ```
+
+While these agents run, the orchestrator produces the rest of the Discovery Map inline from Step 1's data:
+- `ChangedFiles`, `ManifestChanged`, `LockstepSelfReview`, `ReviewType`
+- **Semantic file map** (per file: `path (+A -B)` + role tag + top-level symbol names touched; see format and rules below)
+- Commit-message context (if applicable)
+
+**Wait for `integration-scanner` AND the peer-mirror agent (when dispatched)** before dispatching Wave-2. Wave-2 agents consume both via the Discovery Map (auth-boundary crossings, inbound refs, peer-mirror Missing/Diverged rows). Precedents / Dependencies / CVE continue running in the background; **Precedents MUST be awaited before Step 5 begins** (Reconciliation reads its follow-up-within-30-days counts to weight severity; see Step 5). Dependencies / CVE also merge in at Step 5 but may arrive later in the wait barrier.
+
+**Synthesize the Discovery Map** — a compact block that Wave-2 agents receive verbatim as `Known Context`. Each file line carries a *role tag* and a *symbols-touched hint*; files are clustered by shared directory prefix so agents orient without re-reading the patch.
+
+```
+## Discovery Map
+
+Review type: {ReviewType}
+Scope: {scope argument}
+Commit/range: {git ref}
+Manifest changed: {yes|no}
+Lockstep self-review: {yes|no}
+
+Changed files ({N}):
+
+  ## {cluster — shared directory prefix}
+    path/file.ext (+A -B) {role-tag} — top 1–3 symbols touched
+    ...
+
+Auth-boundary crossings: {integration-scanner, file:line}
+Inbound refs (files with ≥3 consumers): {integration-scanner}
+Outbound deps: {integration-scanner}
+Wiring/config: {integration-scanner}
+Peer mirrors: {peer-mirror agent output verbatim — Missing/Diverged rows only; Mirrored and Intentionally-absent rows are summarised as counts}
+```
+
+**Clustering**: group files by longest shared directory prefix yielding clusters of 2+ files; singletons form their own cluster labelled with the filename. Emerges from the repo — no framework assumptions.
+
+**Role-tag** (one tag per file, first match wins):
+1. `[boundary]` — in integration-scanner's auth-boundary output
+2. `[persistence]` — path contains `migration`/`schema`/`repository`/`dao`/`model`, or matches an ORM/migration convention visible in the repo
+3. `[test]` — path contains `/test`/`/spec`/`__tests__`, or filename ends in a test suffix (`.test.*`, `.spec.*`, `_test.*`, `Test.*`)
+4. `[config]` — in integration-scanner's wiring/config output, or is a manifest/lockfile/settings file
+5. `[hub]` — in integration-scanner's inbound-refs with ≥3 consumers
+6. `[code]` — default
+
+**Symbols-touched hint**: extract top 1–3 top-level definitions from the diff's `+` lines using a heuristic appropriate to the file's language (class/function/def/fn/struct/trait/interface/type/export). Cap at ~80 chars. Leave blank if ambiguous — orientation, not completeness.
+
+## Step 3: Dispatch Wave-2 — Quality + Security Lenses
+
+Spawn Quality + Security in parallel using the Agent tool. Each receives the `## Discovery Map` block inline as `Known Context` above its task, and a pointer to `.git/code-review-patch.diff` for the diff itself. Precedents / Dependencies / CVE are already running from Wave-1 — do NOT re-dispatch them here; the prompts below document what those Wave-1 agents received, they are not re-issued.
+
+**Wave-2 context isolation (LOAD-BEARING — violations cause silent quality collapse)**: Each Wave-2 agent receives EXACTLY two things, nothing else: (1) the Discovery Map (digested form) and (2) the literal path string `.git/code-review-patch.diff`.
+
+**DO NOT paste into Wave-2 prompts**, under any circumstance, even if the orchestrator has already received them:
+- raw integration-scanner output (the Discovery Map already summarises its auth/ref/wiring findings)
+- precedent-locator output
+- Dependencies lens output
+- CVE lens output
+- any prior Wave-2 or Wave-3 output from earlier runs in the same Pi session
+
+**Why this is load-bearing**: summary context induces *narrativisation* — the agent treats the preamble as "the orchestrator already framed the findings, I just classify them" instead of independently reading the patch file. Observed failure signatures when this is violated: Quality drops from ~40 tool calls / 3M tokens / 500s to ~5-15 tool calls / 300k tokens / 100-200s, and returns hallucinated findings (invented statuses, mis-cited line numbers, claims that files are "missing from patch" when they are in fact present).
+
+**Self-check before dispatching Wave-2**: read your outgoing Agent prompt. If it contains any content from Wave-1 agent RESULTS beyond the Discovery Map you synthesised, strip it. The Discovery Map is the contract; raw outputs are reconciliation-only.
+
+**Citation contract** (applies to every Wave-2+ agent, every step): every `file:line` citation MUST be accompanied by the literal line text in backticks — format `file:line — \`<verbatim line>\` — <note>`. Omit findings whose lines you cannot quote verbatim.
+
+**Quality lens** (`diff-auditor`) — **file-oriented**:
+  ```
+  Analyse changes file by file. For each file in ChangedFiles, read its diff region in `.git/code-review-patch.diff` (patch has `-U30` — full function context is already inline; rarely need an extra Read call), form a mental model of what the file does and what the diff changes about it, then apply the 13 surfaces below to the file as a whole. Cite `file:line` with verbatim line text (citation contract) for every finding. Omit findings not traceable to a diff-touched change. No severity.
+
+  **File order strategy**: prioritise by role tag — `[boundary]` files first (security-sensitive), then `[persistence]` (durable-state surfaces), then `[hub]` (blast-radius amplifiers), then `[code]`, then `[config]`, then `[test]` last. Within the same tag, prioritise files with the largest diffs.
+
+  **Per file**, write a short section (`### file/path.ext`) containing only the surfaces that APPLY to that file's changes. Use sub-headings for grouped evidence. A surface may be flagged across multiple files — report the evidence where it lives, and rely on cross-layer surfaces (8, 9) to tie them together.
+
+  **Surfaces** — each surface's mechanical trigger decides whether it APPLIES to a given file's changes. Walk every applicable trigger:
+
+  1. **Logic & flow** (always, per file with new/modified code) — validation, error paths, off-by-one, null misses, branch ordering, return/await, unguarded mutation.
+  2. **Pattern coherence** (≥2 similar constructs within a file, or ≥2 files in the same cluster) — cite nearby line broken from.
+  3. **Blast radius** (Discovery Map lists inbound refs to this file, OR this file is flagged as a hub) — `consumer:line` + what changes for each inbound ref.
+  4. **Test coverage gaps** (always, checked once across the whole changeset) — for each risk-bearing function/method added/changed, check whether any `[test]`-tagged file in ChangedFiles contains a corresponding test. Flag risk-bearing behavior with no adjacent test.
+  5. **Predicate-set coherence** (`HasGatingPredicate`) — ≥2 conditionals on the same enum/type across the changeset. Tabulate `predicate file:line | accepted | rejected`. Flag mismatches. Surface 5 output MUST use heading `### Predicate-set coherence` at review-scope level (not nested inside a per-file section) — downstream step consumes it verbatim.
+  6. **Registration coverage** (changeset adds discriminator value / enum variant / handler key / route / event type / strategy entry) — every dispatch/registry/switch across ChangedFiles that must enumerate it. Cite each registration site + each enumeration site. Flag gaps.
+  7. **Query/write symmetry** (changeset adds setter/linker, shape change, or new persisted field) — trace BOTH creation and renewal/update paths; cite the setter file:line AND the reader file:line.
+  8. **Cross-layer drift** (same entity/enum/key appears in ≥2 files in different clusters or role tags — e.g. model ↔ DTO ↔ schema ↔ registry ↔ presentation) — open each file, tabulate presence, flag asymmetry. When the entity is a **key fanned out across parallel tables** (locale maps, theme maps, strategy registries, handler/command tables, feature-flag tables), every table must carry the added key and none must retain a removed/renamed one — flag orphaned references and missing entries.
+  9. **Peer-member consistency** (a file gains a new method/hook/case/handler in a set of peers — class methods, hook siblings, reducer cases, handler registrations, CLI subcommands) — tabulate the invariants the peers share (state mutation, emitted event/signal, precondition guard, bookkeeping counter, teardown symmetry); flag omissions in the new member.
+  10. **Durable-state hygiene** (file is a schema migration, repository/DAO, file-backed config, KV/cache, serialized artifact, or adds a new persisted field) — trace the forward-write AND the reverse/rollback path; flag irreversible or data-losing rollback, missing lookup affordance for a new query field (index, key, sorted structure), iteration over an unbounded/mutable source without a stable cursor, and new invariants at the storage layer not mirrored by the in-memory validator.
+  11. **Shared-state acquisition** (file introduces async handler, event listener, singleton init, queue consumer, file-lock region, or global cache mutation) — trace acquire/release around every mutation; flag unguarded check-then-act across an `await`/callback/IPC boundary, stale reads while another writer is in flight, non-commutative acquire order between distinct state slots, and replay/retry paths that lack an idempotency key.
+  12. **Multi-step commitment** (file issues ≥2 writes that must all succeed or all be undone — DB transaction, cross-table mutation, multi-file write, filesystem+network pair, multi-API orchestration, compensating-action chain) — trace the commit boundary; flag missing undo/compensation on partial failure, retry paths that re-apply non-idempotent steps, and divergence between two stores that must agree without a coordinating primitive.
+  13. **Error handling & idempotency** (file adds a failure-response construct — retry loop, catch/except block, error boundary, fallback branch, circuit breaker, timeout, resumable step) — trace the error-propagation path; flag swallowed errors, retry without an idempotency key, fallback that silently degrades observable behavior, unjustified timeout values.
+
+  **Economising Reads**: issue a `Read` only when (a) you need a file NOT in ChangedFiles (hub, peer, test), or (b) the changed function is longer than the `-U30` window can show. Never re-Read a file just to re-orient — that's what the symbols-touched hint is for.
+  ```
+
+**Security lens** (`diff-auditor`) — **file-oriented**:
+  ```
+  Analyse each changed file as a whole, looking for sinks in the classes below. For each file, grep the file's diff region in `.git/code-review-patch.diff` (patch has `-U30` — sink context is inline) for the sink patterns, and for each hit provide the verbatim line (citation contract) plus 2 surrounding lines and `confidence: N/10` that user-controlled input can reach the sink under current deployment. Drop hits with confidence < 8. Cross-reference Discovery Map auth-boundary crossings and inbound refs — a sink in a file reached from an auth-boundary file is in scope even if the sink file itself doesn't cross the boundary.
+
+  **File order strategy**: `[boundary]` files first (direct source→sink exposure); then `[persistence]` (query injection, unsafe deserialization); then `[code]` (command exec, SSRF, explicit-trust rendering); then `[hub]` / `[config]`; skip `[test]` unless a test helper touches a sink.
+
+  IN-SCOPE RULE: only report findings whose sink line is inside a changed file's diff region (add/modify/adjacent-context rewrite). Pre-existing sinks in files the diff did not change are out of scope, UNLESS the diff changes how data flows TO the sink (e.g. new user-controlled source routed to an untouched sink) — then cite both locations. When in doubt, trace data flow from Discovery Map boundary crossings through the changed files.
+
+  Sink classes — match the concept in whatever language the diff uses:
+  - **Command execution** — shell/process spawn w/ user input (e.g. `exec`, `subprocess.run(shell=True)`, `Runtime.exec`, …).
+  - **Dynamic code / unsafe deserialization** — `eval` / dynamic-function constructors; deserializers that can execute code (e.g. `pickle.loads`, `yaml.load` w/o safe loader, `ObjectInputStream`, `BinaryFormatter`, …).
+  - **Query injection** — user input concatenated or interpolated into a query string interpreted by an external engine (SQL, NoSQL operators, LDAP filters, XPath, GraphQL constructed as string). Parameterized/prepared queries and typed query builders are safe.
+  - **Explicit-trust rendering** — user input emitted into a channel that will interpret it as code/markup rather than data. In-scope only when the framework's explicit-trust API is invoked (`dangerouslySetInnerHTML`, `bypassSecurityTrustHtml`, `v-html`, `template.HTML`, `mark_safe`, `html_safe`, triple-stache, raw-HTML markdown, …), plain `innerHTML =` / `document.write(` in vanilla/server-rendered code, or equivalent passthroughs in non-HTML renderers (unescaped ANSI-escape emission to a TTY, unquoted shell-prompt interpolation, template-engine raw blocks).
+  - **Path traversal** — user-controlled components into file-system APIs without normalization/allowlist.
+  - **SSRF** — outbound HTTP/TCP with user-controlled host OR protocol (not just path).
+  - **Secrets in diff** — literal credentials, API keys, PEM blocks, connection strings w/ embedded passwords, `.env` content.
+  - **Missing trust-boundary check** — a traced sink reached from a Discovery-Map boundary crossing (HTTP handler, RPC endpoint, IPC message, CLI flag that flows to a privileged operation, webhook receiver) without an upstream authorization/validation step (middleware, guard, attribute/decorator, allowlist check, signature verification).
+
+  Do NOT report: DOS/resource exhaustion/rate-limiting, missing hardening without a traced sink, theoretical races/timing without reproducer, log spoofing/prototype pollution/tabnabbing/open redirects/XS-Leaks/regex DOS, client-side-only authn/authz (server is the authority), findings sourced only from env var / CLI flag / UUID, test-only or notebook files without a concrete untrusted-input path, outdated-dependency CVEs (CVE lens handles).
+
+  Name the sink class and the matched idiom. Evidence only. No CVE lookups.
+  ```
+
+**Dependencies lens** (`codebase-analyzer`, only when `ManifestChanged`; otherwise SKIP and omit `### Dependencies` in artifact):
+  ```
+  Lockstep self-review: {yes|no}
+
+  Identify the ecosystem from touched manifests (npm, Cargo, Go modules, PyPI/Poetry, Bundler, NuGet, Maven/Gradle, …). Parse the changed manifest(s) and list:
+  1. Added deps: `name@version` with `file:line`.
+  2. Bumped deps: `name: old -> new` with `file:line`.
+  3. Removed deps.
+  4. Peer / optional / dev-scope changes (whatever the ecosystem calls them).
+  5. License field changes in manifest or lockfile.
+  6. Lockstep=yes: flag only intra-monorepo drift where a sibling pin diverges from the lockstep version. Treat wildcard peer pins as intentional.
+  7. Lockstep=no: flag version conflicts between direct dep and lockfile resolution.
+
+  Evidence only. No CVE lookups.
+  ```
+
+**Precedents lens** (`precedent-locator`):
+  ```
+  Code review of {scope}. Changed files: {ChangedFiles}.
+  Find similar past changes touching these files or nearby. Per precedent: commit hash, blast radius, follow-up fixes within 30 days, one-sentence takeaway. Distil composite lessons.
+  ```
+
+**CVE/advisory lens** (`web-search-researcher`, only when `ManifestChanged`):
+  ```
+  Look up CVEs / GitHub Advisories / OSS Index entries for the target versions. Return LINKS. Per vulnerability: severity (Critical/High/Moderate/Low), affected range, whether bumped-to version is fixed.
+
+  Dependencies:
+  {name@version per line — orchestrator-extracted}
+  ```
+
+**Wait for Quality + Security to complete** before proceeding. Precedents / Dependencies / CVE from Wave-1 may still be running; gather them before Step 5, not before Step 4.
+
+## Step 4: Dispatch Wave-3 — Predicate-Trace + Interaction Sweep + Gap-Finder
+
+Once Wave-2 (Quality + Security) completes, dispatch 4a and 4b as parallel agents **in a single message**; compute 4c inline (orchestrator-side set arithmetic — no agent). They do NOT consume each other's output:
+
+- **Interaction Sweep (4b)** receives Quality's `Predicate-set coherence` table directly as its predicate-row source. Quality's table already flags mismatches — Predicate-Trace (4a) only *elaborates* them through consumers. Interaction Sweep's categories 1–6 don't need 4a at all; categories 7–9 (stranded-state, false-promise, co-tenant filter gap) operate on the same rows 4a would trace.
+- **Gap-Finder (4c)** is coverage arithmetic: `{in-scope files} − {files with ≥1 Quality/Security finding} = {uncovered files}`. Orchestrator already holds both sets post-Wave-2 — an agent would discard context only to re-receive it via prompt. Inline is strictly cheaper and deterministic.
+- If Predicate-Trace (4a) surfaces a row that was not visible in Quality's table, append it via a Step 9 follow-up — cheaper than a serial gate.
+
+### Step 4a: Predicate-Trace
+
+**Gate**: SKIP this sub-step (do not dispatch 4a) unless `HasGatingPredicate` is true AND the Quality lens returned ≥2 rows in its `Predicate-set coherence` table referencing the same enum/type. If skipped, 4b and 4c still dispatch.
+
+Otherwise spawn ONE `codebase-analyzer` in parallel with 4b:
+  ```
+  Coherence rows (Quality — Predicate-set coherence): {paste verbatim}
+  Gating predicates in diff: {`file:line` list}
+
+  Per predicate, return: `predicate file:line | inputs | promise (what TRUE/matching branch implies) | consumer file:line | consumer filter | fulfils? | gap`.
+
+  Flag:
+  - *False promise* — matching branch depends on a consumer/filter elsewhere that excludes this entity's source/type/state.
+  - *Stranded state* — entity reaches state X via one conditional, but every conditional that operates on this entity elsewhere excludes X (no exit path).
+
+  Evidence only. Citation contract applies.
+  ```
+
+Do NOT wait — 4b (Interaction Sweep) dispatches in the same message as 4a; 4c runs inline in the orchestrator.
+
+### Step 4b: Interaction Sweep
+
+**Gate**: SKIP this sub-step (do not dispatch 4b) when EITHER `len(ChangedFiles) < 2` OR the Quality lens returned fewer than 4 total observations across all files. Emergent interactions need surface area; tiny diffs cannot structurally produce them.
+
+Otherwise spawn ONE `codebase-analyzer` in parallel with 4a:
+  ```
+  Quality Evidence: {verbatim}
+  Security Evidence: {verbatim}
+  Predicate-set coherence rows (verbatim from Quality's table — full Step 4a output is NOT required and is NOT awaited): {verbatim | "not applicable"}
+  Precedents: {verbatim if Wave-1 finished; else "deferred to Step 5"}
+
+  Group evidence by shared entity, state machine, workflow, data flow path, API boundary, background process, or producer-consumer contract.
+
+  Per group, check for emergent defects:
+  1. contradictory assumptions between components/layers,
+  2. unreachable, stuck, or non-terminal states,
+  3. retry/reprocess mechanisms made inert by another behavior,
+  4. duplicate-processing / idempotency gaps from ordering or missing guards,
+  5. guards in one layer invalidating transitions in another,
+  6. one finding masking, amplifying, or permanently triggering another,
+  7. stranded state — state X reachable via one conditional but every conditional operating on this entity elsewhere excludes X,
+  8. false-promise predicate — matching branch's consumer excludes this entity's source/type/state,
+  9. co-tenant filter gap — shared discriminator filter where a new or terminal value the diff touches falls through every consumer's filter.
+
+  Return only findings with ≥2 concrete `file:line` facts from different files/components, each quoted verbatim per the citation contract. No recommendations. No single-location repeats.
+
+  For findings involving ordering/races/concurrency across processes or handlers, name the ordering primitive that would prevent the race (distributed lock, exclusive-key wrapper, ordered partition, transaction, idempotency key, etc.) and explain why it does NOT apply here. Drop the finding if the primitive exists in the diff or nearby and your argument against it is speculative.
+  ```
+
+### Step 4c: Gap-Finder (orchestrator-side coverage arithmetic)
+
+**Gate**: SKIP when `len(ChangedFiles) < 2`. Tiny diffs cannot structurally have coverage gaps.
+
+No agent dispatch. Compute inline while 4a / 4b run:
+
+1. **Coverage map** — parse Quality + Security outputs; for each finding row extract its `file:line` citation and map `file → {finding-id}`. Files with ≥1 row are covered; files with none are uncovered.
+2. **In-scope filter** — keep files tagged `[boundary]`, `[persistence]`, `[code]`, or `[hub]` AND whose diff delta (sum of added + removed lines) is ≥ 5. Drop `[test]` and `[config]` entirely; drop files with tiny deltas.
+3. **Emit gap findings** — walk uncovered in-scope files in role-tag priority `[boundary]` → `[persistence]` → `[hub]` → `[code]`. For each, open its diff region in `.git/code-review-patch.diff` and pick ONE risk-bearing line (first non-comment `+` line, or the function-declaration header if a whole function was added). Emit:
+
+   `G<ordinal> — file:line — \`<verbatim line>\` — {role-tag} — <risk class in 3-6 words>`
+
+   Risk-bearing behavior class (diff introduces one of): state mutation | I/O (DB/network/file) | error path | conditional on mutable state | concurrent/shared-state access | public API surface change. Maximum **5** gap findings; stop once reached. Citation contract applies.
+
+**Wait for ALL of 4a / 4b AND the Precedents agent from Wave-1 to complete** before proceeding to Step 5 (Reconciliation). Precedents is a **hard gate** — severity weighting in Step 5 reads its follow-up-within-30-days counts. Dependencies / CVE (when dispatched) also merge in here but are not individually hard-gated; wait for them too unless they clearly exceed the review SLA, in which case omit `## Dependencies` and note it in the artifact. 4c has no wait — it completes synchronously with the orchestrator.
+
+## Step 5: Reconcile Findings
+
+**Barrier**: Step 5 MUST NOT begin until the Precedents agent has returned. Severity weighting depends on historical follow-up counts; starting reconciliation without them produces mis-weighted severities that the verification pass (Step 6) cannot correct.
+
+**Resolution integrity check** (load-bearing): when Precedents returns a commit that claims to resolve or supersede a current finding, run `git merge-base --is-ancestor <precedent-hash> <TIP>` before accepting the resolution.
+  - Ancestor: the precedent IS in the reviewed branch; mark the finding `resolved-by: <hash>` and demote its severity to 💭 (kept for context).
+  - Not ancestor: the precedent is on a different branch / not merged; treat it as **context only**. Do NOT mark the finding resolved; annotate the precedent's row in `## Precedents` third column `NOT ancestor of {TIP} (context only)`.
+  - Orchestrator unable to run git (e.g., `staged` / `working` scope with no commit): skip the check and annotate `resolution: unverified`.
+
+1. **Compile and classify evidence** per lens:
+   - **Quality** — 🔴 traced flow contradiction (dropped error path, missing validation on a sink, null-deref); 🟡 blast-radius × complexity-delta (hot path + new allocation, ABI change without migration); 🔵 pattern divergence with nearby template; 💭 composite-lesson architecture concern.
+   - **Security** — 🔴 concrete user-reachable source→sink trace via Discovery Map auth-boundary (reject hits without explicit trace); 🟡 concrete crypto issue (weak hash in auth/integrity role, non-constant-time compare, hardcoded key material); 🔵 divergence from a secure example in the same file; 💭 architectural question.
+   - **Dependencies** — 🔴 Critical/High CVE in touched dep OR lockstep-contract violation; 🟡 Moderate CVE, outdated major with migration path, license incompatibility; 🔵 minor/transitive drift; 💭 architectural dep question.
+   - **Interaction-sweep** — 🔴/🟡 only (no 💭): 🔴 concrete emergent failure across ≥2 files/components; 🟡 multi-component mismatch with bounded blast radius or existing mitigation. **Promotion rule**: when ≥2 lens findings share the same entity/flow and combine into an emergent failure, the aggregate is 🔴 even if each constituent was 🟡/🔵. The interaction IS the defect — don't leave constituents at their original severity and skip the cross-finding bullet.
+   - **Gap-finder** — 🟡 uncovered risk-bearing region in a changed file with no lens coverage; 🔵 low-impact gap (style-only or defensive-only region missed). No 🔴 (gap findings are uncertain by nature).
+   - **Peer-mirror** — treat every Missing/Diverged row as a finding. Base severity 🔵. **Bump to 🟡** when the missing invariant is a domain-event emission, a precondition guard on a state-mutating method, or a persisted-field invariant. **Bump to 🔴** when the missing invariant intersects a dispatch site the diff touches (switch/map/table/registry enumerating the peer's type alongside the new type — detectable from integration-scanner's `Wiring/config` output and the Discovery Map's `[config]`/`[hub]` files). Rationale: a missing mirror on a dispatched invariant is a silent-stranded-state cascade constituent; on a non-dispatched invariant it is a style issue. Record every peer-mirror bump in `## Reconciliation Notes`.
+   - **Precedents** → compile into `## Precedents` (table: `hash | subject | 30d-follow-ups | note`), composite lessons below. **Severity weighting**: for each current finding, count precedent commits touching the same symbol/file that had ≥1 follow-up fix within 30 days. If count ≥ 2, bump the finding one severity tier (🔵→🟡, 🟡→🔴); cap at 🔴. Record the bump by annotating the finding's title line `[precedent-weighted]` — do NOT emit a separate reconciliation section.
+
+2. **Probe advisor availability** — attempt a probe by checking whether `advisor` is in the active tool set (main-thread visibility). If yes, proceed to advisor path; otherwise take the inline path.
+
+3. **Advisor path** (when advisor is active):
+   - Print a main-thread `## Pre-Adjudication Findings` block first — the advisor reads `getBranch()`, so evidence must be flushed before the call.
+   - Call `advisor()` (zero-param). If it returns usable prose, paste it verbatim as a blockquote at the top of `## Recommendation` and skip the inline path. Otherwise fall through.
+
+4. **Inline path** (advisor unavailable or errored):
+   - Run a dimension-sweep modeled on `skills/design/SKILL.md:83-116`: Data model / API surface / Integration / Scope / Verification / Performance.
+   - For every finding, ask: does another finding contradict this severity given the Discovery Map? If yes, note the tension.
+   - Record every severity move as a title-line annotation on the affected finding (`[precedent-weighted]`, `[cascade: <kind>]`, `[subsumed-by <ID>]`). No standalone reconciliation section.
+
+5. **Emit the reconciled severity map** — authoritative severity per finding, carrying the advisor's guidance when present. Keep the per-pass grouping (do NOT tag each finding with its originating lens in prose; the H2 it sits under is the tag).
+   - Interaction findings carry `I<n>` IDs and appear under the severity H2 that matches their final tier (`## 🔴 Critical`, `## 🟡 Important`). The old `### Cross-Finding Interactions` sub-heading is retired — severity is the top-level grouping.
+   - When an interaction finding subsumes a local finding at the root-cause level, keep the local finding only if its evidence is independently actionable; annotate the local finding's title line `[subsumed-by I<n>]`.
+   - Distinct structural defects MUST remain distinct findings even when related. Specifically: a *stranded state* (entity reaches X, no exit path) and a *false-promise predicate* (TRUE branch promises unreachable behavior) are separate defects even when they arise in the same subsystem — do NOT collapse them. Collapse only when the narrative, fix, and evidence locations are identical.
+   - **Cascade detection** (load-bearing): before emitting severities, scan findings for these triples and emit a 🔴 Cross-Finding bullet if any fires —
+       • *{entity reaches state X} + {no event on that transition} + {consumer filter excludes X}* = **silent stranded state**
+       • *{check-then-act on shared resource} + {no ordering primitive} + {retry/replay path}* = **duplicate-processing cascade**
+       • *{spec A accepts Y} + {spec B rejects Y} + {workflow depends on both}* = **contradictory-predicate deadlock**
+     Also check `thoughts/shared/reviews/*.md` and Precedents: if a prior review names a cascade whose constituents appear in current findings, cite it and assert reproduction. Missed cascades are the biggest historical quality regression; prefer false positives here.
+
+## Step 6: Verify Findings
+
+Before writing the artifact, spawn ONE `claim-verifier` whose sole job is to ground every reconciled finding in the actual code at its cited `file:line`. This catches two classes of error the lenses cannot self-detect: (a) *confident assertions* the agent never opened a file to confirm, and (b) *rationalisations* ("intentional-by-design", "pre-existing", "not a real deadlock") that contradict what the code does. Lens agents reason from the patch; the verifier reasons from the file.
+
+**Dispatch** after Step 5's reconciled severity map is final, before Step 7 writes anything. First apply the **InScopeFiles pre-filter**: drop any finding whose cited `file` ∉ `InScopeFiles` (orchestrator-side set arithmetic, matches Step 4c's idiom). On `first-parent` strategies `InScopeFiles ⊊ ChangedFiles` is expected — this is where back-merge sidecar findings get dropped. Record the dropped count in `## Reconciliation Notes` so the omission is auditable. Then dispatch the filtered map:
+
+  ```
+  Verify each finding below against the actual repository state. You have Read access to the whole tree.
+
+  Findings (verbatim from Step 5):
+  {paste the full reconciled severity map — each finding with its file:line citation, verbatim line quote per the citation contract, and severity tier}
+
+  For EACH finding:
+  1. `grep -n` the verbatim quote in the cited file. Absent → Falsified. Present at a different line → rewrite the citation to the actual line, then continue. Present at cited line → continue.
+  2. If the finding makes a claim about behavior reachable elsewhere (consumer filters, dispatch registrations, peer aggregates, upstream guards, downstream sinks), Read those referenced files too. Do NOT trust the patch-only view.
+  3. If the finding claims a state is "stranded" / a predicate is "false-promise" / a precondition is "missing" — construct a concrete 2–3 line reproducer trace: "caller at A:L invokes B:L with entity in state X; guard at C:L rejects; exit path would require D which the code does not provide." If you cannot construct it, the finding is Weakened.
+  4. If the finding was marked `resolved-by: <hash>` in Step 5, Read the resolving commit's changes on the reviewed branch (via `git show <hash> -- <file>`) and confirm the resolution is actually present at TIP.
+
+  Return ONE tag per finding — output format:
+
+    FINDING <id> | <tag> | <one-sentence justification citing a file:line>
+
+  Tags:
+  - Verified — quote matches, claim is reproducible against the actual code, no contradiction found
+  - Weakened — claim is partially true but narrower than stated (severity should drop one tier), OR it relies on a consumer-side assumption that the actual consumer does not make
+  - Falsified — quote does not match, OR the claimed behavior is contradicted by code the lens did not read (peer site, upstream guard, existing handler, resolution already applied)
+
+  Be explicit about contradictions. If finding A says "intentional by design" and finding B says "stranded state" about the same entity, mark the one whose claim the code does NOT support as Falsified and cite the contradicting line.
+
+  Citation contract applies to every justification. No recommendations. No new findings.
+  ```
+
+**Before applying tags** — re-read every Weakened and Falsified justification (the tag is a summary; the justification is the evidence). Per `agents/claim-verifier.md` tag semantics: Weakened = narrower, Falsified = wrong direction, Verified = correct or understated. If a justification contradicts its tag (e.g. "inverted" / "opposite" under Weakened, or "worse than stated" under Weakened), override before applying the rules below. Also verify identity on the ID set — exactly one row per input finding; re-dispatch `claim-verifier` on any missing IDs before proceeding.
+
+**Apply the tags** (on the corrected tag):
+- **Falsified** findings — remove from the artifact entirely. Their ID is retired (never reused); the retirement is counted in the frontmatter `verification` string (`F` dropped) and nowhere else.
+- **Weakened** findings — demote one severity tier (🔴→🟡, 🟡→🔵, 🔵→💭). Rewrite the finding's evidence line to reflect the narrower claim.
+- **Verified** findings — carry through unchanged to Step 7.
+- **Edge case**: if a 🔴 Cross-Finding Interaction bullet relies on constituents that are now Falsified or Weakened, re-evaluate the interaction. Drop the interaction if it no longer stands on ≥2 Verified constituents from different files.
+
+**Gate**: if verification removes / demotes ALL 🔴 findings AND there are no remaining 🟡 findings, set `status: approved` in the artifact frontmatter. Otherwise `status: needs_changes` (or `requesting_changes` for verified 🔴 > 3).
+
+**Do not skip this step** — it is the only mechanism that stops confident-but-unread lens assertions from reaching the artifact.
+
+## Step 7: Write the Review Document
+
+1. **Determine metadata**:
+   - Filename: `thoughts/shared/reviews/YYYY-MM-DD_HH-MM-SS_{scope-kebab}.md`
+   - Repository: git root basename (fallback: cwd basename).
+   - Branch + commit: from git-context injected at session start, or `git branch --show-current` / `git rev-parse --short HEAD` (fallback: `no-branch` / `no-commit`).
+   - Timestamp: run `date +"%Y-%m-%dT%H:%M:%S%z"` — raw for `date:`, first 19 chars (`T`→`_`, `:`→`-`) for filename slug.
+   - Reviewer: user from injected git-context (fallback: `unknown`).
+
+2. **Write the artifact** using the Write tool (no Edit — this skill writes once per run).
+
+**Finding IDs**: lens-prefix + ordinal, stable across severity moves. `I` = interaction, `Q` = quality, `S` = security, `G` = gap. Ordinals never renumber — if a finding is dropped by Step 6, its ID is retired, not reused.
+
+**Title-line annotations** (appear in square brackets on the finding's title line, point-of-demand for reconciliation facts):
+- `[precedent-weighted]` — severity bumped by Step 5's precedent follow-up weighting.
+- `[cascade: <kind>]` — severity set by Step 5's cascade-detection triple (`stranded-state`, `duplicate-processing`, `contradictory-predicate-deadlock`).
+- `[subsumed-by <ID>]` — root-cause subsumed by another finding but kept because its specific evidence is independently actionable.
+
+**Section-omission rules**: omit entirely (no empty placeholders) when —
+- `## 💭 Discussion` — no 💭 findings.
+- `## Pattern Analysis` — no peer-mirror pair existed for this diff.
+- `## Impact` — integration-scanner returned no inbound refs to changed files.
+- `## Precedents` — precedent-locator returned no precedents.
+
+**What is NOT emitted to the artifact**: verification outcomes in prose (frontmatter `verification` string is the only channel), advisor availability / dispatch path / tool failures (skill trace, not review content), `last_updated` / `last_updated_by` (git mtime + git author carry this for a write-once artifact).
+
+**Advisor prose**, when advisor ran, is pasted verbatim as a blockquote at the top of `## Recommendation`, not as a standalone section.
+
+**Template shape**: Read the full template at `templates/review.md` (house pattern per `.rpiv/guidance/skills/architecture.md:66` — `templates/` subfolder, runtime-read, never inlined). At emission time: Read `templates/review.md`, fill every `{placeholder}` with reconciled-and-verified values from Steps 5 and 6, apply the section-omission rules above (delete the whole section AND its trailing separator line when its input is empty), strip the leading `<!-- -->` comment, and Write the result to the target path.
+
+## Step 8: Present Summary
+
+```
+Review written to:
+`thoughts/shared/reviews/{filename}.md`
+
+Severity:     {C} critical · {I} important · {S} suggestions
+Lenses:       {Q} quality · {Se} security · {D} dependencies
+Verification: {V} verified · {W} weakened · {F} falsified (dropped)
+Advisor:      {adjudicated | inline}
+Status:       {approved | needs_changes | requesting_changes}
+
+Top items:
+1. {ID} — `file:line` — {headline}
+2. {ID} — `file:line` — {headline}
+3. {ID} — `file:line` — {headline}
+
+Ask follow-ups, or chain forward.
+
+---
+
+💬 Follow-up: describe the question in chat to append a timestamped Follow-up section. Retired IDs stay retired; re-run `/skill:code-review` for a fresh review.
+
+**Next step:** `/skill:design "Address findings from thoughts/shared/reviews/{filename}.md"` — run the design phase over the review document to produce a fix plan (only when status is `needs_changes` or `requesting_changes`).
+
+> 🆕 Tip: start a fresh session with `/new` first — chained skills work best with a clean context window.
+```
+
+## Step 9: Handle Follow-ups
+
+- **Append, never rewrite.** Edit the artifact to add a `## Follow-up {ISO 8601 timestamp}` section. The section heading's timestamp is the append-time record — no frontmatter update needed.
+- **Re-dispatch narrowly.** Spawn a single targeted `codebase-analyzer` on the area in question (1 agent max).
+- **Retired IDs stay retired.** Findings dropped at Step 6 (Falsified) do not re-enter follow-ups; new findings introduce new IDs with the same lens-prefix scheme (next ordinal).
+- **When to re-invoke instead.** If the diff itself changed (new commits, scope shift, different branch), re-run `/skill:code-review` for a fresh review. The previous block's `Next step:` stays valid for the existing review.
+
+## Important Notes
+
+- **Frontmatter**: `allowed-tools` is intentionally omitted — the skill inherits `Agent`, `ask_user_question`, `advisor`, `Write`, `web_search`, `todo`. Do NOT re-add the line.
+- **Security-lens precision stance**: prefer false negatives. Evidence must carry `confidence ≥ 8`; 🔴 requires an explicit source→sink trace. Missing hardening without a traced sink is NOT a finding.
+- **Load-bearing ordering**:
+  - Wave-1 fans out at T=0 — integration-scanner, Precedents, (when `ManifestChanged`) Dependencies + CVE, and (when `len(PeerPairs) > 0`) the peer-mirror agent dispatch in a single multi-Agent message. integration-scanner AND peer-mirror gate Wave-2 (both feed the Discovery Map Wave-2 consumes); **Precedents is a hard gate on Step 5** (its follow-up-within-30-days counts drive severity weighting; reconciling without them produces mis-weighted severities the verification pass cannot correct); Dependencies + CVE soft-gate Step 5.
+  - Step 4a (Predicate-Trace) and 4b (Interaction Sweep) dispatch in parallel once Wave-2 completes; 4c (Gap-Finder) is orchestrator-side coverage arithmetic — no agent. Interaction Sweep (4b) receives Quality's `Predicate-set coherence` table as its predicate-row source, not 4a's output.
+  - When Quality's `Predicate-set coherence` surface returns ≥2 rows with mismatched values on the same enum/type, the 4b sweep MUST evaluate categories 7–9 against those rows.
+  - **File orientation is load-bearing**: patches MUST use `-U30` (or `-U10` fallback for >1MB patches), never `-U0`. The Discovery Map's semantic file map (clusters + role tags + symbols-touched hint) is the orientation primitive, not per-hunk line ranges. Lens prompts organise findings per file (`### file/path.ext`), not per hunk. Agents SHOULD NOT issue extra `Read` calls for files already represented in the patch unless specifically needed for a cross-file trace.
+  - **Wave-2 context isolation**: Quality and Security prompts MUST NOT include Wave-1 background-agent output (precedent-locator, Dependencies, CVE) even when those agents have finished before Wave-2 dispatches. Summary context from those agents causes the lens agents to narrativise instead of independently analyse the diff — the observed failure mode is a ~5× speedup coupled with hallucinated findings and mis-cited line numbers. Pass only Discovery Map + patch file path.
+  - ALWAYS emit `## Pre-Adjudication Findings` to the main branch BEFORE calling `advisor()` — advisor reads `getBranch()` (main-thread-only, `packages/rpiv-advisor/advisor.ts:336`).
+  - ALWAYS probe advisor availability before calling it (strip-when-unconfigured at `packages/rpiv-advisor/advisor.ts:463-472`).
+  - NEVER call `advisor()` from a sub-agent (branch invisible to advisor).
+  - NEVER parse advisor prose — paste verbatim as a blockquote at the top of `## Recommendation`.
+  - ALWAYS wait for 4a / 4b AND the Precedents agent to complete before Step 5 — Wave-3's hard barrier. 4c is synchronous (orchestrator). Dependencies + CVE wait here too when running, but are not individually hard-gated.
+  - ALWAYS run Step 6 (verification pass) between reconciliation and artifact write. It is the only mechanism that catches lens agents asserting claims they never opened a file to confirm, and the only mechanism that validates `resolved-by` annotations against the actual branch via `git merge-base --is-ancestor`. Skipping Step 6 silently re-admits the failure mode this skill was designed to prevent.
+  - PRESERVE severity emoji/naming and frontmatter keys verbatim — `thoughts-locator` / `thoughts-analyzer` grep these.
+  - Bundled row-only specialists at narrativisation-prone sites: `diff-auditor` (Wave-2 Q+S), `peer-comparator` (Wave-1 PM), `claim-verifier` (Step 6). See `.rpiv/guidance/agents/architecture.md`.
+  - **Scope strategy is load-bearing at both ends**: Step 1 sets `strategy` and `FP_FLAG`; Step 6 pre-filters the reconciled severity map against `InScopeFiles` before `claim-verifier` dispatch. `--first-parent` is orthogonal to `--no-merges` / `-U30` — additive, not a replacement. Agent contracts (`claim-verifier.md:11-30` in particular) stay scope-blind by design; orchestrator owns scope.
+- **Agent roles**:
+  - `integration-scanner` (Wave-1) — inbound/outbound refs, auth-boundary crossings.
+  - `precedent-locator` (Wave-1) — git history + thoughts/.
+  - `codebase-analyzer` ×1 (Wave-1, `ManifestChanged`) — dependencies parse.
+  - `web-search-researcher` (Wave-1, `ManifestChanged`) — CVE/advisory lookups with LINKS.
+  - `peer-comparator` ×1 (Wave-1, gated on `len(PeerPairs) > 0`) — peer-mirror check; tags Mirrored/Missing/Diverged/Intentionally-absent.
+  - `diff-auditor` ×2 (Wave-2) — Quality, Security.
+  - `codebase-analyzer` ×1 (Step 4a, gated) — predicate-trace.
+  - `codebase-analyzer` ×1 (Step 4b, gated) — interaction sweep.
+  - *(Step 4c, gated)* — gap-finder runs inline in the orchestrator (set arithmetic over coverage map; no agent).
+  - `claim-verifier` ×1 (Step 6, always) — verification pass (grounds every reconciled finding at its cited `file:line`; tags Verified / Weakened / Falsified; Falsified dropped, Weakened demoted one tier).
--- a/extensions/rpiv-pi/skills/code-review/templates/review.md
+++ b/extensions/rpiv-pi/skills/code-review/templates/review.md
@@ -0,0 +1,152 @@
+<!-- Emitted by code-review SKILL.md Step 7. Placeholders in {braces} are filled at emission; section-omission rules live inline in SKILL.md. -->
+---
+template_version: 2
+date: {Current date and time with timezone in ISO format}
+author: {User from injected git context}
+repository: {Repository name}
+branch: {Current branch name}
+commit: {Current commit hash}
+review_type: {commit | pr | staged | working}
+scope: "{What was reviewed}"
+scope_strategy: {first-parent | explicit-range | working-tree}
+in_scope_files_count: {N}
+status: {approved | needs_changes | requesting_changes}
+severity: { critical: {C}, important: {I}, suggestion: {S} }
+verification: { verified: {V}, weakened: {W}, falsified: {F} }
+blockers_count: {B}
+tags: [code-review, relevant-components]
+---
+
+# Code Review — {Scope}
+
+**Commit:** `{hash}` · **Status:** `{status}` · **Findings:** {C}🔴 · {I}🟡 · {S}🔵 · **Verification:** {V}✓ / {W}− / {F}✗
+
+## Top Blockers
+
+1. **{ID}** — {one-line headline}
+2. **{ID}** — {one-line headline}
+
+---
+
+## Legend
+
+```text
+Severity    🔴 fix before merge   🟡 fix soon   🔵 nice to have   💭 discuss
+ID prefix   I interaction   Q quality   S security   G gap
+Verify      ✓ verified   − weakened (demoted)   ✗ falsified (dropped)
+Annotate    [precedent-weighted]   [cascade: <kind>]   [subsumed-by <ID>]
+```
+
+---
+
+## 🔴 Critical
+
+### {ID} 🔴 {short headline} `{annotation?}`
+
+**Where**
+`{file:line}`
+
+**Code**
+```{lang}
+{verbatim line(s) from the file}
+```
+
+**Why**
+{1–2 sentences: mechanism, not symptom}
+
+**Fix**
+{one sentence, imperative}
+
+**Alt**
+{optional: alternative fix}
+
+---
+
+## 🟡 Important
+
+### {ID} 🟡 {short headline} `{annotation?}`
+
+**Where**
+`{file:line}`
+
+**Code**
+```{lang}
+{verbatim line(s)}
+```
+
+**Why**
+{mechanism}
+
+**Fix**
+{action}
+
+---
+
+## 🔵 Suggestions
+
+### {ID} 🔵 {short headline}
+
+**Where**
+`{file:line}`
+
+**Fix**
+{action}
+
+---
+
+## 💭 Discussion
+
+### {ID} 💭 {question / architectural concern}
+
+**Where**
+`{file:line}`
+
+**Why**
+{what the reviewer wants the author to consider}
+
+---
+
+## Pattern Analysis
+
+| Peer            | Mirrored | Missing | Diverged | Intentional |
+| --------------- | -------: | ------: | -------: | ----------: |
+| `{peer file}`   |      {M} |    {Mi} |      {D} |         {A} |
+
+**Missing/Diverged rows drive:** {finding IDs}
+
+**Key divergences from peer**
+- {divergence one}
+- {divergence two}
+
+---
+
+## Impact
+
+| Consumer        | Change           | Findings |
+| --------------- | ---------------- | -------- |
+| `{file:line}`   | {change class}   | {IDs}    |
+
+---
+
+## Precedents
+
+| Commit    | Subject          | Follow-ups                                              |
+| --------- | ---------------- | ------------------------------------------------------- |
+| `{hash}`  | {commit subject} | {30d follow-ups, or "NOT ancestor of {TIP}", or note}   |
+
+**Recurring lessons (most → least frequent)**
+
+1. {composite lesson}
+2. ...
+
+---
+
+## Recommendation
+
+> (advisor prose pasted verbatim here as a blockquote when advisor ran; omit the blockquote otherwise)
+
+| # | ID     | Action                      | Alt / Note        |
+| - | ------ | --------------------------- | ----------------- |
+| 1 | {ID}   | {action, one sentence}      | {alternative}     |
+| 2 | {ID}   | {action}                    | —                 |
+| 3 | {ID}   | {action}                    | —                 |
--- a/extensions/rpiv-pi/skills/commit/SKILL.md
+++ b/extensions/rpiv-pi/skills/commit/SKILL.md
@@ -0,0 +1,65 @@
+---
+name: commit
+description: Create structured git commits by analyzing staged and unstaged changes and grouping them logically into one or more commits with clear, descriptive messages. Use when the user asks to commit, says "commit this" or "commit my changes", wants help writing a commit message, or has finished a chunk of work that needs committing.
+argument-hint: [message]
+allowed-tools: Bash(git *), Read, Glob, Grep
+---
+
+# Commit Changes
+
+You are tasked with creating git commits for repository changes.
+
+## Commit hint
+
+`$ARGUMENTS` (empty/literal → infer from history and `git diff`)
+
+## Context:
+- **In-session**: If there's conversation history, use it to understand what was built/changed
+- **Standalone**: If no context available, rely entirely on git state and file inspection
+
+## Process:
+
+0. **Check git availability:**
+   - Run `git status --short` to determine whether the current directory is a git repository
+   - If not a git repo, tell the user: "This directory is not a git repository. Run `git init` to initialize one."
+   - Stop — do not proceed with commit.
+
+1. **Think about what changed:**
+   - **If in-session**: Review the conversation history to understand what was accomplished
+   - **Always**: Run `git diff` to understand the modifications in detail
+   - If needed, inspect file contents to understand purpose and scope
+   - Consider whether changes should be one commit or multiple logical commits
+
+2. **Plan your commit(s):**
+   - Identify which files belong together
+   - Draft clear, descriptive commit messages
+   - Use imperative mood in commit messages
+   - Focus on why the changes were made, not just what
+   - Check for sensitive information (API keys, credentials) before committing
+
+3. **Present your plan to the user:**
+   - List the files you plan to add for each commit
+   - Show the commit message(s) you'll use
+   - Use the `ask_user_question` tool to confirm the commit plan. Question: "{N} commit(s) with {M} files. Proceed?". Header: "Commit". Options: "Commit (Recommended)" (Create the commit(s) as planned); "Adjust" (Change the grouping or commit messages); "Review files" (Show me the full diff before committing).
+
+4. **Execute upon confirmation:**
+   - Use `git add` with specific files (never use `-A` or `.`)
+   - Create commits with your planned messages
+   - Show the result with `git log --oneline -n X` (where X = number of commits you just created)
+
+## Important:
+
+- **NEVER add co-author information or Claude attribution**
+- Commits should be authored solely by the user
+- Do not include any "Generated with Claude" messages
+- Do not add "Co-Authored-By" lines
+- Write commit messages as if the user wrote them
+
+## Remember:
+
+- Adapt your approach: use conversation context if available, otherwise infer from git state
+- In-session: you have full context of what was done; Standalone: infer from git analysis
+- Group related changes by purpose (feature, fix, refactor, docs)
+- Keep commits atomic: one logical change per commit
+- Split into multiple commits if: different features, mixing bugs with features, or unrelated concerns
+- The user trusts your judgment - they asked you to commit
--- a/extensions/rpiv-pi/skills/create-handoff/SKILL.md
+++ b/extensions/rpiv-pi/skills/create-handoff/SKILL.md
@@ -0,0 +1,96 @@
+---
+name: create-handoff
+description: Create a context-preserving handoff document for session transitions, compacting the current task, decisions made, in-flight changes, and open questions into a single concise file so a fresh session can pick up where this one left off. Use when the user invokes /create-handoff, says context is getting large, asks to wrap up the session, or wants to hand off work to another session.
+argument-hint: [description]
+allowed-tools: Read, Write, Bash(git *), Glob, Grep
+disable-model-invocation: true
+---
+
+# Create Handoff
+
+You are tasked with writing a handoff document to hand off your work to another agent in a new session. You will create a handoff document that is thorough, but also **concise**. The goal is to compact and summarize your context without losing any of the key details of what you're working on.
+
+
+## Process
+### 1. Filepath & Metadata
+Use the following information to understand how to create your document:
+    - create your file under `thoughts/shared/handoffs/YYYY-MM-DD_HH-MM-SS_description.md`, where:
+        - YYYY-MM-DD / HH-MM-SS come from the `date` command (see below)
+        - description is a brief kebab-case description
+     - Repository name: from git root basename, or current directory basename if not a git repo
+     - Use the git branch and commit from the git context injected at the start of the session (or run `git branch --show-current` / `git rev-parse --short HEAD` directly)
+     - Timestamp: run `date +"%Y-%m-%dT%H:%M:%S%z"` — raw for `date:` and `last_updated:`, first 19 chars (`T`→`_`, `:`→`-`) for filename slug.
+     - Author: use the User from the git context injected at the start of the session (fallback: "unknown")
+     - If metadata unavailable: use "unknown" for commit/branch
+    - Examples:
+        - `thoughts/shared/handoffs/2025-01-08_13-55-22_create-context-compaction.md`
+
+### 2. Handoff writing.
+using the above conventions, write your document. use the defined filepath, and the following YAML frontmatter pattern. Use the metadata gathered in step 1, Structure the document with YAML frontmatter followed by content:
+
+Use the following template structure:
+```markdown
+---
+date: {Current date and time with timezone in ISO format}
+author: {Author name from thoughts status}
+commit: {Current commit hash}
+branch: {Current branch name}
+repository: {Repository name}
+topic: "{Feature/Task Name} {Work Type}" # Customize work type: Implementation Strategy, Bug Fix, Research, Feature Implementation, etc.
+tags: [implementation, strategy, relevant-component-names]
+status: complete
+last_updated: {Same ISO timestamp as `date:` above}
+last_updated_by: {Author name}
+type: {work_type} # Options: implementation_strategy, bug_fix, research, refactoring, feature_development, etc.
+---
+
+# Handoff: {very concise description}
+
+## Task(s)
+{description of the task(s) that you were working on, along with the status of each (completed, work in progress, planned/discussed). If you are working on an implementation plan, make sure to call out which phase you are on. Make sure to reference the plan document and/or research document(s) you are working from that were provided to you at the beginning of the session, if applicable.}
+
+## Critical References
+{List any critical specification documents, architectural decisions, or design docs that must be followed. Include only 2-3 most important file paths. Leave blank if none.}
+
+## Recent changes
+{describe recent changes made to the codebase that you made in file:line syntax}
+
+## Learnings
+{describe important things that you learned - e.g. patterns, root causes of bugs, or other important pieces of information someone that is picking up your work after you should know. consider listing explicit file paths.}
+
+## Artifacts
+{ an exhaustive list of artifacts you produced or updated as filepaths and/or file:line references - e.g. paths to feature documents, implementation plans, etc that should be read in order to resume your work.}
+
+## Action Items & Next Steps
+{ a list of action items and next steps for the next agent to accomplish based on your tasks and their statuses}
+
+## Other Notes
+{ other notes, references, or useful information - e.g. where relevant sections of the codebase are, where relevant documents are, or other important things you learned that you want to pass on but that don't fall into the above categories}
+```
+---
+
+### 3. Approve
+Save the document.
+
+Once this is completed, you should respond to the user with the template between <template_response></template_response> XML tags. do NOT include the tags in your response.
+
+<template_response>
+Handoff written to:
+`thoughts/shared/handoffs/YYYY-MM-DD_HH-MM-SS_description.md`
+
+Replace the path below with your actual handoff file path before running.
+
+---
+
+💬 Follow-up: describe extra context in chat to append to this handoff before chaining; re-run `/skill:create-handoff` for a fresh handoff document.
+
+**Next step:** `/skill:resume-handoff thoughts/shared/handoffs/YYYY-MM-DD_HH-MM-SS_description.md` — pick up where this session left off in a fresh context.
+
+> 🆕 Tip: start a fresh session with `/new` first — chained skills work best with a clean context window.
+</template_response>
+
+---
+## Additional Notes & Instructions
+- **more information, not less**. This is a guideline that defines the minimum of what a handoff should be. Always feel free to include more information if necessary.
+- **be thorough and precise**. include both top-level objectives, and lower-level details as necessary.
+- **avoid excessive code snippets**. While a brief snippet to describe some key change is important, avoid large code blocks or diffs; do not include one unless it's necessary (e.g. pertains to an error you're debugging). Prefer using `/path/to/file.ext:line` references that an agent can follow later when it's ready, e.g. `packages/dashboard/src/app/dashboard/page.tsx:12-24`
--- a/extensions/rpiv-pi/skills/design/SKILL.md
+++ b/extensions/rpiv-pi/skills/design/SKILL.md
@@ -0,0 +1,419 @@
+---
+name: design
+description: Design complex features by decomposing them into vertical slices and producing a design artifact (architecture decisions, slice breakdown, file map) in thoughts/shared/designs/. The design feeds the plan or blueprint skill. Use for complex multi-component features touching 6+ files across multiple layers, when the user wants a feature designed before implementation. Requires a research artifact or a solutions artifact (from explore). Prefer design over plan or blueprint when the focus is architecture and decomposition rather than phased execution steps.
+argument-hint: [research artifact path]
+---
+
+# Design
+
+You are tasked with designing how code will be shaped for a feature or change. This iterative variant decomposes features into vertical slices and generates code slice-by-slice with developer micro-checkpoints between slices. The design artifact feeds directly into plan, which sequences it into phases.
+
+**How it works**:
+- Read input and key source files into context (Step 1)
+- Spawn targeted research agents for depth analysis (Step 2)
+- Identify ambiguities — triage into simple decisions and genuine ambiguities (Step 3)
+- Holistic self-critique — review the combined design for gaps and contradictions (Step 4)
+- Developer checkpoint — resolve genuine ambiguities one at a time (Step 5)
+- Decompose into vertical slices holistically before generating code (Step 6)
+- Generate code slice-by-slice with developer micro-checkpoints (Step 7)
+- Verify cross-slice integration consistency (Step 8)
+- Finalize the design artifact (Step 9)
+- Review and iterate with the developer (Step 10)
+
+The final artifact is plan-compatible.
+
+## Step 1: Input Handling
+
+When this command is invoked:
+
+1. **Read research artifact**:
+
+   **Research artifact provided** (argument contains a path to a `.md` file in `thoughts/`):
+   - Read the research artifact FULLY using the Read tool WITHOUT limit/offset
+   - Extract: Summary, Code References, Integration Points, Architecture Insights, Developer Context, Open Questions
+   - **Read the key source files from Code References** into the main context — especially hooks, shared utilities, and integration points the design will depend on. Read them FULLY. This ensures you have complete understanding before proceeding.
+   - These become starting context — no need to re-discover what exists
+   - Research Developer Context Q/As = inherited decisions (record in Decisions, never re-ask); Open Questions = starting ambiguity queue, filtered by dimension in Step 3
+
+   **No arguments provided**:
+   ```
+   I'll design a feature iteratively from a research artifact. Please provide:
+
+   `/skill:design [research artifact] [task description]`
+
+   Research artifact is required. Task description is optional.
+   ```
+   Then wait for input.
+
+2. **Read any additional files mentioned** — tickets, related designs, existing implementations. Read them FULLY before proceeding.
+
+## Step 2: Targeted Research
+
+This is NOT a discovery sweep. Focus on DEPTH (how things work, what patterns to follow) not BREADTH (where things are).
+
+1. **Spawn parallel research agents** using the Agent tool:
+
+   - Use **codebase-pattern-finder** to find existing implementations to model after — the primary template for code shape
+   - Use **codebase-analyzer** to understand HOW integration points work in detail
+   - Use **integration-scanner** to map the wiring surface — inbound refs, outbound deps, config/DI/event registration
+   - Use **precedent-locator** to find similar past changes in git history — what commits introduced comparable features, what broke, and what lessons apply to this design. Only when `commit` is available (not `no-commit`); otherwise skip and note "git history unavailable" in Verification Notes.
+
+   **Novel work** (new libraries, first-time patterns, no existing codebase precedent):
+   - Add **web-search-researcher** for external documentation, API references, and community patterns
+   - Instruct it to return LINKS with findings — include those links in the final design artifact
+
+   Agent prompts should focus on (labeled by target agent):
+   - **codebase-pattern-finder**: "Find the implementation pattern I should model after for {feature type}"
+   - **codebase-analyzer**: "How does {integration point} work in detail"
+   - **integration-scanner**: "What connects to {component} — inbound refs, outbound deps, config"
+
+   NOT: "Find all files related to X" — that's discovery's job, upstream of this skill.
+
+2. **Read all key files identified by agents** into the main context — especially the pattern templates you'll model after.
+
+3. **Wait for ALL agents to complete** before proceeding.
+
+4. **Analyze and verify understanding**:
+   - Cross-reference research findings with actual code read in Step 1
+   - Identify any discrepancies or misunderstandings
+   - Note assumptions that need verification
+   - Determine true scope based on codebase reality
+
+## Step 3: Identify Ambiguities — Dimension Sweep
+
+Walk Step 2 findings, inherited research Q/As, and carried Open Questions through six architectural dimensions that map 1:1 to `plan` extract sections — the sweep guarantees downstream completeness. Add **migration** as a seventh dimension only if the feature changes persisted schema.
+
+- **Data model** — types, schemas, entities
+- **API surface** — signatures, exports, routes
+- **Integration wiring** — mount points, DI, events, config
+- **Scope** — in / explicitly deferred
+- **Verification** — tests, assertions, risk-bearing behaviors
+- **Performance** — load paths, caching, N+1 risks
+
+For each dimension, classify findings as **simple decisions** (one valid option, obvious from codebase — record in Decisions with `file:line` evidence, do not ask) or **genuine ambiguities** (multiple valid options, conflicting patterns, scope questions, novel choices — queue for Step 5). Inherited research Q/As land as simple; Open Questions filter by dimension — architectural survives, implementation-detail defers.
+
+**Pre-validate every option** before queuing it against research constraints and runtime code behavior. Eliminate or caveat options that contradict Steps 1-2 evidence. **Coverage check**: every Step 2 file read appears in at least one decision or ambiguity; every dimension is addressed (silently-resolved valid, skipped-unchecked not).
+
+## Step 4: Holistic Self-Critique
+
+Before presenting ambiguities to the developer, review the combined design picture holistically. Step 3 triages findings individually — this step checks whether they fit together as a coherent whole.
+
+**Prompt yourself:**
+- What's inconsistent, missing, or contradictory across the research findings, resolved decisions, and identified ambiguities?
+- What edge cases or failure modes aren't covered by any ambiguity or decision?
+- Do any patterns from different agents conflict when combined?
+
+**Areas to consider** (suggestive, not a checklist):
+- Requirement coverage — is every requirement from Step 1 addressed by at least one decision or ambiguity?
+- Cross-cutting concerns — do error handling, state management, or performance span multiple ambiguities without being owned by any?
+- Pattern coherence — do the simple decisions from Step 3 still hold when viewed together, or does a combination reveal a conflict?
+- Ambiguity completeness — did Step 3 miss a genuine ambiguity by treating a multi-faceted issue as simple?
+
+**Remediation:**
+- Issues you can resolve with evidence: fix in-place — reclassify simple decisions as genuine ambiguities, or resolve a genuine ambiguity as simple if holistic review provides clarity. Note what changed.
+- Issues that need developer input: add as new genuine ambiguities to the Step 5 checkpoint queue.
+- If no issues found: proceed to Step 5 with the existing ambiguity set.
+
+## Step 5: Developer Checkpoint
+
+Use the grounded-questions-one-at-a-time pattern. Use a **❓ Question:** prefix so the developer knows their input is needed. Each question must:
+- Reference real findings with `file:line` evidence
+- Present concrete options (not abstract choices)
+- Pull a DECISION from the developer, not confirm what you already found
+
+**Question patterns by ambiguity type:**
+
+- **Pattern conflict**: "Found 2 patterns for {X}: {pattern A} at `file:line` and {pattern B} at `file:line`. They differ in {specific way}. Which should the new {feature} follow?"
+- **Missing pattern**: "No existing {pattern type} in the codebase. Options: (A) {approach} modeled after {external reference}, (B) {approach} extending {existing code at file:line}. Which fits the project's direction?"
+- **Scope boundary**: "The {research/description} mentions both {feature A} and {feature B}. Should this design cover both, or just {feature A} with {feature B} deferred?"
+- **Integration choice**: "{Feature} can wire into {point A} at `file:line` or {point B} at `file:line`. {Point A} matches the {existing pattern} pattern. Agree, or prefer {point B}?"
+- **Novel approach**: "No existing {X} in the project. Options: (A) {library/pattern} — {evidence/rationale}, (B) {library/pattern} — {evidence/rationale}. Which fits?"
+
+**Critical rules:**
+- Ask ONE question at a time. Wait for the answer before asking the next.
+- Lead with the most architecturally significant ambiguity.
+- Every answer becomes a FIXED decision — no revisiting unless the developer explicitly asks.
+
+**Choosing question format:**
+
+- **`ask_user_question` tool** — when your question has 2-4 concrete options from code analysis (pattern conflicts, integration choices, scope boundaries, priority overrides). The user can always pick "Other" for free-text. Example:
+
+  > Use the `ask_user_question` tool with the following question: "Found 2 mapping approaches — which should new code follow?". Header: "Pattern". Options: "Manual mapping (Recommended)" (Used in OrderService (src/services/OrderService.ts:45) — 8 occurrences); "AutoMapper" (Used in UserService (src/services/UserService.ts:12) — 2 occurrences).
+
+- **Free-text with ❓ Question: prefix** — when the question is open-ended and options can't be predicted (discovery, "what am I missing?", corrections). Example:
+  "❓ Question: Integration scanner found no background job registration for this area. Is that expected, or is there async processing I'm not seeing?"
+
+**Batching**: When you have 2-4 independent questions (answers don't depend on each other), you MAY batch them in a single `ask_user_question` call. Keep dependent questions sequential.
+
+**Classify each response:**
+
+**Decision** (e.g., "use pattern A", "yes, follow that approach"):
+- Record in Developer Context. Fix in Decisions section.
+
+**Correction** (e.g., "no, there's a third option you missed", "check the events module"):
+- Spawn targeted rescan: **codebase-analyzer** on the new area (max 1-2 agents).
+- Merge results. Update ambiguity assessment.
+
+**Scope adjustment** (e.g., "skip the UI, backend only", "include tests"):
+- Record in Developer Context. Adjust scope.
+
+**After all ambiguities are resolved**, present a brief design summary (under 15 lines):
+
+```
+Design: {feature name}
+Approach: {1-2 sentence summary of chosen architecture}
+
+Decisions:
+- {Decision 1}: {choice} — modeled after `file:line`
+- {Decision 2}: {choice}
+- {Decision 3}: {choice}
+
+Scope: {what's in} | Not building: {what's out}
+Files: {N} new, {M} modified
+```
+
+Use the `ask_user_question` tool to confirm before proceeding. Question: "{Summary from design brief above}. Ready to proceed to decomposition?". Header: "Design". Options: "Proceed (Recommended)" (Decompose into vertical slices, then generate code slice-by-slice); "Adjust decisions" (Revisit one or more architectural decisions above); "Change scope" (Add or remove items from the building/not-building lists).
+
+## Step 6: Feature Decomposition
+
+After the design summary is confirmed, decompose the feature into vertical slices. Each slice is a self-contained unit: types + implementation + wiring for one concern.
+
+1. **Decompose holistically** — define ALL slices, dependencies, and ordering before generating any code:
+
+   ```
+   Feature Breakdown: {feature name}
+
+   Slice 1: {name} — {what this slice delivers}
+     Files: path/to/file.ext (NEW), path/to/file.ext (MODIFY)
+     Depends on: nothing (foundation)
+
+   Slice 2: {name} — {what this slice delivers}
+     Files: path/to/file.ext (NEW), path/to/file.ext (MODIFY)
+     Depends on: Slice 1
+
+   Slice 3: {name} — {what this slice delivers}
+     Files: path/to/file.ext (NEW)
+     Depends on: Slice 2
+   ```
+
+2. **Slice properties**:
+   - End-to-end vertical: each slice is a complete cross-section of one concern (types + impl + wiring)
+   - ~512-1024 tokens per slice (maps to individual file blocks)
+   - Sequential: each builds on the previous (never parallel)
+   - Foundation first: types/interfaces always Slice 1
+
+3. **Confirm decomposition** using the `ask_user_question` tool. Question: "{N} slices for {feature}. Slice 1: {name} (foundation). Slices 2-N: {brief}. Approve decomposition?". Header: "Slices". Options: "Approve (Recommended)" (Proceed to slice-by-slice code generation); "Adjust slices" (Reorder, merge, or split slices before generating); "Change scope" (Add or remove files from the decomposition).
+
+4. **Create skeleton artifact** — immediately after decomposition is approved:
+   - Determine metadata: filename `thoughts/shared/designs/YYYY-MM-DD_HH-MM-SS_topic.md`, repository name from git root, branch and commit from the git context injected at the start of the session (fallbacks: "no-branch" / "no-commit"), author from the injected User (fallback: "unknown")
+   - Timestamp: run `date +"%Y-%m-%dT%H:%M:%S%z"` — raw for `date:` and `last_updated:`, first 19 chars (`T`→`_`, `:`→`-`) for filename slug.
+   - Write skeleton using the Write tool with `status: in-progress` in frontmatter
+   - **Include all prose sections filled** from Steps 1-5: Summary, Requirements, Current State Analysis, Scope, Decisions, Desired End State, File Map, Ordering Constraints, Verification Notes, Performance Considerations, Migration Notes, Pattern References, Developer Context, References
+   - **Architecture section**: one `### path/to/file.ext — NEW/MODIFY` heading per file from the decomposition, with empty code fences as placeholders
+   - **Design History section**: list all slices with `— pending` status
+   - This is the living artifact — all subsequent writes use the Edit tool
+
+   **Artifact template sections** (all required in skeleton):
+
+   - **Frontmatter**: date, author, commit, branch, repository, topic, tags, `status: in-progress`, parent, last_updated, last_updated_by
+   - **# Design: {Feature Name}**
+   - **## Summary**: 2-3 sentences — what we're building and the chosen architectural approach. Settled decision, not a discussion.
+   - **## Requirements**: Bullet list from ticket, research, or developer input.
+   - **## Current State Analysis**: What exists now, what's missing, key constraints. Include `### Key Discoveries` with `file:line` references, patterns to follow, constraints to work within.
+   - **## Scope**: `### Building` — concrete deliverables. `### Not Building` — developer-stated exclusions AND likely scope-creep vectors (alternative architectures not chosen, nearby code that looks related but shouldn't be touched).
+   - **## Decisions**: `###` per decision. Complex: Ambiguity → Explored (Option A/B with `file:line` + pro/con) → Decision. Simple: just state decision with evidence.
+   - **## Architecture**: `###` per file with NEW/MODIFY label. Empty code fences in skeleton (filled in Step 7d). NEW files get full implementation. MODIFY files get only modified/added code — no "Current" block.
+   - **## Desired End State**: Usage examples showing the feature in use from a consumer's perspective — concrete code, not prose.
+   - **## File Map**: `path/to/file.ext  # NEW/MODIFY — purpose` per line.
+   - **## Ordering Constraints**: What must come before what. What can run in parallel.
+   - **## Verification Notes**: Carry forward from research — known risks, build/test warnings, precedent lessons. Format as verifiable checks (commands, grep patterns, visual inspection). plan converts these to success criteria.
+   - **## Performance Considerations**: Any performance implications or optimizations.
+   - **## Migration Notes**: If applicable — existing data, schema changes, rollback strategy, backwards compatibility. Empty if not applicable.
+   - **## Pattern References**: `path/to/similar.ext:line-range` — what pattern to follow and why.
+   - **## Developer Context**: Record questions exactly as asked during checkpoint, including `file:line` evidence. For iterative variant: also record micro-checkpoint interactions from Step 7c.
+   - **## Design History**: Slice approval/revision log. `- Slice N: {name} — pending/approved as generated/revised: {what changed}`. plan ignores this section.
+   - **## References**: Research artifacts, tickets, similar implementations.
+
+   **Architecture format in skeleton**:
+   - **NEW files**: heading + empty code fence (filled with full implementation in Step 7d)
+   - **MODIFY files**: heading with `file:line-range` + empty code fence (filled with only the modified code in Step 7d — no "Current" block, the original is on disk)
+
+## Step 7: Generate Slices (Iterative)
+
+Generate code one slice at a time. Each slice sees the fixed code from all previous slices.
+
+**For each slice in the decomposition (sequential order):**
+
+### 7a. Generate slice code (internal)
+
+Generate complete, copy-pasteable code for every file in this slice — but **hold it for the artifact, do NOT present full code to the developer**. The developer sees a condensed review in 7c; the full code goes into the artifact in 7d.
+
+- **New files**: complete code — imports, types, implementation, exports. Follow the pattern template from Step 2.
+- **Modified files**: read current file FULLY, generate only the modified/added code scoped to changed sections (no full "Current" block — the original is on disk)
+- **Test files**: complete test suites following project patterns
+- **Wiring**: show where new code hooks into existing code
+
+If additional context is needed, spawn a targeted **codebase-analyzer** agent.
+
+No pseudocode, no TODOs, no placeholders — the code must be copy-pasteable by implement.
+
+**Context grounding** (after slice 2): Before generating, re-read the artifact's Architecture section for files this slice touches. The artifact is the source of truth — generate code that extends what's already there, not what you remember from conversation.
+
+### 7b. Self-verify slice
+
+Before presenting to the developer, cross-check this slice and produce a structured summary:
+
+```
+Self-verify Slice N:
+- Decisions: {OK / VIOLATION: decision X — fix applied}
+- Cross-slice: {OK / CONFLICT: file X has inconsistent types — fix applied}
+- Research: {OK / WARNING: constraint Y not satisfied — fix applied}
+```
+
+If violations found: fix in-place before presenting. Include the self-verify summary in the 7c checkpoint presentation.
+
+### 7c. Developer micro-checkpoint
+
+Present a **condensed review** of the slice — NOT the full generated code. The developer reviews the design shape, not every line. For each file in the slice, show:
+
+1. **Summary** (1-2 sentences): what changed, what pattern used, what it connects to
+2. **Signatures**: type/interface definitions, exported function signatures with parameter and return types
+3. **Key code blocks**: factory calls, wiring, non-obvious logic — the interesting parts that show the design decision in action
+
+**Omit**: boilerplate, import lists, full function bodies, obvious implementations.
+**MODIFY files**: focused diff (`- old` / `+ new`) with ~3 lines context. **Test files**: test case names only.
+
+**If the developer asks to see full code**, show it inline — exception, not default.
+
+Use the `ask_user_question` tool to confirm. Question: "Slice {N/M}: {slice name} — {files affected}. {1-line summary}. Approve?". Header: "Slice {N}". Options: "Approve (Recommended)" (Lock this slice, write to artifact, proceed to slice {N+1}); "Revise this slice" (Adjust code before proceeding — describe what to change); "Rethink remaining slices" (This slice reveals a design issue — revisit decomposition).
+
+**Checkpoint cadence**: Slices 1-2: always individual. Slices 3+: individual if (a) mid-generation agent spawn was needed, (b) MODIFY touches an undiscussed file, or (c) self-verify fixed a violation.
+Otherwise batch 2-3 slices (max 3).
+
+### 7d. Incorporate feedback
+
+**Approve**: Lock this slice's code and **Edit the artifact immediately**:
+1. For each file in this slice, Edit the skeleton artifact to replace the empty code fence under that file's Architecture heading with the full generated code from 7a
+2. If a later slice contributes to a file already filled by an earlier slice: **rewrite the entire code fence** with the merged result (do not append alongside existing code)
+3. After merge, verify: no duplicate function definitions, imports deduplicated, exports list complete
+4. Update the Design History section: `- Slice N: {name} — approved as generated`
+- Proceed to next slice
+
+**Revise**: Update code per developer feedback. Re-run self-verify (7b). Re-present the same slice (7c). The artifact is NOT touched — only "Approve" writes to the artifact.
+
+**Rethink**: Developer spotted a design issue. If a previously approved slice is affected, flag the conflict and offer cascade revision — developer decides whether to reopen (if yes, Edit artifact entry).
+Update decomposition (add/remove/reorder remaining slices) and confirm before continuing.
+
+## Step 8: Integration Verification
+
+After all slices are complete, review cross-slice consistency:
+
+1. **Present integration summary** (under 15 lines):
+   ```
+   Integration: {feature name} — {N} slices complete
+
+   Slices: {brief list of slice names and file counts}
+   Cross-slice: {types consistent / imports valid / wiring complete}
+   Research constraints: {all satisfied / N violations noted}
+   ```
+
+2. **Verify research constraints**: Check each Precedent & Lesson and Verification Note from the research artifact against the generated code. List satisfaction status.
+
+3. **Confirm using the `ask_user_question` tool**. Question: "{N} slices complete, {M} files total. Cross-slice consistency verified. Proceed to design artifact?". Header: "Verify". Options: "Proceed (Recommended)" (Finalize the design artifact (verify completeness, update status)); "Revisit slice" (Reopen a specific slice for revision — Edit the artifact after); "Add missing" (A file or integration point is missing — add to artifact).
+
+## Step 9: Finalize Design Artifact
+
+The artifact was created as a skeleton in Step 6 and filled progressively in Step 7d. This step verifies completeness and finalizes.
+
+1. **Verify all Architecture entries are filled**: Every file heading from the decomposition must have a non-empty code block. If any are still empty (e.g., a slice was skipped), generate and fill them now.
+
+2. **Verify cross-slice file merges**: For files touched by multiple slices, confirm the Architecture entry contains the final merged code, not just the last slice's contribution.
+
+3. **Update frontmatter** via Edit: set `status: complete`. `last_updated` and `last_updated_by` were set at skeleton creation — leave as-is.
+
+4. **Verify template completeness**: Ensure all 17 sections from the template reference in Step 6 are present and filled. Edit to fix any gaps.
+
+5. **Architecture format reminder**:
+   - **NEW files**: `### path/to/file.ext — NEW` + one-line purpose + full implementation code block
+   - **MODIFY files**: `### path/to/file.ext:line-range — MODIFY` + code block with only the modified/added code (no "Current" block — the original is on disk, implement reads it)
+
+## Step 10: Review & Iterate
+
+1. **Present the design artifact location**:
+   ```
+   Design artifact written to:
+   `thoughts/shared/designs/{filename}.md`
+
+   {N} architectural decisions fixed, {M} new files designed, {K} existing files modified.
+   {S} slices generated, {R} revisions during generation.
+
+   Please review and let me know:
+   - Are the architectural decisions correct?
+   - Does the code match what you envision?
+   - Any missing integration points or edge cases?
+
+   ---
+
+   💬 Follow-up: describe the change in chat to append a timestamped Follow-up section to this artifact. Re-run `/skill:design` for a fresh artifact.
+
+   **Next step:** `/skill:plan thoughts/shared/designs/{filename}.md` — sequence the design into implementation phases.
+
+   > 🆕 Tip: start a fresh session with `/new` first — chained skills work best with a clean context window.
+   ```
+
+## Step 11: Handle Follow-ups
+
+- **Edit in-place.** Use the Edit tool to update the design artifact directly — sliced design code stays one source of truth.
+- **Bump frontmatter.** Update `last_updated` + `last_updated_by`; set `last_updated_note: "Updated <brief description>"`.
+- **Sync decisions ↔ code.** If the change affects decisions, update both the Decisions section AND the Architecture code. Code is source of truth — if they conflict, the code wins, update the prose.
+- **Return to checkpoint on new ambiguities.** If new ambiguities surface, return to Step 5 (developer checkpoint) before re-generating slices.
+- **When to re-invoke instead.** If the underlying research is now stale or the feature scope changed materially, re-run `/skill:research` then `/skill:design` for a fresh artifact. The previous block's `Next step:` stays valid for the existing design.
+
+## Guidelines
+
+1. **Be Architectural**: Design shapes code; plans sequence work. Every decision must be grounded in `file:line` evidence from the actual codebase.
+
+2. **Be Interactive**: Don't produce the full design in one shot. Resolve ambiguities through the checkpoint first, get buy-in on the approach, THEN decompose and generate slice-by-slice.
+
+3. **Be Complete**: Code in the Architecture section must be copy-pasteable by implement. No pseudocode, no TODOs, no "implement here" placeholders. If you can't write complete code, an ambiguity wasn't resolved.
+
+4. **Be Skeptical**: Question vague requirements. If an existing pattern doesn't fit the new feature, say so and propose alternatives. Don't force a pattern where it doesn't belong.
+
+5. **Resolve Everything**: No unresolved questions in the final artifact. If something is ambiguous, ask during the checkpoint or micro-checkpoint. The design must be complete enough that plan can mechanically decompose it into phases.
+
+6. **Present Condensed, Persist Complete**: Micro-checkpoints show the developer summaries, signatures, and key code blocks. The artifact always contains full copy-pasteable code. If the developer asks to see full code, show it — but never default to walls of code in checkpoints.
+
+## Subagent Usage
+
+| Context | Agents Spawned |
+|---|---|
+| Default (research artifact provided) | codebase-pattern-finder, codebase-analyzer, integration-scanner, precedent-locator |
+| Novel work (new library/pattern) | + web-search-researcher |
+| During code writing (if needed) | targeted codebase-analyzer for specific files |
+
+Spawn multiple agents in parallel when they're searching for different things. Each agent runs in isolation — provide complete context in the prompt, including specific directory paths when the feature targets a known module. Don't write detailed prompts about HOW to search — just tell it what you're looking for and where.
+
+## Important Notes
+
+- **Always chained**: This skill requires a research artifact produced by the research skill. There is no standalone design mode.
+- **File reading**: Always read research artifacts and referenced files FULLY (no limit/offset) before spawning agents
+- **Critical ordering**: Follow the numbered steps exactly
+  - ALWAYS read input files first (Step 1) before spawning agents (Step 2)
+  - ALWAYS wait for all agents to complete before identifying ambiguities (Step 3)
+  - ALWAYS resolve all ambiguities (Step 5) before decomposing into slices (Step 6)
+  - ALWAYS complete holistic decomposition before generating any slice code (Step 7)
+  - ALWAYS create the skeleton artifact immediately after decomposition approval (Step 6)
+  - NEVER leave Architecture code fences empty after their slice is approved — fill via Edit in Step 7d
+- NEVER skip the developer checkpoint — developer input on architectural decisions is the highest-value signal in the design process
+- NEVER edit source files — all code goes into the design document, not the codebase. This skill produces a document, not implementation. Source file editing is implement's job.
+- **Code is source of truth** — if the Architecture code section conflicts with the Decisions prose, the code wins. Update the prose.
+- **Checkpoint recordings**: Record micro-checkpoint interactions in Developer Context with `file:line` references, same as Step 5 questions.
+- **Frontmatter consistency**: Always include frontmatter, use snake_case for multi-word fields, keep tags relevant
+
+## Common Design Patterns
+
+- **New Features**: types first → backend logic → API surface → UI last. Research existing patterns first. Include tests alongside each implementation.
+- **Modifications**: Read current file FULLY. Show only the modified/added code scoped to changed sections. Check integration points for side effects.
+- **Database Changes**: schema/migration → store/repository → business logic → API → client. Include rollback strategy.
+- **Refactoring**: Document current behavior first. Plan incremental backwards-compatible changes. Verify existing behavior preserved.
+- **Novel Work**: Include approach comparison in Decisions. Ground in codebase evidence OR web research. Get explicit developer sign-off BEFORE writing code.
--- a/extensions/rpiv-pi/skills/discover/SKILL.md
+++ b/extensions/rpiv-pi/skills/discover/SKILL.md
@@ -0,0 +1,218 @@
+---
+name: discover
+description: Interview the developer one question at a time to extract feature intent and requirements, then synthesize into a Feature Requirements Document at thoughts/shared/discover/. The first question is intent-only and runs before any codebase probe; subsequent questions ground in evidence the probe surfaces. Use as the canonical entry point of the pipeline before research, or to stress-test a feature idea before codebase discovery. The FRD's Decisions block is consumed by `research` and propagates through Developer Context into `design`.
+argument-hint: [free-text feature description | existing artifact path]
+---
+
+# Discover
+
+You are tasked with extracting feature intent and requirements through a one-question-at-a-time interview, then writing a Feature Requirements Document (FRD) that downstream skills consume. Two principles shape the flow: (1) **intent before agents** — the foundational intent question runs before any probe, so stated intent shapes the probe scope; (2) **lazy + confirm** — build the decision tree one layer at a time, and surface evidence-based pre-resolutions for confirmation rather than silently recording them.
+
+**How it works**:
+- Read input (free-text or artifact path) (Step 1)
+- Foundational intent question — open, no recommendation, no `file:line` (Step 2)
+- Lightweight codebase probe shaped by stated intent (Step 3)
+- Build root + immediate children, batch-confirm pre-resolutions (Step 4)
+- Interview loop — tiered questions, lazy expansion, re-queue cross-cutting answers (Step 5)
+- Synthesize answers into FRD sections (Step 6)
+- Write artifact, present, chain to research (Step 7)
+
+The final artifact is research-compatible — its Decisions block is translated into research's Developer Context and inherited by design.
+
+## Step 1: Input Handling
+
+Input: `$ARGUMENTS`
+
+1. **No argument provided**:
+   ```
+   I'll capture feature intent into an FRD. Provide one of:
+
+   `/skill:discover [free-text feature description]`     — fresh interview, write a new FRD
+   `/skill:discover [existing artifact path]`            — refine an existing FRD/ticket/doc via fresh interview
+   ```
+   Then wait for input.
+
+2. **Detect input shape** — parse `$ARGUMENTS`:
+   - If the argument is an existing file path (resolves to a readable `.md` under `thoughts/`, or any path the user mentions for refinement context), read it FULLY using the Read tool WITHOUT limit/offset. Treat its content as baseline context — the interview surfaces gaps, missing requirements, and unstated assumptions relative to what's already documented.
+   - Otherwise → fresh-feature mode: the entire argument is the free-text feature description.
+
+3. **Read any other files mentioned** in the prompt (tickets, docs, related artifacts, explicit `path:line` references) FULLY before proceeding.
+
+**No agent dispatch in Step 1.** Only `Read` on user-named paths. Agent grounding starts in Step 3, after stated intent has shaped the probe scope.
+
+Each invocation always writes a NEW timestamp-distinct artifact (Step 7) — there is no in-place stress-test append mode. To iterate on a prior FRD, either re-invoke discover (produces a fresh artifact) or manually Edit the prior artifact.
+
+## Step 2: Foundational Intent Question
+
+Before any codebase probe, ask the foundational intent question. This is purely conversational — no agents, no recommendation, no `file:line` citations.
+
+1. **Ask one open-ended `intent` question** via `ask_user_question`:
+   - Frame: "What problem are you solving and who hits it?" / "What does success look like for the person experiencing this today?" — phrase it for the specific feature.
+   - **No `(Recommended)` option.** The developer should generate the framing, not pick from a proposal.
+   - **No `file:line` citations** — codebase has nothing to say about intent.
+   - Options should be open shapes (e.g., "End user / maintainer / operator / Other") that route the answer, not solution shapes.
+   - Always offer "Other" so the developer can free-text the real framing.
+
+2. **Capture the answer in the developer's own words.** This text feeds into the FRD's Problem & Intent section verbatim — do not paraphrase into agent prose.
+
+3. **Probe-readiness check**: does the stated intent support a *narrow* probe slice (one component, one seam)? If yes → proceed to Step 3. If no (answer is too vague, e.g., "I dunno, feels slow"), ask **one more `intent` question** to sharpen scope, then re-check. Step 2 ends on probe-readiness, not at fixed N=1. Cap: 3 `intent` questions before falling through to Step 3 with whatever scope you have.
+
+## Step 3: Lightweight Codebase Probe (parallel agents, intent-shaped)
+
+Goal: ground the upcoming interview in concrete codebase evidence, with the probe slice shaped by the developer's stated intent from Step 2 — not by raw `$ARGUMENTS`.
+
+1. **Pick the agent set.** Dispatch `codebase-locator`, `codebase-analyzer`, or both — nothing else. Cap: 2 agents per Step 3 invocation.
+
+2. **Spawn the chosen agent(s) in parallel using the Agent tool.** Draft each prompt yourself from the developer's stated intent — keep the slice narrow (one component, one seam) and avoid breadth phrasing like "everything related to X". Shape per call:
+   ```
+   Agent({
+     subagent_type: "codebase-locator",   // or "codebase-analyzer"
+     description: "<3-5 word task>",
+     prompt: "<your narrow-slice prompt, scoped to stated intent>"
+   })
+   ```
+   The agent description on each subagent is the contract for what it expects in the prompt body.
+
+3. **Wait for ALL agents to complete before proceeding to Step 4.**
+
+4. **Read any clearly-relevant files** surfaced by the agents (≤5 files in main context, files <300 lines fully, larger files first 150 lines). Carry the agent reports and these files into Step 4 as evidence.
+
+5. **Empty results are not fatal.** If the probe returns thin/empty results (greenfield, no precedent), record "no codebase precedent" as evidence — `scope` interview questions still work (they don't need `file:line`), and `shape` questions will shift to ungrounded "pick A or B by convention" mode.
+
+## Step 4: Lazy Tree Setup + Pre-Resolution Confirmation
+
+Synthesize the **next layer** of questions internally before asking anything. Lazy expansion — build only root + immediate children at this stage, not the full tree. Each subsequent layer is built after its parent resolves.
+
+1. **Build root + immediate children**:
+   - **Root** — the developer's already-stated problem from Step 2.
+   - **Immediate children** — the foundational unresolved branches: Goals/Non-Goals · Functional Requirements · Non-Functional Requirements (perf/security/UX/reliability) · Constraints · Acceptance Criteria · Recommended Approach.
+   - Order branches by dependency (root → goals → constraints → solution shape → details). **This order drives the interview, not the FRD section order** — Step 6 redistributes answers into FRD sections.
+
+2. **Mark evidence-based pre-resolutions** from Step 3 with `file:line` citations. Do NOT silently record them as Decisions yet.
+
+3. **Batch-confirm pre-resolutions in a single `ask_user_question` call** before entering the interview loop. Frame each as: "From the probe I inferred — `<observed behavior>` (`file:line`). Keep this for the feature, or change it as part of the work?" The developer's confirm/correct is the actual Decision.
+
+   - **Confirm** → record as Decision, rationale `evidence: file:line + confirmed`.
+   - **Correct** → flip the Decision direction, schedule a Correction probe at Step 5 (≤1 additional agent on the new seam).
+
+4. The lazy tree stays internal — do NOT present the tree to the developer unless asked.
+
+## Step 5: Interview Loop
+
+Walk the lazy tree depth-first, parent before child. Expand the next layer (build a node's children) only after the node resolves. For each unresolved node:
+
+1. **Classify the question by tier**:
+   - **`intent`** — already done in Step 2. Do not re-ask intent in this loop.
+   - **`scope`** (goals · non-goals · functional reqs · non-functional reqs · constraints) — recommendation grounded in stated intent. `file:line` citations only when an option references existing code; otherwise state "no codebase precedent" in the option description.
+   - **`shape`** (architectural choice — which seam, which pattern, which integration point) — recommendation with `file:line` citations required on every option that references existing code. Mirrors the `packages/rpiv-pi/skills/research/SKILL.md:103-142` checkpoint pattern. If no precedent exists, switch to ungrounded mode and label options as "convention A / convention B" with explicit "no codebase precedent".
+   - **`detail`** (acceptance criteria · routine sub-decisions inside any branch) — batchable when 2-4 sibling leaves are independent.
+
+2. **Recommended answer** (`scope` / `shape` / `detail`): derive from intent + Step 3 evidence + project conventions. Every non-intent question carries a recommendation labeled `(Recommended)`.
+
+3. **Ask via `ask_user_question`.** Lead with the recommended option. The "Other" option is automatic and handles open-ended answers.
+
+4. **Critical rules**:
+   - Ask ONE question at a time. Wait for the answer before asking the next.
+   - If a new evidence-based node surfaces mid-loop, batch-confirm it the way Step 4 does — never silently auto-record.
+
+5. **Classify each response**:
+   - **Decision** ("yes, that recommendation is right" / "use option B"): Record in Decisions. Resolve the node. Expand its children if any. Continue.
+   - **Correction** ("no, the real intent is X" / "you missed Y"): Re-run targeted Step 3 grep on the new area; spawn at most **1 additional narrow agent per correction event** if the correction reveals a seam not yet probed. Adjust the affected subtree. Re-ask any descendants that depend on the corrected node.
+   - **Scope adjustment** ("skip the UI part" / "include retries"): Update the tree — prune pruned branches, add new branches if needed. Record in Decisions.
+   - **Cross-cutting answer** ("we also need audit / rate limiting / X" — affects multiple branches): Mark the new node as cross-cutting and **re-queue** it. When the walk reaches each affected parent (functional / non-functional / constraints), the cross-cutter fires under that parent's context. Same node, multiple parents resolved sequentially.
+   - **Defer** ("not sure, leave for later"): Add to Open Questions. Resolve the node by deferral. Continue.
+
+6. **Batching**: When 2-4 sibling `detail` leaves are independent (answers don't depend on each other), you MAY batch them in a single `ask_user_question` call. Keep dependent questions sequential. Do not batch `scope` or `shape` questions.
+
+7. **Termination — depth check, not bucket-fill**: stop the loop when:
+   - (a) every branch has a Decision or a Deferral, AND
+   - (b) the developer's own words appear in Problem/Goals (not paraphrased agent prose), AND
+   - (c) no Decision is `Recommendation accepted` without at least one Rationale clause beyond `agreed`.
+
+   Do not invent questions to pad the interview. Do NOT ask a final "looks good / want to adjust" rubber-stamp question — chain forward to research is automatic at Step 7.
+
+**Total agent budget across the skill**: 2 (Step 3 initial probe) + N×1 (Step 5 corrections, typically 0-2) = 2-4 agent dispatches per FRD.
+
+## Step 6: Synthesize FRD Body
+
+Read `templates/frd.md` (relative to this skill folder) at runtime to confirm the section list and frontmatter shape — do not inline it from memory.
+
+Compile interview output into the FRD. The interview's logical order (problem → goals → constraints → solution → details) is decoupled from the FRD's section order — redistribute answers into the template buckets here:
+
+- **Summary** — 2-3 sentences capturing the settled feature concept.
+- **Problem & Intent** — the developer's framing from Step 2, in their own words. Verbatim where possible.
+- **Goals / Non-Goals** — explicit in/out lists from the interview.
+- **Functional Requirements** — numbered, each independently testable.
+- **Non-Functional Requirements** — perf, security, UX, accessibility, reliability constraints.
+- **Constraints & Assumptions** — environmental, technical, schedule, organizational.
+- **Acceptance Criteria** — observable pass conditions a reviewer can check.
+- **Recommended Approach** — 1-2 sentences naming the architectural shape implied by the decisions (e.g., "new command in `packages/rpiv-pi/extensions/`, output to stdout, no persistence"). This text is what `research` passes to `scope-tracer` as the topic for breadth grounding.
+- **Decisions** — full Q/A log per decision: `### [title]` + `**Question**:` (text as asked, or "Pre-resolved from codebase evidence — confirmed in Step 4") + `**Recommended**:` (or "n/a — `intent` question") + `**Chosen**:` (developer's pick or evidence-derived answer) + `**Rationale**:` (1 line — why, or `evidence: path/to/file.ext:line + confirmed` for codebase-derived). This block is the inheritance hook into research's Developer Context.
+- **Open Questions** — only items the developer explicitly deferred.
+- **References** — input files, mentioned tickets, related artifacts.
+
+## Step 7: Write Artifact, Present, Chain
+
+1. **Determine metadata**:
+   - Filename: `thoughts/shared/discover/<YYYY-MM-DD_HH-MM-SS>_<topic>.md`
+     - Topic: kebab-case slug derived from the settled feature concept (lowercase, hyphens for spaces, strip special chars).
+     - Timestamp guarantees uniqueness across invocations — no slug-collision check.
+   - Repository name: from git root basename, or current directory basename if not a git repo.
+   - Use the git branch and commit from the git context injected at the start of the session (or run `git branch --show-current` / `git rev-parse --short HEAD` directly; fallbacks: `no-branch` / `no-commit`).
+   - Timestamp: run `date +"%Y-%m-%dT%H:%M:%S%z"` — raw for `date:` and `last_updated:`, first 19 chars (`T`→`_`, `:`→`-`) for filename slug.
+   - Interviewer: from the User in the injected git context (fallback: `unknown`).
+
+2. **Write the FRD** using the Write tool. Frontmatter `status: complete`. All template sections present and filled. The directory `thoughts/shared/discover/` is pre-scaffolded by `session-hooks.ts` — no `mkdir -p` needed in the skill.
+
+3. **Present and chain**:
+   ```
+   Intent captured to:
+   `thoughts/shared/discover/<YYYY-MM-DD_HH-MM-SS>_<topic>.md`
+
+   {N} requirements, {M} decisions, {K} open questions.
+
+   The FRD's Decisions block is translated into research's Developer Context and inherited by design.
+
+   ---
+
+   💬 Follow-up: discover writes a fresh FRD per call — re-invoke `/skill:discover` to iterate (the prior FRD stays unchanged on disk).
+
+   **Next step:** `/skill:research thoughts/shared/discover/<YYYY-MM-DD_HH-MM-SS>_<topic>.md` — ground the intent in codebase reality.
+
+   > 🆕 Tip: start a fresh session with `/new` first — chained skills work best with a clean context window.
+   ```
+
+## Step 8: Handle Follow-ups
+
+- **Fresh artifact per call, no in-place append.** Discover deliberately writes a NEW timestamp-distinct FRD on every invocation — there is no `## Follow-up` append mode. The prior FRD stays unchanged on disk.
+- **Iterate by re-invoking.** Re-run `/skill:discover [path-to-prior-FRD]` (or `/skill:discover <free-text>`) to produce a fresh FRD informed by the prior one.
+- **No rubber-stamp question.** NEVER ask a final "looks good / want to adjust" question — chain forward to research is automatic at Step 7.
+- **Manual edits are allowed.** If the developer wants a one-off correction without re-running the full interview, they can Edit the FRD directly — the skill does not own follow-up surface area beyond fresh-artifact-per-call.
+
+## Important Notes
+
+These reinforce the critical rules from the steps above — listed here so they don't get lost in step-body detail.
+
+- **Always interview-first, intent-first**: Never write the FRD without running the interview loop. The `intent` question (Step 2) always precedes any agent dispatch — let stated intent shape the probe, not the other way around.
+- **Always one question at a time**: Even with 2-4 batched independent `detail` leaves, that's still one `ask_user_question` call — wait for answers before asking the next round.
+- **`intent` generates, `scope`/`shape`/`detail` reviews**: Intent is the developer's framing — they generate it. Scope, shape, and detail are proposals — they review them. The "developer reviews a proposal" model does not apply at the intent layer.
+- **`file:line` is tier-conditional**: `intent` — never. `scope` — only when an option references existing code, otherwise label "no codebase precedent". `shape` — required on every option that references existing code; if no precedent exists, switch to ungrounded "convention A / convention B" mode. `detail` — same rule as `scope`.
+- **Lazy tree, no full-tree pre-build**: Build only root + immediate children in Step 4. Expand each node's children only after the node resolves. Premature full-tree construction biases the dialogue.
+- **Pre-resolutions confirm, never silently record**: Evidence-based nodes are batch-confirmed in Step 4 (or mid-loop if newly surfaced). The developer's confirm/correct is the actual Decision.
+- **Cross-cutting answers re-queue, don't duplicate or drop**: When an answer affects multiple branches, mark the node cross-cutting and fire it under each affected parent during the walk.
+- **Interview order ≠ FRD section order**: Walk the tree in dependency order (problem → goals → constraints → solution → details). Step 6 redistributes answers into FRD sections.
+- **Light fan-out only**: Step 3 ≤2 agents (`codebase-locator` + optionally `codebase-analyzer`). Step 5 Corrections ≤1 additional agent per correction event. Breadth discovery (`scope-tracer`, broad sweeps, `integration-scanner`) belongs to `research` — chain forward instead of expanding scope here.
+- **Never write or edit source files**: This skill produces an artifact only. Source-file changes are `implement`'s job, far downstream.
+- **Fresh artifact every invocation**: Each `/skill:discover` call writes a NEW timestamp-distinct file. To iterate on a prior FRD, re-invoke or manually Edit the prior file.
+- **Critical ordering** — follow the numbered steps exactly:
+  - ALWAYS read mentioned files before any agent dispatch (Step 1 → Step 2)
+  - ALWAYS ask the `intent` question before probing (Step 2 → Step 3)
+  - ALWAYS shape the probe by stated intent, not raw `$ARGUMENTS` (Step 3)
+  - ALWAYS batch-confirm pre-resolutions instead of silent auto-record (Step 4)
+  - ALWAYS expand the tree lazily during the interview (Step 5)
+  - ALWAYS re-queue cross-cutting answers under each affected parent (Step 5)
+  - ALWAYS terminate on depth signal, not bucket-fill (Step 5)
+  - ALWAYS synthesize from the interview log, never from memory of the conversation (Step 6)
+  - NEVER skip the developer-facing interview — it's the entire point of this skill
+  - NEVER ask a final "looks good / want to adjust" rubber-stamp question (anti-pattern per `a93e591`)
+  - NEVER dispatch agents before Step 2's `intent` question is answered
--- a/extensions/rpiv-pi/skills/discover/templates/frd.md
+++ b/extensions/rpiv-pi/skills/discover/templates/frd.md
@@ -0,0 +1,73 @@
+---
+date: {Current date and time with timezone in ISO format}
+author: {User from injected git context}
+commit: {Current commit hash}
+branch: {Current branch name}
+repository: {Repository name}
+topic: "{Feature topic}"
+tags: [intent, frd, relevant-component-names]
+status: complete
+last_updated: {Same ISO timestamp as `date:` above}
+last_updated_by: {User from injected git context}
+---
+
+# FRD: {Feature topic}
+
+## Summary
+{2-3 sentences. The settled feature concept after the interview — what we're building, in the developer's framing.}
+
+## Problem & Intent
+{What the developer is trying to solve and why. Capture the underlying motivation, not the proposed solution.}
+
+## Goals
+- {Explicit goal — what success looks like}
+- {Goal 2}
+
+## Non-Goals
+- {Explicit exclusion — surfaced during the interview}
+- {Likely scope-creep vector the developer ruled out}
+
+## Functional Requirements
+1. {Numbered, independently testable. "The system SHALL …"}
+2. {Requirement 2}
+
+## Non-Functional Requirements
+- **Performance**: {latency / throughput / load expectations, or "no specific constraint"}
+- **Security**: {auth, data handling, threat model edges}
+- **UX / Accessibility**: {interaction model, a11y constraints}
+- **Reliability**: {error handling expectations, retry/recovery semantics}
+
+## Constraints & Assumptions
+- {Technical constraint — runtime, dependency, platform}
+- {Schedule / organizational constraint}
+- {Assumption being made — explicit so research can verify}
+
+## Acceptance Criteria
+- [ ] {Observable pass condition a reviewer can check without reading code}
+- [ ] {Criterion 2}
+
+## Recommended Approach
+{1-2 sentences. The architectural shape implied by the decisions — e.g., "New command in `packages/rpiv-pi/extensions/`, writes JSON to stdout, no persistence layer." The downstream `research` skill validates this against the codebase and passes this text to `scope-tracer` as the topic.}
+
+## Decisions
+
+### {Decision 1 — short title}
+**Question**: {Question as asked during the interview, or "Pre-resolved from codebase evidence"}
+**Recommended**: {The recommendation that was offered}
+**Chosen**: {What the developer picked, or the evidence-derived answer}
+**Rationale**: {1 line — why this was chosen, or `evidence: path/to/file.ext:line` for codebase-derived}
+
+### {Decision 2 — short title}
+**Question**: …
+**Recommended**: …
+**Chosen**: …
+**Rationale**: …
+
+## Open Questions
+{Only items the developer explicitly deferred. Each becomes an Open Question for `research` to answer or carry forward into Developer Context.}
+
+- {Deferred item 1 — what's deferred, why}
+
+## References
+- {Input file or ticket}
+- {Related artifact, e.g., `thoughts/shared/research/<YYYY-MM-DD_HH-MM-SS>_<topic>.md`}
--- a/extensions/rpiv-pi/skills/explore/SKILL.md
+++ b/extensions/rpiv-pi/skills/explore/SKILL.md
@@ -0,0 +1,359 @@
+---
+name: explore
+description: Analyze solution options for a feature or change, comparing approaches with pros, cons, trade-offs, and a recommended path. Use when the user is weighing approaches, asks "what are the options" or "how should we approach X", wants approaches compared, says "explore solutions", or faces a decision with multiple valid implementations. Produces solutions documents in thoughts/shared/solutions/, which can feed the design skill.
+argument-hint: [feature/change description]
+---
+
+# Research Solutions
+
+You are tasked with analyzing solution options for new features or changes by invoking parallel skills and synthesizing their findings into actionable recommendations optimized for design consumption.
+
+## Initial Setup:
+
+When this command is invoked, respond with:
+```
+I'm ready to research solution options. Please provide:
+- What feature/change you want to explore
+- Any requirements or constraints you know about
+- Reference to relevant ticket or research documents if available
+
+I'll analyze the current codebase, generate solution options, and provide recommendations.
+```
+
+Then wait for the user's request.
+
+## Steps
+
+### Step 1: Read Mentioned Files
+
+- If user mentions tickets, research docs, or other files, read them FULLY first
+- **IMPORTANT**: Use Read tool WITHOUT limit/offset parameters
+- **CRITICAL**: Read these files in main context before invoking skills
+- Extract requirements, constraints, and goals
+- Identify what problem we're solving
+
+### Step 2: Generate Candidates and Dimensions
+
+**Generate 2–4 named candidates** from three sources, then merge into one shortlist:
+
+- **Ecosystem scan** — spawn `web-search-researcher` for any topic where the candidate space includes external libraries, frameworks, or services. Prompt it to return 2–4 named options with one-line "what it is" + canonical doc link per option. Skip only when the topic is wholly internal (e.g., "how to organize this service layer") and the orchestrator's design-space enumeration plus the user shortlist already cover the space.
+- **Design-space enumeration** — orchestrator names abstract shapes from first principles when applicable (pub/sub vs direct-call vs event-bus; sync vs async; manual mapping vs auto-mapper). One-line "what it is" per shape.
+- **User shortlist** — if the user pre-named candidates in the entry prompt ("compare TanStack Query vs SWR"), include those verbatim.
+
+Merge to 2–4 candidates total. Name each with a short noun phrase ("TanStack Query", "Direct event bus"). Deduplicate.
+
+**Default dimension list** (presented at Step 3; developer may drop irrelevant ones):
+
+- **approach-shape** (hybrid) — what category of solution the candidate is, what core moving parts it requires.
+- **precedent-fit** (codebase-anchored) — does the existing code already use this pattern; how many call sites would adopt the new option.
+- **integration-risk** (codebase-anchored) — which existing seams the candidate would touch; what breaks if it lands.
+- **migration-cost** (external-anchored for libraries; codebase-anchored for in-house code) — work to introduce the candidate plus work to remove the incumbent if there is one.
+- **verification-cost** (codebase-anchored) — test/CI surface needed to make the candidate safe to adopt.
+- **novelty** (external-anchored) — how recently the candidate emerged, ecosystem momentum, deprecation risk.
+
+Hold the candidate set and default dimension list in working state for the Step 3 checkpoint. Do not dispatch fit agents yet.
+
+### Step 3: Candidate Checkpoint
+
+Present the candidate set and default dimensions to the developer before per-candidate fit dispatch.
+
+1. **Show candidates and dimensions:**
+
+   ```
+   ## Candidates for: {Topic}
+
+   1. {Candidate A} — {one-line what it is}
+   2. {Candidate B} — {one-line what it is}
+   ...
+
+   Dimensions (default 6; drop any that don't apply):
+   - approach-shape · precedent-fit · integration-risk
+   - migration-cost · verification-cost · novelty
+   ```
+
+2. **Confirm via the `ask_user_question` tool with the following question:** "{N} candidates, {D} dimensions. Begin per-candidate fit dispatch?". Header: "Candidates". Options: "Proceed (Recommended)" (Begin per-candidate fit dispatch with all {N} candidates and all {D} dimensions); "Adjust candidates or dimensions" (Rename, add, or drop candidates; drop dimensions that don't apply); "Re-generate candidates" (Candidates look wrong — re-run Step 2 with adjusted scope).
+
+3. **Handle developer input:**
+
+   **"Proceed"**: lock the candidate × dimension set; advance to Step 4.
+
+   **"Adjust candidates or dimensions"**: ask the follow-up free-text question with prefix `❓ Question:` — "Which candidates and dimensions should be added, dropped, or renamed?" — apply edits to the working set, re-present, and confirm again with the same three-option `ask_user_question`.
+
+   **"Re-generate candidates"**: ask the follow-up free-text question with prefix `❓ Question:` — "What should be different in candidate generation? (narrower/wider scope, different ecosystem, exclude approach X, …)" — return to Step 2 with the updated scope, then re-enter Step 3.
+
+   Loop until "Proceed" is selected.
+
+### Step 4: Per-Candidate Fit Dispatch (parallel agents)
+
+For each confirmed candidate, dispatch up to two agents in parallel — total ≤ 2 × N agents:
+
+- **One `codebase-analyzer` per candidate** — when ≥1 kept dimension is codebase-anchored (precedent-fit, integration-risk, often migration-cost and verification-cost). The agent scores the candidate on every kept codebase-anchored dimension in a single pass, returning evidence per dimension with `file:line` references.
+- **One `web-search-researcher` per candidate** — when ≥1 kept dimension is external-anchored (novelty, often migration-cost for libraries, approach-shape for ecosystem options). The agent scores the candidate on every kept external-anchored dimension in a single pass, returning evidence per dimension with doc/source links.
+
+Skip either agent for a candidate when no dimension of that anchor-type was kept. Hybrid dimension `approach-shape` is scored by the orchestrator after both agents return, by combining their per-candidate findings.
+
+**Per-candidate prompt shape** (use the same outer template, fill in candidate name and kept dimensions):
+
+```
+Candidate: {name} — {one-line what it is}
+Topic: {topic from Step 1}
+
+Score this single candidate on the following dimensions, each with concrete evidence ({file:line} for codebase, doc/source link for external). Report findings as one section per dimension.
+
+Dimensions for this run:
+- {dimension name} — {one-line of what to look for}
+- ...
+
+Do NOT compare against other candidates; another agent handles each one separately. Focus on depth of evidence for THIS candidate.
+```
+
+Wait for ALL agents to complete before proceeding.
+
+**Coverage check**: every (candidate × kept-dimension) cell is filled — by an agent's evidence or by an explicit `null` ("does not apply to this candidate"). Cells silently dropped indicate a missing dispatch — re-run that candidate's agent.
+
+### Step 5: Synthesize and Recommend
+
+- Cross-reference per-candidate findings — fill the candidate × dimension grid with evidence per cell.
+- Apply the fit filter qualitatively per candidate: a candidate "clears" when no kept dimension surfaces a blocking concern (integration-risk that breaks load-bearing seams, migration-cost that exceeds the topic's scope, verification-cost with no path to coverage).
+- **If ≥1 candidate clears the fit filter**: pick the strongest, document rationale with evidence, and explain why alternatives weren't chosen. Identify conditions that would change the recommendation.
+- **If every candidate fails the fit filter**: produce a "no-fit" recommendation — list each candidate's blocking dimension with evidence, recommend re-scoping the question or expanding the candidate pool, and set Step 7 frontmatter `confidence: low` and `status: blocked`.
+
+### Step 6: Determine Metadata and Filename
+
+- Filename format: `thoughts/shared/solutions/YYYY-MM-DD_HH-MM-SS_{topic}.md`
+  - YYYY-MM-DD_HH-MM-SS: Current date and time (e.g., 2025-10-11_14-30-22)
+  - {topic}: Brief kebab-case description
+- Repository name: from git root basename, or current directory basename if not a git repo
+- Use the git branch and commit from the git context injected at the start of the session (or run `git branch --show-current` / `git rev-parse --short HEAD` directly)
+- Timestamp: run `date +"%Y-%m-%dT%H:%M:%S%z"` — raw for `date:` and `last_updated:`, first 19 chars (`T`→`_`, `:`→`-`) for filename slug.
+- Author: use the User from the git context injected at the start of the session (fallback: "unknown")
+- If metadata unavailable: use "unknown" for commit/branch
+
+### Step 7: Generate Solutions Document
+
+- Use the metadata gathered in step 6
+- Structure the document with YAML frontmatter followed by content:
+
+  ```markdown
+  ---
+  date: {Current date and time with timezone in ISO format}
+  author: {Author name}
+  commit: {Current commit hash}
+  branch: {Current branch name}
+  repository: {Repository name}
+  topic: "{Feature/Problem}"
+  confidence: high | medium | low
+  complexity: low | medium | high
+  status: ready | awaiting_input | blocked
+  tags: [solutions, component-names]
+  last_updated: {Same ISO timestamp as `date:` above}
+  last_updated_by: {Author name}
+  ---
+
+  # Solution Analysis: {Feature/Problem}
+
+  **Date**: {Current date and time with timezone from step 6}
+  **Author**: {Author name from step 6}
+  **Commit**: {Current commit hash from step 6}
+  **Branch**: {Current branch name from step 6}
+  **Repository**: {Repository name}
+
+  ## Research Question
+  {Original user query}
+
+  ## Summary
+  **Problem**: {What we're solving}
+  **Recommended**: {Option name} - {One sentence why}
+  **Effort**: {Low/Med/High} ({N days})
+  **Confidence**: {High/Med/Low}
+
+  ## Problem Statement
+
+  **Requirements:**
+  - {Requirement 1}
+  - {Requirement 2}
+
+  **Constraints:**
+  - {Hard constraint - must respect}
+  - {Soft constraint - should consider}
+
+  **Success criteria:**
+  - {What "done" looks like}
+
+  ## Current State
+
+  **Existing implementation:**
+  {What exists with file:line references}
+
+  **Relevant patterns:**
+  - {Pattern 1}: `file.ext:line` - Used in {N} places
+  - {Pattern 2}: `file.ext:line` - Used in {N} places
+
+  **Integration points:**
+  - `file.ext:line` - {Where feature hooks in}
+  - `file.ext:line` - {Another integration point}
+
+  ## Solution Options
+
+  ### Option 1: {Name}
+  **How it works:**
+  {2-3 sentence description + implementation approach}
+
+  **Pros:**
+  - {Advantage with evidence from codebase}
+  - {Advantage with evidence}
+
+  **Cons:**
+  - {Disadvantage with impact}
+
+  **Complexity:** {Low/Med/High} (~{N} days)
+  - Files to create: {N} (~{X} lines)
+  - Files to modify: {N} (~{X} lines)
+  - Risk level: {Low/Med/High}
+
+  ### Option 2: {Alternative Name}
+  {Same structure as Option 1}
+
+  ### Option 3: {Another Alternative}
+  {Same structure as Option 1}
+
+  ## Comparison
+
+  | Criteria | Option 1 | Option 2 | Option 3 |
+  |----------|----------|----------|----------|
+  | Complexity | {L/M/H} | {L/M/H} | {L/M/H} |
+  | Codebase fit | {H/M/L} | {H/M/L} | {H/M/L} |
+  | Risk | {L/M/H} | {L/M/H} | {L/M/H} |
+
+  ## Recommendation
+
+  <!-- Render exactly ONE of the two blocks below, based on Step 5's fit-filter outcome. -->
+
+  **(A) When ≥1 candidate clears the fit filter:**
+
+  **Selected:** {Option N}
+
+  **Rationale:**
+  - {Key reason with evidence}
+  - {Key reason with evidence}
+  - ...
+
+  **Why not alternatives:**
+  - Option X: {Reason}
+
+  **Trade-offs:**
+  - Accepting {limitation} for {benefit}
+
+  **Implementation approach:**
+  1. {Phase 1} - {What to build}
+  2. ...
+
+  **Integration points:**
+  - `file.ext:line` - {Specific change}
+  - `file.ext:line` - {Specific change}
+
+  **Patterns to follow:**
+  - {Pattern}: `file.ext:line`
+
+  **Risks:**
+  - {Risk}: {Mitigation}
+
+  **(B) When every candidate fails the fit filter:**
+
+  **No-fit:** every candidate surfaced a blocking concern on at least one kept dimension.
+
+  **Per-candidate blockers:**
+  - {Option 1}: {blocking dimension} — {evidence with file:line or doc link}
+  - {Option 2}: {blocking dimension} — {evidence}
+  - ...
+
+  **Recommended next step:**
+  - {Re-scope the question} — {how the topic should narrow/widen so candidates can clear}
+  - OR {Expand the candidate pool} — {what new candidate sources to enumerate; e.g., named ecosystem option not surfaced by Step 2}
+
+  **Frontmatter overrides:** set `confidence: low` and `status: blocked`.
+
+  ## Scope Boundaries
+  - {What we're building}
+  - {What we're NOT doing}
+
+  ## Testing Strategy
+
+  **Unit tests:**
+  - {Key test scenario 1}
+  - ...
+
+  **Integration tests:**
+  - {End-to-end scenario 1}
+  - ...
+
+  **Manual verification:**
+  - [ ] {Manual test 1}
+  - [ ] ...
+
+  ## Open Questions
+  **Resolved during research:**
+  - {Question that was answered} - {Answer with evidence from file:line}
+
+  **Requires user input:**
+  - {Business or design question} - {Default assumption for planning}
+
+  **Blockers:**
+  - {Critical unknown that prevents implementation} - {How to unblock}
+
+  ## References
+
+  - `thoughts/shared/research/{file}.md` - {Context}
+  - `src/file.ext:line` - {Similar implementation}
+  - `thoughts/shared/{file}.md` - {Historical decision}
+  ```
+
+### Step 8: Present Findings
+
+Print a concise summary, highlight key integration points, then close with the standardized footer:
+
+```
+Solutions document written to:
+`thoughts/shared/solutions/{filename}.md`
+
+{N} candidates evaluated, {M} dimensions scored, recommendation: {chosen}.
+
+---
+
+💬 Follow-up: describe the change in chat to append a timestamped Follow-up section to this artifact. Re-run `/skill:explore` for a fresh artifact.
+
+**Next step:** `/skill:design thoughts/shared/solutions/{filename}.md` — turn the chosen option into a design artifact (or `/skill:blueprint thoughts/shared/solutions/{filename}.md` for the fast path on smaller tasks).
+
+> 🆕 Tip: start a fresh session with `/new` first — chained skills work best with a clean context window.
+```
+
+### Step 9: Handle Follow-ups
+
+- **Append, never rewrite.** Edit the artifact to add a `## Follow-up Analysis {ISO 8601 timestamp}` section. Prior candidate scoring and verdicts stay immutable.
+- **Bump frontmatter.** Update `last_updated` + `last_updated_by`; set `last_updated_note: "<one-line summary of follow-up>"`.
+- **Re-dispatch narrowly.** Spawn ≤1–2 fresh agents scoped to the new candidate or dimension. Do NOT re-run the full skill.
+- **When to re-invoke instead.** If the candidate set or dimensions shift materially, re-run `/skill:explore` for a fresh artifact. The previous block's `Next step:` stays valid for the existing artifact.
+
+## Important Notes
+
+- Parallel Agent dispatch — every `Agent(...)` call in the same assistant message (multiple tool_use blocks in one response), never one per turn. Call shape: `Agent({ subagent_type: "<agent-name>", description: "<3-5 word task label>", prompt: "<task>" })`.
+- Always spawn fresh research to validate current state - never rely on old research docs as source of truth
+- Old research documents can provide historical context but must be validated against current code
+- Generate 2-4 named candidates in Step 2; confirm them with the developer at Step 3 before per-candidate fit dispatch
+- Web-search-researcher is a first-class Step 2 agent for ecosystem candidate-source — skip only when the topic is wholly internal and design-space enumeration plus user shortlist cover the space
+- Per-candidate fit dispatch caps at two agents per candidate (one codebase-analyzer, one web-search-researcher) — skip either when no dimension of its anchor-type was kept
+- Solutions documents should be self-contained with all necessary context
+- Each agent prompt should be specific and focused on a single candidate scored on the kept dimensions
+- Quantify pattern precedent — count usage in codebase, don't just say "follows pattern"
+- Ground complexity estimates in actual similar work from git history
+- Think like a software architect — option-shopping output is 2–4 comparable candidates plus an honest fit verdict
+- Keep the main agent focused on synthesis and comparison, not deep implementation details
+- Encourage agents to find existing patterns and examples, not just describe possibilities
+- Resolve technical unknowns during research — don't leave critical questions for design
+- **File reading**: Always read mentioned files FULLY (no limit/offset) before invoking skills
+- **Critical ordering**: Follow the numbered steps exactly
+  - ALWAYS read mentioned files first before invoking skills (step 1)
+  - ALWAYS generate candidates and run the Step 3 checkpoint before per-candidate dispatch (steps 2 → 3 → 4)
+  - ALWAYS wait for all per-candidate agents to complete before synthesizing (step 4)
+  - ALWAYS gather metadata before writing the document (step 6 before step 7)
+  - NEVER write the solutions document with placeholder values
--- a/extensions/rpiv-pi/skills/implement/SKILL.md
+++ b/extensions/rpiv-pi/skills/implement/SKILL.md
@@ -0,0 +1,103 @@
+---
+name: implement
+description: Execute an approved implementation plan from thoughts/shared/plans/ phase by phase, applying changes and verifying each phase against its success criteria before moving on. Use when the user invokes /implement, asks to "implement this plan", or wants an existing phased plan executed. Pair with revise to update plans mid-flight and validate to confirm completion.
+argument-hint: "[plan-path] [Phase N]"
+allowed-tools: Read, Edit, Write, Bash(*), Glob, Grep
+disable-model-invocation: true
+---
+
+# Implement Plan
+
+You are tasked with implementing an approved technical plan from `thoughts/shared/plans/`. These plans contain phases with specific changes and success criteria.
+
+## Getting Started
+
+- Plan path: `$1` (empty/literal → ask user)
+- Phase scope: `${@:2}` (empty → all phases sequentially; else scope to this phase only)
+
+With a plan path in hand:
+- Read the plan completely and check for any existing checkmarks (- [x])
+- Read the original ticket and all files mentioned in the plan
+- **Read files fully** - never use limit/offset parameters, you need complete context
+- Think deeply about how the pieces fit together
+- Create a todo list to track your progress
+- Start implementing if you understand what needs to be done
+
+## Implementation Philosophy
+
+Plans are carefully designed, but reality can be messy. Your job is to:
+- Follow the plan's intent while adapting to what you find
+- Implement each phase fully before moving to the next
+- Verify your work makes sense in the broader codebase context
+- Update checkboxes in the plan as you complete sections
+
+When things don't match the plan exactly, think about why and communicate clearly. The plan is your guide, but your judgment matters too.
+
+If you encounter a mismatch:
+- STOP and think deeply about why the plan can't be followed
+- Present the issue clearly:
+  ```
+  Issue in Phase {N}:
+  Expected: {what the plan says}
+  Found: {actual situation}
+  Why this matters: {explanation}
+
+  ```
+
+  Use the `ask_user_question` tool to resolve the mismatch. Question: "{Brief summary of the mismatch}". Header: "Mismatch". Options: "Follow the plan" (Adapt the plan's approach to the current code state); "Skip this change" (Move on without this change — it may not be needed); "Update the plan" (The plan needs to be revised before continuing).
+
+## Verification Approach
+
+After implementing a phase:
+- Run the success criteria checks (usually `make check test` covers everything)
+- Fix any issues before proceeding
+- Update your progress in both the plan and your todos
+- Check off completed items in the plan file itself using Edit
+
+Don't let verification interrupt your flow - batch it at natural stopping points.
+
+## If You Get Stuck
+
+When something isn't working as expected:
+- First, make sure you've read and understood all the relevant code
+- Consider if the codebase has evolved since the plan was written
+- Present the mismatch clearly and ask for guidance
+
+Use skills sparingly - mainly for targeted debugging or exploring unfamiliar territory.
+
+## Resuming Work
+
+If the plan has existing checkmarks:
+- Trust that completed work is done
+- Pick up from the first unchecked item
+- Verify previous work only if something seems off
+
+Remember: You're implementing a solution, not just checking boxes. Keep the end goal in mind and maintain forward momentum.
+
+## Closing Out
+
+When the last in-scope phase is complete (or the user pauses execution), print a closing block in this exact shape:
+
+```
+Implementation {complete | paused at Phase {N}}: `thoughts/shared/plans/{filename}.md`
+
+{P} phases completed, {M} files changed, {T} tests passing.
+Outstanding: {list of unchecked items, blockers, or "none"}.
+
+---
+
+💬 Follow-up: implement edits source files, not artifacts. For plan-level changes run `/skill:revise <plan-path>` first; for session pauses run `/skill:create-handoff`.
+
+**Next step:** `/skill:validate thoughts/shared/plans/{filename}.md` — verify the implementation against the plan's success criteria before committing.
+
+> 🆕 Tip: start a fresh session with `/new` first — chained skills work best with a clean context window.
+```
+
+If the run was paused mid-plan rather than completed, swap the next-step line for `/skill:create-handoff` so context can be resumed cleanly in a new session — the same `/new` tip still applies.
+
+## Handle Follow-ups
+
+- **Implement does not own the plan.** Source-file edits happen in implement; plan edits do not. Never patch the plan artifact from inside implement.
+- **For plan-level changes.** Run `/skill:revise <plan-path>` first — it appends a timestamped Follow-up section to the plan and preserves history. Then resume implement at the affected phase.
+- **For session pauses.** Run `/skill:create-handoff` to capture in-flight state, then `/new` and `/skill:resume-handoff` in the next session.
+- **Mismatch handling stays inline.** When code reality diverges from the plan, use the inline `ask_user_question` flow ("Follow the plan / Skip this change / Update the plan") — that is implement's only follow-up surface; everything else escalates to revise or create-handoff.
--- a/extensions/rpiv-pi/skills/migrate-to-guidance/SKILL.md
+++ b/extensions/rpiv-pi/skills/migrate-to-guidance/SKILL.md
@@ -0,0 +1,98 @@
+---
+name: migrate-to-guidance
+description: Migrate a project's inline CLAUDE.md files to the .rpiv/guidance/ shadow-tree system. Finds every CLAUDE.md, transforms internal references, and creates equivalent architecture.md files under .rpiv/guidance/. Use when the user wants to move from inline CLAUDE.md to the guidance shadow tree, consolidate scattered CLAUDE.md files into one place, or invokes /migrate-to-guidance.
+argument-hint: [--delete-originals]
+allowed-tools: Bash, Read, Glob
+---
+
+# Migrate CLAUDE.md to Guidance
+
+You are tasked with migrating a project's existing `CLAUDE.md` files (typically created by `/skill:annotate-inline`) into the `.rpiv/guidance/` system.
+
+The migration relocates files from in-place `CLAUDE.md` to `.rpiv/guidance/{path}/architecture.md` and transforms internal cross-references.
+
+## Steps to follow:
+
+1. **Pre-flight check:**
+   - Use Glob to find all `**/CLAUDE.md` files in the project
+   - If none are found, inform the user: "No CLAUDE.md files found in this project. Nothing to migrate." and stop
+   - If `.rpiv/guidance/` already exists, note this — there may be conflicts
+
+2. **Dry run — preview the migration:**
+   - Run the migration script in dry-run mode:
+     ```
+     node scripts/migrate.js --project-dir "${CWD}" --dry-run
+     ```
+   - Parse the JSON output from stdout and present a migration plan to the user:
+     ```
+     ## Migration Plan
+
+     Found {N} CLAUDE.md files to migrate:
+
+     | Source | Target | Lines |
+     |--------|--------|-------|
+     | CLAUDE.md | .rpiv/guidance/architecture.md | 45 |
+     | src/core/CLAUDE.md | .rpiv/guidance/src/core/architecture.md | 78 |
+     | ... | ... | ... |
+     ```
+   - If there are **conflicts** (targets that already exist), list them:
+     ```
+     ### Conflicts (targets already exist):
+     - .rpiv/guidance/src/core/architecture.md
+
+     Use --force to overwrite these.
+     ```
+   - If there are **warnings** (unresolved prose references), list them:
+     ```
+     ### Warnings:
+     - .rpiv/guidance/architecture.md line 23: Prose reference may need manual update
+     ```
+   - Ask the user for confirmation before proceeding. Ask whether they want to:
+     - Delete the original CLAUDE.md files after migration (`--delete-originals`)
+     - Overwrite existing conflicts (`--force`)
+
+3. **Execute the migration:**
+   - Build the command based on user choices:
+     ```
+     node scripts/migrate.js --project-dir "${CWD}" [--delete-originals] [--force]
+     ```
+   - Run the migration and parse the JSON output
+   - Present the results:
+     ```
+     ## Migration Complete
+
+     | Source | Target | Lines | Refs Updated |
+     |--------|--------|-------|--------------|
+     | CLAUDE.md | .rpiv/guidance/architecture.md | 45 | 3 |
+     | src/core/CLAUDE.md | .rpiv/guidance/src/core/architecture.md | 78 | 1 |
+     | ... | ... | ... | ... |
+
+     Total: {N} files migrated
+     {Originals deleted: yes/no}
+     ```
+
+4. **Post-migration:**
+   - If warnings exist about unresolved prose references:
+     - Read the affected guidance files
+     - Offer to fix the remaining references using contextual knowledge of the project structure
+   - Print the closing footer (verbatim, with placeholders filled):
+     ```
+     Migration complete: {N} files migrated to `.rpiv/guidance/`.
+     {Originals deleted: yes/no}
+     Verification: run `claude` in the project and read a source file to confirm guidance injection works.
+
+     ---
+
+     💬 Follow-up: describe targeted edits in chat; re-run `/skill:migrate-to-guidance` with different flags (`--force`, `--delete-originals`) for a different migration shape.
+
+     **Next step:** `/skill:annotate-guidance` — refresh or extend annotations now that the guidance tree owns them (skip if no further annotation is planned).
+
+     > 🆕 Tip: start a fresh session with `/new` first — chained skills work best with a clean context window.
+     ```
+
+## Important notes:
+- The migration script handles all file operations — do not manually copy or move CLAUDE.md files
+- Content format is preserved as-is (same markdown structure, same `<important if>` blocks)
+- Only cross-references between files are transformed (`CLAUDE.md` paths → `.rpiv/guidance/` paths)
+- The script outputs JSON to stdout — parse it for structured results
+- Debug logs go to stderr (visible with `claude --verbose`)
--- a/extensions/rpiv-pi/skills/outline-test-cases/SKILL.md
+++ b/extensions/rpiv-pi/skills/outline-test-cases/SKILL.md
@@ -0,0 +1,362 @@
+---
+name: outline-test-cases
+description: Discover testable features in a project (frontend-first) and create a folder outline under .rpiv/test-cases/ with per-feature metadata. Incremental runs reuse the existing outline for smarter discovery and diff-based checkpoints. Use before write-test-cases to map project scope, when the user wants to plan or inventory test coverage, asks to "outline test cases", or wants a test-case scaffold generated for a project.
+argument-hint: [target-directory]
+allowed-tools: Agent, Read, Write, Edit, Glob, Grep
+---
+
+# Outline Test Cases
+
+You are tasked with discovering all testable features in a project and creating a folder outline under `.rpiv/test-cases/`. Each feature gets its own folder with a `_meta.md` file containing discovered routes, endpoints, scope decisions, and domain context. A root `README.md` summarizes the full project outline. No test case content is generated — use `write-test-cases` per feature to fill the folders.
+
+Two modes: **Fresh** (no existing outline — full discovery and checkpoint) and **Incremental** (existing outline found — discovery with prior context, diff-based checkpoint). Discovery always runs in both modes.
+
+## Initial Setup
+
+When this command is invoked, respond with:
+```
+I'll discover all testable features in this project and create a folder outline
+under .rpiv/test-cases/. Let me check for existing outlines and analyze the codebase.
+```
+
+Use the current working directory as the target project by default. If the user provides a specific directory path as an argument, use that instead.
+
+## Steps
+
+### Step 1: Read files & detect mode
+
+- If the user mentions specific files (existing test cases, architecture docs, READMEs), read them FULLY first
+- **IMPORTANT**: Use the Read tool WITHOUT limit/offset parameters to read entire files
+- **CRITICAL**: Read these files yourself in the main context before invoking any agents
+
+#### Mode Detection
+
+Check for existing outline data:
+
+1. **Glob** for `**/_meta.md` with path set to `.rpiv/test-cases/` in the target directory (dot-prefixed directories must be targeted directly)
+2. If no `_meta.md` files found → **Fresh mode**. Proceed to Step 2.
+3. If `_meta.md` files found → **Incremental mode**. Read them ALL and extract:
+   - Existing feature list (names, slugs, modules, routes, endpoints)
+   - Scope exclusions from `## Scope Decisions` sections
+   - Previous checkpoint Q&A from `## Checkpoint History` sections
+   - Generated date from frontmatter
+
+Report detected mode:
+```
+[Fresh]: No existing outline found. Will run full discovery.
+[Incremental]: Found {N} existing feature outlines from {generated date}. Will re-discover with prior context and highlight changes.
+```
+
+### Step 2: Discover features
+
+First, detect the project's technology stack by checking for framework indicators (see Framework Detection Reference below).
+
+Spawn the following agents in parallel using the Agent tool. Wait for ALL agents to complete before proceeding.
+- Use the **codebase-locator** agent to find all registered routes, navigation menus, and page entry points
+- Use the **codebase-locator** agent to find all frontend HTTP API call sites — report each call-site `file:line` and the literal URL template string found at the call site (e.g., ``${base}/users/${id}``). Frontend-to-backend URL correlation happens orchestrator-side in Step 3's Cross-Reference synthesis (`skills/outline-test-cases/SKILL.md:71-79`) using the backend-controller findings from the next agent.
+- Use the **codebase-locator** agent to find all backend API controllers and route handlers
+- Use the **test-case-locator** agent to find existing test cases in `.rpiv/test-cases/` to avoid duplicates
+
+Include in your prompts for the three codebase-locator agents:
+- Target directory and detected framework
+- In **Incremental mode**: summary of previously discovered features (names, routes, endpoints) from existing `_meta.md` files — ask agents to flag new items and note any that no longer exist
+- If **scope exclusions** were loaded in Step 1: list them and instruct agents to exclude matching results
+
+While agents run, read `.gitignore` yourself to understand exclusion rules.
+
+### Step 3: Determine feature targets
+
+**IMPORTANT**: Wait for ALL agents from Step 2 to complete before proceeding.
+
+#### Cross-Reference (both modes)
+
+Cross-reference findings from all 4 agents:
+
+**Feature identification** — Build the feature list from frontend evidence:
+1. Start with frontend routes (Route Discovery) — each top-level route group is a candidate feature
+2. Validate with navigation menus — features in the sidebar/nav are confirmed active
+3. Enrich with API call mapping (API Mapping) — link each feature's frontend services to backend endpoints
+4. Cross-reference against backend controllers (Backend Discovery) — identify which backend controllers serve each frontend feature
+
+**Phantom detection** — Flag backend controllers NOT referenced by any frontend route or API call:
+- Platform/public API controllers serving external consumers
+- Webhook controllers triggered by external services
+- Deprecated endpoints with code still present
+- Sub-services used within other features
+- Present these as "Backend-only endpoints (no frontend exposure)" in the confirmation
+
+#### Incremental: Diff Against Existing Outline
+
+In Incremental mode, compare the fresh discovery results against existing `_meta.md` data and classify each feature:
+
+| Category | Condition |
+|---|---|
+| **Unchanged** | Feature exists in both existing outline and fresh discovery, routes/endpoints match |
+| **New** | Found by agents but not in any existing `_meta.md` |
+| **Removed** | In existing `_meta.md` but not found by agents |
+| **Changed** | Feature exists in both but routes or endpoints differ |
+
+#### Common Processing (both modes)
+
+**Feature grouping** — Group confirmed features by portal/application:
+- Detected from route structure (e.g., Admin, Public, Partner, Host)
+
+**Decomposition rules:**
+- Large features (>10 endpoints or >3 sub-routes) — note sub-features in metadata, keep as single folder
+- Small features (<5 endpoints, no dedicated route) — fold into parent feature
+- Sub-services without own routes — fold into the feature that uses them
+
+**Slug and module assignment:**
+- Feature slug: kebab-case from feature name (e.g., `user-management` → `users`, `report-builder`)
+- Module abbreviation: short uppercase code derived from feature name (e.g., USR, AUTH, DASH, RPT)
+
+**Duplicate check** — Cross-reference against existing TCs (TC Locator):
+- Features with existing TC folders → mark status as "partial" (has outline, TCs may exist)
+- Features with no TCs → mark status as "pending"
+
+### Step 4: Developer checkpoint
+
+#### Fresh Mode — Full Checkpoint
+
+Ask grounded questions one at a time before presenting the feature list. Use a **❓ Question:** prefix so the user knows their input is needed. Each question must reference real findings and pull NEW information — not confirm what you already found. Ask several questions targeting what the code analysis could not detect.
+
+**Question focus areas** (business/product language first, technical fallback only when necessary):
+
+- **Phantom features**: "There's a bulk-import capability in the code but no screen for it in the admin panel — is this tested separately or internal-only?"
+- **Missing coverage**: "The navigation menu shows a Reports section but I can't find an actual page behind it — is this under development or was it removed?"
+- **Hidden features**: "I see three separate user management areas but only one is visible in the menu — are the others internal tools or deprecated?"
+- **Feature boundaries**: "User management and role assignment share the same backend — should they be one test area or two?"
+- **Environment-specific**: "Some features seem to be behind feature flags and only active in staging — should these be included in the test outline?"
+
+**CRITICAL**: Ask ONE question at a time. Wait for the answer before asking the next. Lead with your most significant finding.
+
+**Choosing question format:**
+
+- **`ask_user_question` tool** — when your question has 2-4 concrete options from code analysis (pattern conflicts, integration choices, scope boundaries, priority overrides). The user can always pick "Other" for free-text. Example: Use the `ask_user_question` tool with the question "Found 2 mapping approaches — which should new code follow?". Options: "Manual mapping (Recommended)" (Used in OrderService (src/services/OrderService.ts:45) — 8 occurrences); "AutoMapper" (Used in UserService (src/services/UserService.ts:12) — 2 occurrences).
+
+- **Free-text with ❓ Question: prefix** — when the question is open-ended and options can't be predicted (discovery, "what am I missing?", corrections). Example:
+  "❓ Question: Integration scanner found no background job registration for this area. Is that expected, or is there async processing I'm not seeing?"
+
+**Batching**: When you have 2-4 independent questions (answers don't depend on each other), you MAY batch them in a single `ask_user_question` call. Keep dependent questions sequential.
+
+**Classify each response and track for persistence:**
+
+**Confirmations** ("looks good", "yes proceed"):
+- Record. Proceed to the next question, or to the feature list if all questions answered.
+
+**Corrections** ("that's deprecated", "wrong grouping"):
+- Update the feature list directly. Record as scope decision for the affected feature.
+
+**Additions** ("you missed the refund flow", "add platform API"):
+- Add to the feature list. Assign slug/module. Record as scope decision.
+
+**Scope adjustments** ("skip admin features", "split settings into two"):
+- Adjust the target list. Record as scope decision for affected features.
+
+After all questions are answered, present the proposed feature list:
+
+```
+## Proposed Feature Outline
+
+Framework detected: {framework name}
+Applications found: {N} ({app names})
+Total backend endpoints: ~{N} across {M} controllers
+
+---
+### {Portal Name} ({N} features)
+
+1. {Feature Name} — {N} routes, {M} API endpoints
+   Slug: {feature-slug} | Module: {MOD}
+   Sub-features: {list if decomposed, or "none"}
+2. {Feature Name} — {N} routes, {M} API endpoints
+   Slug: {feature-slug} | Module: {MOD}
+{etc.}
+
+### Already Covered (will skip):
+- {Feature} — {N} existing TCs in .rpiv/test-cases/{slug}/
+
+### Backend-Only Endpoints (no frontend exposure):
+- {Controller/endpoint group} — {reason: platform API / webhook / deprecated}
+
+---
+Create outline for {total} features?
+```
+
+Use the `ask_user_question` tool with the following question: "Create outline for {total} features across {N} portals?". Options: "Create outline (Recommended)" (Write _meta.md files and folder structure for all features above); "Add or remove features" (Adjust the feature list before creating); "Reclassify" (Move backend-only endpoints into the main feature list or vice versa).
+
+Handle any final additions, removals, reclassifications, or slug/module overrides.
+
+#### Incremental Mode — Diff-Based Checkpoint
+
+Present the diff results from Step 3 with previous decisions:
+
+```
+## Outline Update ({N} features, last run {generated date})
+
+Unchanged ({N}):
+- {Feature Name} — {slug} | {MOD}
+{etc.}
+
+New ({N}):
+- {Feature Name} — {N} routes, {M} API endpoints (not in previous outline)
+{etc.}
+
+Removed ({N}):
+- {Feature Name} — was {slug} | {MOD} (no longer found in codebase)
+{etc.}
+
+Changed ({N}):
+- {Feature Name} — {what changed: "3 new endpoints", "route path changed", etc.}
+{etc.}
+
+Previous decisions:
+- {Q&A pair 1 rephrased as single-line decision statement}
+- {Q&A pair 2 rephrased as single-line decision statement}
+
+```
+
+Use the `ask_user_question` tool with the following question: "{N} unchanged, {M} new, {K} removed features. Apply updates?". Options: "Apply updates (Recommended)" (Update _meta.md files and create new feature folders); "Adjust changes" (Modify the proposed new/removed/changed features); "Re-run discovery" (Something looks wrong — re-scan the codebase).
+
+Rephrase each Q&A pair into a concise decision statement (e.g., `**Q:** "Is the bulk-import capability tested separately?" **A:** "No, internal only"` becomes `"Bulk-import — internal only, excluded from scope"`).
+
+**If no changes detected** (all features unchanged):
+- Present the unchanged list and previous decisions
+- Use the `ask_user_question` tool with the following question: "No changes detected since {date}. Still accurate?". Options: "Confirmed" (Outline is still accurate — no updates needed); "Force re-scan" (Re-run discovery anyway to verify).
+
+**For new/changed/removed features**, ask grounded questions ONE at a time (same approach as Fresh mode) targeting only the differences. Unchanged features need only batch confirmation.
+
+**Classify each response and track for persistence** (same as Fresh mode: Confirmations, Corrections, Additions, Scope adjustments).
+
+After all questions are answered, present the full feature list summary (same format as Fresh mode) and wait for user confirmation before proceeding to Step 5.
+
+### Step 5: Write folder outline
+
+#### Fresh Mode — creating new files
+
+1. **Create directories** — for each confirmed feature, create `.rpiv/test-cases/{feature-slug}/`
+
+2. **Write `_meta.md` per feature** — one file per folder:
+
+   Read the full feature metadata template at `templates/feature-meta.md`. Follow the template exactly, populating fields from agent findings and checkpoint answers:
+   - `## Routes` — route paths and component names from Route Discovery (no file:line references)
+   - `## Endpoints` — HTTP methods and paths from Backend Discovery
+   - `## Scope Decisions` — from checkpoint answers classified as Corrections, Additions, or Scope adjustments that affect this feature. Include cross-cutting decisions that apply. If no scope decisions surfaced, write a default entry: `- Full feature in scope (no exclusions identified)`
+   - `## Domain Context` — from checkpoint answers that reveal business rules or intentional behaviors. Leave section with `- None identified` if nothing surfaced.
+   - `## Test Data Requirements` — from checkpoint answers that mention data needs. Leave section with `- None identified` if nothing surfaced.
+   - `## Checkpoint History` — all Q&A pairs from the checkpoint that affect this feature, under a date header (`### YYYY-MM-DD`)
+
+3. **Write root `README.md`** at `.rpiv/test-cases/README.md`:
+
+   Read the full outline README template at `templates/outline-readme.md`. Follow the template exactly, populating fields from the confirmed feature list.
+
+4. **Present summary:**
+   ```
+   ## Test Case Outline Created
+
+   | Folder | Module | Portal | Routes | Endpoints | Status |
+   |--------|--------|--------|--------|-----------|--------|
+   | users/ | USR | Admin | 5 | 20 | pending |
+   | reports/ | RPT | Admin | 2 | 15 | pending |
+   | {etc.} | | | | | |
+
+   Output: `.rpiv/test-cases/`
+   Total: {N} feature folders + {N} _meta.md files + 1 README.md
+   Phantom features skipped: {list or "none"}
+
+   Note: this outline is a starting point based on code analysis — re-run or add features manually as the project evolves.
+
+   ---
+
+   💬 Follow-up: describe folder/metadata changes in chat to update specific `_meta.md` files. Re-run `/skill:outline-test-cases` for incremental discovery against the current codebase.
+
+   **Next step:** `/skill:write-test-cases [feature-name]` — generate the test case files for a single feature (run once per feature folder).
+
+   > 🆕 Tip: start a fresh session with `/new` first — chained skills work best with a clean context window.
+   ```
+
+#### Incremental Mode — updating existing files
+
+1. **Update existing `_meta.md` files** using the Edit tool:
+   - Update `## Routes` and `## Endpoints` with fresh discovery data
+   - Append new Q&A pairs to `## Checkpoint History` under a new date header (`### YYYY-MM-DD`)
+   - Update `## Scope Decisions` if changed during checkpoint
+   - Update `## Domain Context` if changed
+   - Update frontmatter `date` to current date
+
+2. **Add new feature folders** for newly discovered features:
+   - Create directory + write new `_meta.md` from template (same as Fresh mode Step 5.2)
+
+3. **Flag removed features** — do NOT delete folders (they may contain generated TCs):
+   - Update `_meta.md` frontmatter `status` to `removed`
+   - Append removal note to `## Checkpoint History`
+   - Inform the user which folders were flagged so they can decide whether to delete
+
+4. **Update root `README.md`** — update feature table and `Last updated:` line using Edit
+
+5. **Present summary:**
+   ```
+   ## Test Case Outline Updated
+
+   Unchanged: {N} features
+   Updated: {N} _meta.md files (routes/endpoints refreshed)
+   Added: {N} new feature folders
+   Removed: {N} features flagged (folders preserved)
+
+   Changes:
+   - {List of what changed: "Added payments feature", "Flagged legacy-reports as removed", "Updated scope for users", etc.}
+
+   Output: `.rpiv/test-cases/`
+
+   Note: this outline is a starting point based on code analysis — re-run or add features manually as the project evolves.
+
+   ---
+
+   💬 Follow-up: describe folder/metadata changes in chat to update specific `_meta.md` files. Re-run `/skill:outline-test-cases` for incremental discovery against the current codebase.
+
+   **Next step:** `/skill:write-test-cases [feature-name]` — generate the test case files for a single feature (run once per feature folder).
+
+   > 🆕 Tip: start a fresh session with `/new` first — chained skills work best with a clean context window.
+   ```
+
+### Step 6: Handle Follow-ups
+
+- **Append, never rewrite.** Edit `_meta.md` files in place; do not delete folders that contain generated TCs (flag them via `status: removed` instead).
+- **Bump frontmatter.** Update each touched `_meta.md`'s `date` field and the root `README.md` `Last updated:` line to the current date.
+- **Re-dispatch narrowly.** Spawn ≤1–2 agents scoped to the changed feature. Do NOT re-run the full skill.
+- **When to re-invoke instead.** If the codebase changed significantly, re-run `/skill:outline-test-cases` — incremental mode auto-detects existing outlines and reconciles. The previous block's `Next step:` stays valid.
+
+Skill-specific verbs:
+- **Add features**: add folder + `_meta.md`, update `README.md`.
+- **Remove features**: tell the user they can delete the folder; update `README.md`.
+- **Reclassify phantoms**: create folder + `_meta.md` for the reclassified feature, update `README.md`.
+- **Adjust metadata**: edit specific `_meta.md` files using the Edit tool.
+
+## Framework Detection Reference
+
+| Indicator | Framework | Detection |
+|-----------|-----------|-----------|
+| `@angular/core` | Angular | `package.json` dependencies |
+| `react-router-dom` / `react-router` / `@react-router` | React | `package.json` dependencies |
+| `next` | Next.js | `package.json` dependencies |
+| `vue-router` | Vue Router | `package.json` dependencies |
+| `nuxt` | Nuxt | `package.json` dependencies |
+| `.csproj` / `.sln` | .NET | File presence in project root |
+| `pyproject.toml` / `requirements.txt` with Django/Flask/FastAPI | Python | File presence + dependency check |
+| None found | Backend-only | Fallback to backend discovery |
+
+## Important Notes
+
+- This skill creates folders and `_meta.md` only — use `write-test-cases` per feature for actual TC content.
+- Frontend routes define features; backend enriches them. No UI route → no folder (unless developer overrides).
+- Never skip the developer checkpoint, even on incremental runs.
+- `_meta.md` is the inter-skill contract — keep route/endpoint paths stable, no `file:line` references.
+- **File reading**: Always read mentioned files FULLY (no limit/offset) before invoking agents.
+- **Critical ordering**: Follow the numbered steps exactly.
+  - ALWAYS detect mode first (Step 1) before spawning agents
+  - ALWAYS read mentioned files first before invoking agents (Step 1)
+  - ALWAYS wait for all agents to complete before determining targets (Step 3)
+  - ALWAYS checkpoint with the user before presenting the feature list (Step 4)
+  - ALWAYS get user confirmation before writing folders (Step 4 → Step 5)
+  - NEVER write folders or metadata with placeholder values
+- **Duplicate avoidance**: Always check existing TCs via test-case-locator before creating folders.
+- **Idempotent re-runs**: If `.rpiv/test-cases/` already has folders with TCs, mark them accordingly — do not overwrite existing TC content. Only update `_meta.md` and `README.md`.
--- a/extensions/rpiv-pi/skills/outline-test-cases/templates/feature-meta.md
+++ b/extensions/rpiv-pi/skills/outline-test-cases/templates/feature-meta.md
@@ -0,0 +1,50 @@
+```markdown
+---
+date: {YYYY-MM-DD}
+author: {User from injected git context}
+commit: {commit-hash}
+branch: {Current branch name}
+repository: {Repository name}
+topic: "{Feature Name}"
+tags: [test-cases, outline, {module}, {feature-slug}]
+status: pending | partial | generated
+feature: "{Feature Name}"
+module: {MOD}
+portal: {Portal Name}
+slug: {feature-slug}
+tc_count: 0
+last_updated: {YYYY-MM-DD}
+last_updated_by: {User from injected git context}
+---
+
+## Routes
+- `{route path}` — {ComponentName}
+
+## Endpoints
+- `{HTTP method} {path}` — {description}
+
+## Scope Decisions
+- {What's in scope and why}
+- {What's OUT of scope and why}
+
+## Domain Context
+- {Business rules, intentional behaviors, known limitations}
+
+## Test Data Requirements
+- {Minimum data conditions for testing this feature}
+
+## Checkpoint History
+### {YYYY-MM-DD}
+**Q: {Question asked during checkpoint}**
+A: {Developer's answer}
+```
+
+**Notes on `_meta.md` content:**
+- Routes come from route discovery findings — path and component name only, no file:line
+- Endpoints come from backend discovery, filtered to those serving this feature
+- Scope Decisions, Domain Context, and Test Data Requirements come from checkpoint answers
+- Checkpoint History records dated Q&A pairs from developer checkpoints
+- If a feature has no frontend routes (e.g., widget), list the component entry point instead
+- If status is "partial", add an `## Existing Test Cases` section listing TC IDs found by the test-case-locator agent
+- commit records which commit was analyzed during outline generation — used for staleness detection by consuming skills
+- tc_count starts at 0 and is updated by write-test-cases when TCs are created
--- a/extensions/rpiv-pi/skills/outline-test-cases/templates/outline-readme.md
+++ b/extensions/rpiv-pi/skills/outline-test-cases/templates/outline-readme.md
@@ -0,0 +1,36 @@
+```markdown
+# {Project Name} — Test Case Outline
+
+## Overview
+- Project: {project name}
+- Framework: {framework}
+- Applications: {N} ({app names})
+- Total features: {N} outlined
+- Backend endpoints: ~{N} across {M} controllers
+- Last updated: {YYYY-MM-DD} | Branch: `{branch}` | Commit: `{commit}`
+
+## Features by Portal
+
+### {Portal Name} ({N} features)
+| # | Feature | Module | Slug | Routes | Endpoints | Status |
+|---|---------|--------|------|--------|-----------|--------|
+| 1 | {name} | {MOD} | {slug} | {N} | {M} | pending |
+
+## Backend-Only Endpoints (no frontend exposure)
+- **{Group name}** ({N} controllers, ~{M} endpoints) — {reason}
+
+## Next Steps
+Generate test cases for a specific feature:
+```
+/skill:write-test-cases {feature-name}
+```
+
+To update this outline after codebase changes:
+```
+/skill:outline-test-cases
+```
+Incremental runs detect existing outlines and take faster paths.
+
+## Coverage
+This outline was generated by static code analysis. It may not capture dynamically loaded features, features behind feature flags, or functionality added after the generation date. Re-run `outline-test-cases` periodically or add features manually.
+```
--- a/extensions/rpiv-pi/skills/plan/SKILL.md
+++ b/extensions/rpiv-pi/skills/plan/SKILL.md
@@ -0,0 +1,286 @@
+---
+name: plan
+description: Convert a design artifact into a phased implementation plan with parallelized atomic phases and explicit success criteria, written to thoughts/shared/plans/. Use after the design skill when the user wants a design turned into an actionable, phase-by-phase plan to hand to the implement skill. Prefer plan when a straightforward phased breakdown is sufficient, and prefer blueprint when iterative vertical-slice micro-checkpoints between phases are needed.
+argument-hint: [design artifact path]
+---
+
+# Write Plan
+
+You are tasked with creating phased implementation plans from design artifacts. The design artifact contains all architectural decisions, full implementation code, and ordering constraints. Your job is to decompose that design into parallelized atomic phases with success criteria that implement can execute.
+
+## Step 1: Read Design Artifact
+
+When this command is invoked:
+
+1. **Determine input mode**:
+
+   **Design artifact provided** (path to a `.md` file in `thoughts/shared/designs/`):
+   - Read the design artifact FULLY using the Read tool WITHOUT limit/offset
+   - Extract: Architecture (the code changes), File Map, Ordering Constraints, Verification Notes, Performance Considerations, Scope
+   - These are the inputs for phasing
+   - Design decisions are settled — do not re-evaluate them
+   - If the design has unresolved questions, STOP — tell the developer to return to design
+
+   **No arguments provided**:
+   ```
+   I'll create an implementation plan from a design artifact. Please provide the path:
+
+   `/skill:plan thoughts/shared/designs/2025-01-20_09-30-00_feature.md`
+
+   Run `/skill:design` first to produce the design artifact. There is no standalone path.
+   ```
+   Then wait for input.
+
+2. **Read any additional files mentioned** in the design's References — research documents, tickets. Read them FULLY for context.
+
+## Step 2: Decompose into Phases
+
+Read the Ordering Constraints and File Map from the design artifact. Apply phasing rules:
+
+1. **Independently implementable**: Each phase must compile and pass tests on its own — no cross-phase runtime state
+2. **Parallelizable**: Phases that don't depend on each other are explicitly marked (e.g., "Phases 2 and 3 can run in parallel")
+3. **Worktree-sized**: Each phase should be appropriate for a single implement session in a worktree (~3-8 files changed, 1-3 components touched)
+4. **Dependency-ordered**: Phase ordering follows the design artifact's Ordering Constraints
+5. **Grouped coherently**: Related file changes go in the same phase (e.g., import change + hook setup + JSX modification for one component)
+
+**If the design's Ordering Constraints say "all files independent"**, consider whether a single phase is appropriate. Don't split into phases just for the sake of it — if all changes can be done in one worktree session, one phase is correct.
+
+Present phase outline and get developer feedback BEFORE writing details:
+
+```
+Here's my proposed plan structure based on the design at {path}:
+
+## Implementation Phases:
+1. {Phase name} - {what it accomplishes} ({N} files)
+2. {Phase name} - {what it accomplishes} ({N} files)
+3. {Phase name} - {what it accomplishes} ({N} files)
+
+Phases {2} and {3} can run in parallel after Phase 1.
+Total: {N} files across {M} phases.
+
+Does this phasing make sense? Should I adjust the order or granularity?
+```
+
+Use the `ask_user_question` tool to confirm the phase structure. Question: "{N} phases, {M} total files. Does this structure work?". Header: "Phases". Options: "Proceed (Recommended)" (Write the detailed plan with code blocks and success criteria); "Adjust phases" (Split, merge, or reorder phases before writing); "Change scope" (Add or remove files from the plan).
+
+Get feedback on structure before writing details.
+
+## Step 3: Write Plan
+
+After structure approval, write the plan **incrementally** — skeleton first, then fill each phase:
+
+1. **Write the plan skeleton** to `thoughts/shared/plans/YYYY-MM-DD_HH-MM-SS_description.md`
+   - Timestamp: run `date +"%Y-%m-%dT%H:%M:%S%z"` — raw for `date:` and `last_updated:`, first 19 chars (`T`→`_`, `:`→`-`) for filename slug.
+   - Format: `YYYY-MM-DD_HH-MM-SS_description.md` where:
+     - YYYY-MM-DD / HH-MM-SS come from the `date` output above
+     - description is a brief kebab-case description (may include ticket number)
+   - Examples:
+     - With ticket: `2025-01-08_14-30-00_ENG-1478-parent-child-tracking.md`
+     - Without ticket: `2025-01-08_14-30-00_improve-error-handling.md`
+   - The skeleton includes everything EXCEPT large code blocks: frontmatter, Overview, Desired End State, What We're NOT Doing, full phase structure (Overview, Changes Required with file paths and change summaries, Success Criteria, parallelism annotations), Testing Strategy, Performance Considerations, References. All phasing and structural decisions happen in this pass.
+
+2. **Fill code blocks using Edit** — one phase at a time:
+   - For each phase, Edit to insert the before/after code blocks from the design's Architecture section into the Changes Required subsections
+
+3. **Use this template structure**:
+
+```markdown
+---
+date: {Current date and time with timezone in ISO format}
+author: {User from injected git context}
+commit: {Current commit hash}
+branch: {Current branch name}
+repository: {Repository name}
+topic: "{Feature/Task Name}"
+tags: [plan, relevant-component-names]
+status: ready
+parent: "{path to design artifact}"
+last_updated: {Same ISO timestamp as `date:` above}
+last_updated_by: {User from injected git context}
+---
+
+# {Feature/Task Name} Implementation Plan
+
+## Overview
+
+{Brief description of what we're implementing and why. Reference design artifact.}
+
+## Desired End State
+
+{From design artifact's Desired End State / Summary — what "done" looks like and how to verify it}
+
+## What We're NOT Doing
+
+{From design artifact's Scope → Not Building}
+
+## Phase 1: {Descriptive Name}
+
+### Overview
+{What this phase accomplishes}
+
+### Changes Required:
+
+#### 1. {Component/File Group}
+**File**: `path/to/file.ext`
+**Changes**: {Summary of changes}
+
+```{language}
+// Code from design artifact's Architecture section
+```
+
+### Success Criteria:
+
+#### Automated Verification:
+- [ ] Type checking passes: `pnpm typecheck`
+- [ ] Linting passes: `pnpm lint`
+- [ ] Tests pass: `pnpm test`
+
+#### Manual Verification:
+- [ ] {From design's Verification Notes — specific visual/behavioral check}
+- [ ] {Component-specific verification}
+
+---
+
+## Phase 2: {Descriptive Name}
+
+{Similar structure with both automated and manual success criteria...}
+
+---
+
+## Testing Strategy
+
+### Automated:
+- {Standard project checks from success criteria}
+
+### Manual Testing Steps:
+1. {From design's Verification Notes — converted to step-by-step}
+2. {Another verification step}
+
+## Performance Considerations
+
+{From design artifact — copied directly}
+
+## Migration Notes
+
+{From design artifact — copied directly. If applicable: schema changes, data migration, rollback strategy, backwards compatibility. Empty if not applicable.}
+
+## References
+
+- Design: `thoughts/shared/designs/{file}.md`
+- Research: `thoughts/shared/research/{file}.md`
+- Original ticket: `thoughts/me/tickets/{file}.md`
+```
+
+## Step 4: Review
+
+1. **Present the plan location**:
+   ```
+   Implementation plan written to:
+   `thoughts/shared/plans/{filename}.md`
+
+   {N} phases, {M} total file changes.
+
+   Please review:
+   - Are the phases properly scoped for worktree execution?
+   - Are the success criteria specific enough?
+   - Any phase that should be split or merged?
+
+   ---
+
+   💬 Follow-up: describe the change in chat to append a timestamped Follow-up section to this artifact, or use `/skill:revise <plan-path>` for surgical phase edits. Re-run `/skill:plan` for a fresh artifact.
+
+   **Next step:** `/skill:implement thoughts/shared/plans/{filename}.md Phase 1` — start execution at Phase 1 (omit `Phase 1` to run all phases sequentially).
+
+   > 🆕 Tip: start a fresh session with `/new` first — chained skills work best with a clean context window.
+   ```
+
+## Step 10: Handle Follow-ups
+
+- **Edit in-place.** Use the Edit tool to update the plan artifact directly. Phase numbering stays stable when possible — renumber only when a phase is split or merged.
+- **Bump frontmatter.** Update `last_updated` + `last_updated_by`; set `last_updated_note: "<one-line summary>"`.
+- **Phase-level moves.** Split large phases, merge small phases, adjust success criteria, reorder phases — all in-place. Continue refining until the developer is satisfied.
+- **When to re-invoke instead.** For surgical edits driven by review findings, prefer `/skill:revise <plan-path>`. Re-run `/skill:plan` only when the underlying design changed materially. The previous block's `Next step:` stays valid for the existing plan.
+
+## Guidelines
+
+1. **Trust the Design**:
+   - Design decisions are fixed — do not re-evaluate architectural choices
+   - If something in the design seems wrong, flag it to the developer
+   - Don't silently change the approach or add scope
+   - The design is the source of truth for what to build
+
+2. **Be Interactive**:
+   - Don't write the full plan in one shot
+   - Get buy-in on phase structure first
+   - Allow course corrections on granularity
+   - Work collaboratively
+
+3. **Be Practical**:
+   - Focus on incremental, testable changes
+   - Each phase should leave the codebase in a working state
+   - Think about what can be verified independently
+   - Include "what we're NOT doing" from the design's scope
+
+4. **Phase for Worktrees**:
+   - Each phase should be implementable in an isolated worktree
+   - No phase should depend on another phase's uncommitted changes
+   - If the design says "all independent," one phase may be correct
+   - Don't split for the sake of splitting
+
+5. **Track Progress**:
+   - Use a todo list to track planning tasks
+   - Mark planning tasks complete when done
+
+6. **No Open Questions in Final Plan**:
+   - If you encounter open questions during planning, STOP
+   - If the design artifact has unresolved questions, send the developer back to design
+   - Do NOT write the plan with unresolved questions
+   - The implementation plan must be complete and actionable
+
+## Success Criteria Guidelines
+
+**Always separate success criteria into two categories:**
+
+1. **Automated Verification** (can be run by execution agents):
+   - Commands that can be run: `make test`, `npm run lint`, etc.
+   - Specific files that should exist
+   - Code compilation/type checking
+   - Automated test suites
+
+2. **Manual Verification** (requires human testing):
+   - UI/UX functionality
+   - Performance under real conditions
+   - Edge cases that are hard to automate
+   - User acceptance criteria
+
+**Format example:**
+```markdown
+### Success Criteria:
+
+#### Automated Verification:
+- [ ] Database migration runs successfully: `make migrate`
+- [ ] All unit tests pass: `go test ./...`
+- [ ] No linting errors: `golangci-lint run`
+- [ ] API endpoint returns 200: `curl localhost:8080/api/new-endpoint`
+
+#### Manual Verification:
+- [ ] New feature appears correctly in the UI
+- [ ] Performance is acceptable with 1000+ items
+- [ ] Error messages are user-friendly
+- [ ] Feature works correctly on mobile devices
+```
+
+**Convert design's Verification Notes to success criteria:**
+- Prose warnings → specific automated commands or manual steps
+- "Test production builds" → `pnpm build && verify in built app`
+- "Verify scrollbar appearance" → `[ ] Open {component}, scroll, observe slim scrollbar`
+- "Do NOT use X" → `grep -r "X" src/ should return 0 matches`
+
+## Important Notes
+
+- NEVER edit source files — this skill produces a plan document, not implementation
+- Always read the design artifact FULLY before decomposing into phases
+- The plan template must be compatible with implement — preserve the phase/success criteria structure
+- If the design artifact has unresolved questions, STOP — send the developer back to design
+- Code in the plan comes from the design artifact's Architecture section — do not invent new code
+- **Frontmatter consistency**: Use snake_case for multi-word field names in plan frontmatter
--- a/extensions/rpiv-pi/skills/research/SKILL.md
+++ b/extensions/rpiv-pi/skills/research/SKILL.md
@@ -0,0 +1,328 @@
+---
+name: research
+description: Answer structured research questions about a codebase using targeted parallel analysis agents, then synthesize findings into a research document in thoughts/shared/research/. Internally dispatches the scope-tracer agent to formulate trace-quality research questions, then answers them. Use when the user wants in-depth research on a codebase area, asks to "research X", or needs answers to architecture or behavior questions before designing changes.
+argument-hint: [free-text research prompt]
+---
+
+# Research
+
+You are tasked with answering structured research questions by spawning targeted analysis agents and synthesizing their findings into a comprehensive research document. This skill internally dispatches the `scope-tracer` agent to formulate trace-quality research questions, then answers them.
+
+Input: `$ARGUMENTS`
+
+## Step 1: Trace the Investigation Scope
+
+1. **Argument is empty:**
+   ```
+   Please provide a free-text research prompt.
+   ```
+   Then wait for input.
+
+2. **Detect chained discover artifact:** If `$ARGUMENTS` mentions a path matching `thoughts/shared/discover/.*\.md`, read it FULLY using the Read tool (no limit/offset) before scope-tracer dispatch:
+   - Translate each `### [Decision title]` block in the FRD's `## Decisions` section into a Developer Context entry: `**Q (discover: <Decision title>): <Question text>**` followed by `A: <Chosen text>`. Hold these entries in main context — they're recorded in the research artifact's Developer Context section in Step 4 (write document).
+   - Use the FRD's `## Recommended Approach` text (1-2 sentences naming the architectural shape) as the topic body for the next sub-step's scope-tracer prompt. The full discover artifact path stays in `$ARGUMENTS` so scope-tracer's "read mentioned files first" rule picks up the file naturally for additional context.
+   - Carry the FRD's Open Questions forward verbatim into the research artifact's Open Questions section in Step 4.
+   - If `$ARGUMENTS` is plain free-text or mentions a non-discover path, skip this sub-step and proceed directly to scope-tracer dispatch with `$ARGUMENTS` as the topic.
+
+3. **Dispatch the scope-tracer agent** to formulate trace-quality research questions for the user's topic:
+   ```
+   Agent({
+     subagent_type: "scope-tracer",
+     description: "trace scope",
+     prompt: "$ARGUMENTS"
+   })
+   ```
+   The agent reads any mentioned files, sweeps anchor terms via grep/find/ls, reads 5-10 key files for depth, then emits a Discovery Summary + 5-10 dense numbered questions inline in its final message. Nothing is written to disk.
+
+4. **Parse the agent's final message** as the questions artifact body. Extract: Discovery Summary (3-5 sentence file-landscape overview), Questions (numbered dense 3-6 sentence paragraphs).
+
+5. **Read key shared files** referenced across multiple questions into main context — especially shared utilities, type definitions, and integration points that multiple questions mention.
+
+6. **Analyze question overlap for grouping:**
+   - Parse all question paragraphs and extract file references from each
+   - Identify questions that share 2+ file references — these are candidates for grouping
+   - Group related questions together (2-3 questions per group max)
+   - Questions with no significant file overlap remain standalone
+   - Target: 3-6 agent dispatches total (grouped + standalone)
+
+7. **Report scoped status:**
+   ```
+   [Scoped]: ran scope-tracer. {N} questions in {G} groups, {M} shared files.
+   ```
+
+## Step 2: Dispatch Analysis Agents
+
+Spawn analysis agents using the Agent tool. All agents run in parallel.
+
+**Default agent**: `codebase-analyzer` for all codebase questions. This agent has Read, Grep, Glob, LS — it can trace code paths, find patterns, and analyze integration points.
+
+**Exception**: Questions that explicitly reference external documentation, web APIs, or third-party libraries → `web-search-researcher`.
+
+**Agent prompt — question-as-prompt:**
+
+Each agent receives the dense question paragraph(s) directly as its prompt. The question IS the instruction.
+
+For standalone questions (no grouping):
+```
+Research topic: {topic from frontmatter}
+
+Answer the following research question thoroughly with file:line references. Read the files mentioned, trace the code paths described, and provide a complete analysis.
+
+{Full dense question paragraph}
+
+Provide your analysis with exact file:line references. Focus on DEPTH — trace the actual code, don't just locate it.
+```
+
+For grouped questions:
+```
+Research topic: {topic from frontmatter}
+
+Answer the following related research questions thoroughly with file:line references. These questions share overlapping code paths — use your cross-question context to provide deeper, more connected analysis.
+
+Question 1: {Full dense question paragraph}
+
+Question 2: {Full dense question paragraph}
+
+For each question, provide your analysis with exact file:line references. Note connections between the questions where the same code serves multiple roles. Focus on DEPTH — trace the actual code, don't just locate it.
+```
+
+**Precedent sweep (git-gated):**
+
+When `commit` is available (not `no-commit`), spawn one `precedent-locator` agent alongside the question agents with prompt: "Find similar past changes involving {list key files from Discovery Summary}. Search git log for commits that touched these files, similar commit messages, and follow-up fixes. Research topic: {original query}."
+
+Findings go into Precedents & Lessons. Otherwise skip and note "git history unavailable" there.
+
+**Wait for ALL agents to complete** before proceeding.
+
+## Step 3: Synthesize and Checkpoint
+
+1. **Compile findings:**
+   - Match each agent's response to the question(s) it answered
+   - Cross-reference findings across questions — look for patterns, conflicts, and connections
+   - Prioritize live codebase findings as primary source of truth
+   - Use thoughts/ findings as supplementary historical context
+   - Include specific file paths and line numbers
+   - Build Code References as jump-table entries for the planner, not narrative (file:startLine-endLine format)
+   - No multi-line code blocks (>3 lines) — use file:line refs + prose. No implementation recipes — facts only.
+   - No artifact summaries — link plans/designs in Historical Context, don't summarize their contents. Research describes current codebase state.
+
+2. **Developer checkpoint — grounded questions one at a time:**
+
+   Start with grounded questions referencing real findings with file:line evidence. Ask ONE question at a time, waiting for the answer before the next. Use a **❓ Question:** prefix. Each question must pull NEW information from the developer — not confirm what you already found:
+
+   Every question MUST embed at least one `file:line` reference in the question text — not just in surrounding context. Examples:
+
+   - "❓ Question: `src/events/orders.ts:45-67` has 3 event hooks but no error recovery path. Is there a retry mechanism elsewhere I'm not seeing?"
+   - "❓ Question: Pattern-finder found manual mapping at `src/services/OrderService.ts:45` (8 uses) vs AutoMapper at `src/services/UserService.ts:12` (2 uses). Which should new code follow?"
+   - "❓ Question: Precedent commit `abc123` required a follow-up fix at `src/handlers/key.ts:158` for connection leak. Should we account for that pattern in this design?"
+
+   Anti-patterns — NEVER ask these:
+   - "Is this research to understand X or prepare for Y?" — confirmatory, pulls zero new information
+   - "Does this look correct?" / "Should I continue?" — asks developer to validate YOUR work instead of providing NEW context
+   - Questions without `file:line` — if you can't ground it in code, it's not a research question
+
+   **Question patterns by finding type:**
+
+   - **Pattern conflict**: "Found 2 implementations of {X} — which is canonical?" with options citing `file:line` + occurrence count
+   - **Scope boundary**: "Question {N} references files {A,B,C} but analysis shows {D} is the real integration point. Extend scope?" with yes/no + "describe what I missed"
+   - **Priority override**: "Questions Q1 and Q2 have competing implications for {area}. Which is load-bearing?" with options
+   - **Integration ambiguity**: "Found no connection between {X} and {Y}. Is there an indirect path?" (free-text — can't predict the answer)
+
+   **Choosing question format:**
+
+   - **`ask_user_question` tool** — when your question has 2-4 concrete options from code analysis (pattern conflicts, integration choices, scope boundaries, priority overrides). The user can always pick "Other" for free-text. Example:
+
+     > Use the `ask_user_question` tool with the following question: "Found 2 patterns for retry logic — which is canonical?". Header: "Pattern". Options: "Event-sourced retry (Recommended)" (`src/events/orders.ts:45-67` — 3 hooks, matches precedent commit `abc123`); "Direct retry loop" (`src/services/OrderService.ts:112` — single use, no event traceability).
+
+   - **Free-text with ❓ Question: prefix** — when the question is open-ended and options can't be predicted (discovery, "what am I missing?", corrections). Example:
+     "❓ Question: `src/events/orders.ts:45-67` has 3 event hooks but no error recovery path. Is there a retry mechanism elsewhere I'm not seeing?"
+
+   **Anti-pattern** — do NOT dump a verbose paragraph mixing analysis with a trailing question:
+
+   ❌ "The premise inversion is load-bearing for prioritization — it means Site A is a no-op, and the real bloat only hits general-purpose dispatches. Given this, where is the bloat actually landing? Do your skills dispatch named bundled agents — in which case append-mode is irrelevant — or general-purpose — in which case it IS the dominant source?"
+
+   ✅ Extract the 2 concrete options and call `ask_user_question`: "Where is the prompt bloat landing?". Header: "Bloat source". Options: "Named bundled agents (Recommended)" (Skills dispatch `codebase-analyzer` etc. — `prompt_mode: "replace"`, no parent inheritance); "General-purpose agent" (`default-agents.ts:11-28` — `promptMode: "append"`, inherits full parent prompt).
+
+   **Batching**: When you have 2-4 independent questions (answers don't depend on each other), you MAY batch them in a single `ask_user_question` call. Keep dependent questions sequential.
+
+   **CRITICAL**: Ask ONE question at a time. Wait for the answer before asking the next. Lead with your most significant finding.
+
+3. **Present compiled scan** (under 30 lines):
+   ```
+   Task: {one-line summary}
+   Scope: {N files across M layers, K integration points}
+
+   {Layer name} — {key files and what they do}
+   {Layer name} — {key files and what they do}
+   Integration — {N inbound, M outbound, K wiring. Top concern if any}
+   History — {N relevant docs. Key insight if any}
+
+   Best template: {implementation to model after}
+   Precedents — {N similar changes found. Top lesson if any}
+   Inconsistencies: {count} found ({short names})
+   ```
+
+   Wait for the developer's response before proceeding.
+
+4. **Incorporate developer input:**
+
+   Classify each response:
+
+   **Corrections** (e.g., "skip the job scheduler", "use CreateProduct not GetUser"):
+   - Incorporate directly into synthesis. Record in Developer Context.
+
+   **New areas** (e.g., "you missed the events module"):
+   - Spawn targeted rescan: **codebase-locator** + **codebase-analyzer** on the new area (max 2 agents).
+   - Merge results into synthesis. Record in Developer Context.
+
+   **Decisions** (e.g., "yes, hook into that event chain"):
+   - Record in Developer Context. Remove corresponding item from Open Questions.
+
+   **Scope/focus** (e.g., "focus on API layer, UI is out of scope"):
+   - Record in Developer Context.
+
+   After incorporating all input, proceed to Step 4.
+
+## Step 4: Write Research Document
+
+1. **Determine metadata:**
+   - Filename: `thoughts/shared/research/YYYY-MM-DD_HH-MM-SS_{topic}.md`
+     - YYYY-MM-DD_HH-MM-SS: Current date and time
+     - topic: Brief kebab-case description
+   - Repository name: from git root basename, or current directory basename if not a git repo
+   - Use the git branch and commit from the git context injected at the start of the session (or run `git branch --show-current` / `git rev-parse --short HEAD` directly)
+   - Timestamp: run `date +"%Y-%m-%dT%H:%M:%S%z"` — raw for `date:` and `last_updated:`, first 19 chars (`T`→`_`, `:`→`-`) for filename slug.
+   - Author: use the User from the git context injected at the start of the session (fallback: "unknown")
+   - If metadata unavailable: use "unknown" for commit/branch
+
+2. **Write the research document** — this document is compressed context for a new session. Include everything the planner needs to make architectural decisions without re-researching:
+
+   ```markdown
+   ---
+   date: {Current date and time with timezone in ISO format}
+   author: {User from injected git context}
+   commit: {Current commit hash}
+   branch: {Current branch name}
+   repository: {Repository name}
+   topic: "{User's Research Topic}"
+   tags: [research, codebase, relevant-component-names]
+   status: complete
+   last_updated: {Same ISO timestamp as `date:` above}
+   last_updated_by: {User from injected git context}
+   ---
+
+   # Research: {User's Research Topic}
+
+   ## Research Question
+   {Original user query from questions artifact}
+
+   ## Summary
+   {High-level findings answering the user's question}
+
+   ## Detailed Findings
+
+   ### {Component/Area 1}
+   - Finding with reference (`file.ext:line`)
+   - Connection to other components
+   - Implementation details
+
+   ### {Component/Area 2}
+   ...
+
+   ## Code References
+   - `path/to/file.py:123` — Description of what's there
+   - `another/file.ts:45-67` — Description of the code block
+
+   ## Integration Points
+   {All connections to the researched area. Enumerate each consumer, dependency, and wiring point with file:line. Source from the questions artifact's Discovery Summary + new connections found by analysis agents.}
+
+   ### Inbound References
+   - `path/to/consumer.ext:line` — {What references the component and how}
+
+   ### Outbound Dependencies
+   - `path/to/dependency.ext:line` — {What the component depends on}
+
+   ### Infrastructure Wiring
+   - `path/to/config.ext:line` — {DI, routes, events, jobs, middleware}
+
+   ## Architecture Insights
+   {Patterns, conventions, and design decisions discovered}
+
+   ## Precedents & Lessons
+   {N} similar past changes analyzed.
+
+   ### Precedent: {what was added/changed}
+   **Commit(s)**: `hash` — "message" (YYYY-MM-DD)
+   **Blast radius**: N files across M layers
+     layer/ — what changed
+
+   **Follow-up fixes**:
+   - `hash` — "message" (date) — what went wrong
+
+   **Lessons from docs**:
+   - thoughts/path/to/doc.md — key lesson extracted
+
+   **Takeaway**: {one sentence — what to watch out for}
+
+   ### Composite Lessons
+   - {Composite lesson 1 — most recurring pattern first, with relevant `commit hash` inline}
+   - {Composite lesson 2}
+
+   ## Historical Context (from thoughts/)
+   {Links only — one line per doc, no summaries of their contents}
+   - `thoughts/shared/something.md` — {one-line description of what this doc covers}
+   ## Developer Context
+   **Q (`file.ext:line`): {Question grounded in specific code reference}**
+   A: {Developer's answer}
+
+   ## Related Research
+   - {Links to other research documents}
+
+   ## Open Questions
+   {Only questions NOT resolved during checkpoint}
+   ```
+
+## Step 5: Present and Chain
+
+```
+Research document written to:
+`thoughts/shared/research/{filename}.md`
+
+{N} questions answered, {M} findings across {K} files.
+
+Please review and let me know if you have follow-up questions.
+
+---
+
+💬 Follow-up: describe the change in chat to append a timestamped Follow-up section to this artifact. Re-run `/skill:research` for a fresh artifact.
+
+**Next step (choose one):**
+- `/skill:design thoughts/shared/research/{filename}.md` — iterative design with vertical-slice decomposition (produces design artifact for plan)
+- `/skill:blueprint thoughts/shared/research/{filename}.md` — lightweight fast path for smaller tasks; combined design + phased plan in one pass (produces implement-ready plan directly)
+
+> 🆕 Tip: start a fresh session with `/new` first — chained skills work best with a clean context window.
+```
+
+## Step 6: Handle Follow-ups
+
+- **Append, never rewrite.** Edit the artifact to add a `## Follow-up Research {ISO 8601 timestamp}` section. Prior content stays immutable.
+- **Bump frontmatter.** Update `last_updated` + `last_updated_by`; set `last_updated_note: "Added follow-up research for <brief description>"`.
+- **Re-dispatch narrowly.** Spawn ≤1–2 fresh analysis agents scoped to the new question. Do NOT re-run the full skill.
+- **When to re-invoke instead.** If scope changed materially (different feature surface, different research target), re-run `/skill:research` for a fresh artifact. The previous block's `Next step:` stays valid for the existing artifact.
+
+## Important Notes
+
+- **Analysis only**: This skill answers questions. Question formulation is delegated to the scope-tracer subagent at Step 1.
+- **Single entry point**: Free-text research prompt. Argument substitution is handled by `rpiv-args`; scope-tracer runs in-band before analysis dispatch.
+- **Chained from discover**: when `$ARGUMENTS` mentions a `thoughts/shared/discover/*.md` artifact, read it FULLY in Step 1 and translate each Decision into a `Q (discover: <title>) / A: <Chosen>` Developer Context entry. Pass the FRD's `Recommended Approach` text as the scope-tracer topic. Open Questions carry forward verbatim. The `argument-hint` stays free-text-only — discover artifact recognition is by path-mention, not by argument-hint widening.
+- **Grouped dispatch**: Related questions are batched per agent based on file overlap. Default agent: codebase-analyzer. This reduces token waste from redundant file reads and lets agents build cross-question context.
+- **Downstream compatible**: Research documents feed directly into design and plan — the same Code References / Integration Points / Architecture Insights sections they expect.
+- **Agent-message parsing**: scope-tracer emits Discovery Summary + numbered Questions inline in its final assistant message; parse the agent's final-message text (no file write).
+- **Critical ordering**: Follow the numbered steps exactly
+  - ALWAYS dispatch scope-tracer first (Step 1)
+  - ALWAYS analyze question overlap for grouping (Step 1)
+  - ALWAYS wait for all agents to complete (Step 2)
+  - ALWAYS run developer checkpoint before writing (Step 3)
+  - ALWAYS gather metadata before writing (Step 4)
+  - NEVER write the document with placeholder values
+- **Frontmatter consistency**: Always include frontmatter, use snake_case fields
--- a/extensions/rpiv-pi/skills/resume-handoff/SKILL.md
+++ b/extensions/rpiv-pi/skills/resume-handoff/SKILL.md
@@ -0,0 +1,207 @@
+---
+name: resume-handoff
+description: Resume work from a handoff document produced by create-handoff. Reads the handoff, verifies current repo, branch, and state, and continues from where the previous session left off. Use at the start of a new session when the user references a handoff file, says "resume from handoff", "continue from where we left off", or invokes /resume-handoff.
+argument-hint: [handoff-path]
+---
+
+# Resume work from a handoff document
+
+You are tasked with resuming work from a handoff document through an interactive process. These handoffs contain critical context, learnings, and next steps from previous work sessions that need to be understood and continued.
+
+## Initial Response
+
+When this command is invoked:
+
+1. **If the path to a handoff document was provided**:
+   - If a handoff document path was provided as a parameter, skip the default message
+   - Immediately read the handoff document FULLY using the Read tool
+   - Immediately read any research or plan documents that it links to under `thoughts/shared/plans` or `thoughts/shared/research` or `thoughts/shared/solutions`. Read these critical files DIRECTLY using the Read tool - do NOT invoke skills for this initial reading phase.
+   - Begin the analysis process by ingesting relevant context from the handoff document, reading additional files it mentions
+   - Then propose a course of action to the user and confirm, or ask for clarification on direction.
+
+2. **If no parameters provided**, respond with:
+```
+I'll help you resume work from a handoff document. Let me find the available handoffs.
+
+Which handoff would you like to resume from?
+
+Tip: You can invoke this command directly with a handoff path: `/skill:resume-handoff thoughts/shared/handoffs/YYYY-MM-DD_HH-MM-SS_description.md`
+```
+
+Then wait for the user's input.
+
+## Process Steps
+
+### Step 1: Read and Analyze Handoff
+
+1. **Read handoff document completely**:
+   - Use the Read tool WITHOUT limit/offset parameters
+   - Extract all sections:
+     - Task(s) and their statuses
+     - Recent changes
+     - Learnings
+     - Artifacts
+     - Action items and next steps
+     - Other notes
+
+2. **Spawn focused research agents**:
+   After reading all critical handoff/plan/research documents directly, spawn the agents below in parallel using the Agent tool. Wait for ALL agents to complete before proceeding.
+
+   ```
+   Task 1 - Gather artifact context:
+   Read all artifacts mentioned in the handoff.
+   1. Read feature documents listed in "Artifacts"
+   2. Read implementation plans referenced
+   3. Read any research documents mentioned
+   4. Extract key requirements and decisions
+   Use tools: Read
+   Return: Summary of artifact contents and key decisions
+   ```
+
+3. **Wait for ALL agents to complete** before proceeding
+
+4. **Verify current state**:
+   - Read files from "Learnings" section completely to validate patterns still apply
+   - Read files from "Recent changes" to verify modifications are still present
+   - Use git log or git diff if needed to check commit history since handoff
+   - Re-read implementation files mentioned to confirm current state matches handoff expectations
+   - Read any new related files discovered during research
+
+### Step 2: Synthesize and Present Analysis
+
+1. **Present comprehensive analysis**:
+   ```
+   I've analyzed the handoff from {date} by {author}. Here's the current situation:
+
+   **Original Tasks:**
+   - {Task 1}: {Status from handoff} → {Current verification}
+   - {Task 2}: {Status from handoff} → {Current verification}
+
+   **Key Learnings Validated:**
+   - {Learning with file:line reference} - {Still valid/Changed}
+   - {Pattern discovered} - {Still applicable/Modified}
+
+   **Recent Changes Status:**
+   - {Change 1} - {Verified present/Missing/Modified}
+   - {Change 2} - {Verified present/Missing/Modified}
+
+   **Artifacts Reviewed:**
+   - {Document 1}: {Key takeaway}
+   - {Document 2}: {Key takeaway}
+
+   **Recommended Next Actions:**
+   Based on the handoff's action items and current state:
+   1. {Most logical next step based on handoff}
+   2. {Second priority action}
+   3. {Additional tasks discovered}
+
+   **Potential Issues Identified:**
+   - {Any conflicts or regressions found}
+   - {Missing dependencies or broken code}
+
+   ```
+
+   Use the `ask_user_question` tool to confirm the approach. Question: "{Summary of recommended next action}. Proceed?". Header: "Resume". Options: "Proceed (Recommended)" (Begin with {recommended action 1}); "Adjust approach" (Change the order or scope of next steps); "Re-analyze" (The codebase has changed — re-verify state first).
+
+### Step 3: Create Action Plan
+
+1. **Create a task list**:
+   - Convert action items from handoff into todos
+   - Add any new tasks discovered during analysis
+   - Prioritize based on dependencies and handoff guidance
+
+2. **Present the plan**:
+   ```
+   I've created a task list based on the handoff and current analysis:
+
+   {Show todo list}
+
+   Ready to begin with the first task: {task description}?
+   ```
+
+### Step 4: Begin Implementation
+
+1. **Start with the first approved task**
+2. **Reference learnings from handoff** throughout implementation
+3. **Apply patterns and approaches documented** in the handoff
+4. **Update progress** as tasks are completed
+
+## Guidelines
+
+1. **Be Thorough in Analysis**:
+   - Read the entire handoff document first
+   - Verify ALL mentioned changes still exist
+   - Check for any regressions or conflicts
+   - Read all referenced artifacts
+
+2. **Be Interactive**:
+   - Present findings before starting work
+   - Get buy-in on the approach
+   - Allow for course corrections
+   - Adapt based on current state vs handoff state
+
+3. **Leverage Handoff Wisdom**:
+   - Pay special attention to "Learnings" section
+   - Apply documented patterns and approaches
+   - Avoid repeating mistakes mentioned
+   - Build on discovered solutions
+
+4. **Track Continuity**:
+   - Keep the task list updated to maintain task continuity
+   - Reference the handoff document in commits
+   - Document any deviations from original plan
+   - Consider creating a new handoff when done
+
+5. **Validate Before Acting**:
+   - Never assume handoff state matches current state
+   - Verify all file references still exist by reading them
+   - Check for breaking changes since handoff using git log/diff or by reading modified files
+   - Confirm patterns mentioned in "Learnings" are still valid by examining current code
+   - Compare handoff timestamps with current git commits to assess how much has changed
+
+## Common Scenarios
+
+### Scenario 1: Clean Continuation
+- All changes from handoff are present
+- No conflicts or regressions
+- Clear next steps in action items
+- Proceed with recommended actions
+
+### Scenario 2: Diverged Codebase
+- Some changes missing or modified
+- New related code added since handoff
+- Need to reconcile differences
+- Adapt plan based on current state
+
+### Scenario 3: Incomplete Handoff Work
+- Tasks marked as "in_progress" in handoff
+- Need to complete unfinished work first
+- May need to re-understand partial implementations
+- Focus on completing before new work
+
+### Scenario 4: Stale Handoff
+- Significant time has passed
+- Major refactoring has occurred
+- Original approach may no longer apply
+- Need to re-evaluate strategy
+
+## Example Interaction Flow
+
+```
+User: /skill:resume-handoff thoughts/shared/handoffs/2025-01-08_14-30-15_webhook-validation.md
+Assistant: Let me read and analyze that handoff document...
+
+{Reads handoff completely}
+{Spawns research agents}
+{Waits for completion}
+{Reads identified files}
+
+I've analyzed the handoff from {date}. Here's the current situation...
+
+{Presents analysis}
+
+Shall I proceed with implementing the webhook validation fix, or would you like to adjust the approach?
+
+User: Yes, proceed with the webhook validation
+Assistant: {Creates todo list and begins implementation}
+```
--- a/extensions/rpiv-pi/skills/revise/SKILL.md
+++ b/extensions/rpiv-pi/skills/revise/SKILL.md
@@ -0,0 +1,276 @@
+---
+name: revise
+description: Surgically update an existing implementation plan in thoughts/shared/plans/ based on review feedback, mid-implementation discoveries, or new constraints, preserving structure and quality rather than rewriting. Use when the user wants a plan adjusted after code-review feedback, has hit a blocker mid-implement, scope changed, or asks to "revise the plan".
+argument-hint: "[plan-path] [feedback]"
+---
+
+# Iterate Implementation Plan
+
+You are tasked with updating existing implementation plans based on user feedback. You should be skeptical, thorough, and ensure changes are grounded in actual codebase reality.
+
+## Initial Response
+
+When this command is invoked:
+
+1. **Parse the input to identify**:
+   - Plan file path (e.g., `thoughts/shared/plans/2025-10-16_09-00-00_feature.md`)
+   - Whether the user accidentally provided a review artifact path instead (e.g., `thoughts/shared/reviews/2025-10-16_10-00-00_feature.md`)
+   - Requested changes/feedback
+
+2. **Handle different input scenarios**:
+
+   **If a REVIEW artifact path is provided**:
+   ```
+   `revise` updates implementation plans, not review artifacts.
+
+   If you want to act on code-review findings, provide the target plan path plus the changes to make.
+
+   Example:
+   `/skill:revise thoughts/shared/plans/2025-10-16_09-00-00_feature.md "Address the findings from thoughts/shared/reviews/2025-10-16_10-00-00_feature.md by tightening validation in Phase 2 and expanding success criteria."`
+   ```
+   Wait for user input.
+
+   **If NO plan file provided**:
+   ```
+   I'll help you iterate on an existing implementation plan.
+
+   Which plan would you like to update? Please provide the path to the plan file (e.g., `thoughts/shared/plans/2025-10-16_09-00-00_feature.md`).
+
+   If you're coming from `/skill:code-review`, pass the relevant plan path and summarize which findings should change the plan.
+
+   Tip: You can list recent plans with `ls -lt thoughts/shared/plans/ | head`
+   ```
+   Wait for user input, then re-check for feedback.
+
+   **If plan file provided but NO feedback**:
+   ```
+   I've found the plan at {path}. What changes would you like to make?
+
+   For example:
+   - "Add a phase for migration handling"
+   - "Update the success criteria to include performance tests"
+   - "Adjust the scope to exclude feature X"
+   - "Split Phase 2 into two separate phases"
+   ```
+   Wait for user input.
+
+   **If BOTH plan file AND feedback provided**:
+   - Proceed immediately to Step 1
+   - No preliminary questions needed
+
+## Process Steps
+
+### Step 1: Read and Understand Current Plan
+
+1. **Read the existing plan file COMPLETELY**:
+   - Use the Read tool WITHOUT limit/offset parameters
+   - Understand the current structure, phases, and scope
+   - Note the success criteria and implementation approach
+
+2. **Understand the requested changes**:
+   - Parse what the user wants to add/modify/remove
+   - Identify if changes require codebase research
+   - Determine scope of the update
+
+### Step 2: Research If Needed
+
+**Only spawn research tasks if the changes require new technical understanding.**
+
+If the user's feedback requires understanding new code patterns or validating assumptions:
+
+1. **Spawn parallel agents for research** using the Agent tool:
+   **For code investigation:**
+   - Use the **codebase-locator** agent to find relevant files
+   - Use the **codebase-analyzer** agent to understand implementation details
+   - Use the **codebase-pattern-finder** agent to find similar patterns
+
+   **For historical context:**
+   - Use the **thoughts-locator** agent to find related research or decisions in thoughts/
+   - Use the **thoughts-analyzer** agent to extract insights from documents
+
+   **Be EXTREMELY specific about directories**:
+   - Include full path context in prompts
+
+2. **Read any new files identified by research**:
+   - Read them FULLY into the main context
+   - Cross-reference with the plan requirements
+
+3. **Wait for ALL agents to complete** before proceeding
+
+### Step 3: Present Understanding and Approach
+
+Before making changes, confirm your understanding:
+
+```
+Based on your feedback, I understand you want to:
+- {Change 1 with specific detail}
+- {Change 2 with specific detail}
+
+My research found:
+- {Relevant code pattern or constraint}
+- {Important discovery that affects the change}
+
+I plan to update the plan by:
+1. {Specific modification to make}
+2. {Another modification}
+
+Does this align with your intent?
+```
+
+Use the `ask_user_question` tool to confirm before editing. Question: "{Summary of planned modifications}. Proceed with these edits?". Header: "Changes". Options: "Proceed (Recommended)" (Apply the planned changes to the existing plan); "Adjust approach" (Modify what will be changed before editing); "Show me first" (Show the exact text changes before applying).
+
+### Step 4: Update the Plan
+
+1. **Make focused, precise edits** to the existing plan:
+   - Use the Edit tool for surgical changes
+   - NEVER use Write tool - plan files already exist, use Edit tool only
+   - Maintain the existing structure unless explicitly changing it
+   - Keep all file:line references accurate
+   - Update success criteria if needed
+
+2. **Ensure consistency**:
+   - If adding a new phase, ensure it follows the existing pattern
+   - If modifying scope, update "What We're NOT Doing" section
+   - If changing approach, update "Implementation Approach" section
+   - Maintain the distinction between automated vs manual success criteria
+   - If the plan has YAML frontmatter, run `date +"%Y-%m-%dT%H:%M:%S%z"` once and use the output for `last_updated`; set `last_updated_by` to your name
+
+3. **Preserve quality standards**:
+   - Include specific file paths and line numbers for new content
+   - Write measurable success criteria
+   - Use project's build/test commands (`make`, `npm`, etc.) for automated verification
+   - Keep language clear and actionable
+
+### Step 5: Sync and Review
+
+1. **Present the changes made**:
+   ```
+   Plan updated at `thoughts/shared/plans/{filename}.md`
+
+   Changes made:
+   - {Specific change 1}
+   - {Specific change 2}
+
+   The updated plan now:
+   - {Key improvement}
+   - {Another improvement}
+
+   Let me know if you want further adjustments — otherwise chain forward.
+
+   ---
+
+   💬 Follow-up: describe further plan changes in chat — each `/skill:revise` call appends another timestamped Follow-up section, history is preserved.
+
+   **Next step:** `/skill:implement thoughts/shared/plans/{filename}.md Phase {N}` — resume execution at the affected phase (or omit `Phase {N}` to run all phases sequentially).
+
+   > 🆕 Tip: start a fresh session with `/new` first — chained skills work best with a clean context window.
+   ```
+
+## Step 6: Handle Follow-ups
+
+- **Each invocation appends history.** Every `/skill:revise` call adds another timestamped Follow-up section — do not collapse history. Prior phase decisions stay visible.
+- **Bump frontmatter.** Update `last_updated` + `last_updated_by`; set `last_updated_note: "<one-line summary of revision>"`.
+- **Surgical edits only.** Make precise edits to specific phases or success criteria — not wholesale rewrites. Preserve good content that doesn't need changing.
+- **When to re-invoke instead.** For deep architectural changes, the upstream design or research is the right place to revise — re-run those rather than expanding revise's scope. The previous block's `Next step:` stays valid for the existing plan.
+
+## Important Guidelines
+
+1. **Be Skeptical**:
+   - Don't blindly accept change requests that seem problematic
+   - Question vague feedback - ask for clarification
+   - Use AskUserQuestion tool for structured clarification when there are multiple valid approaches
+   - Verify technical feasibility with code research
+   - Point out potential conflicts with existing plan phases
+
+2. **Be Surgical**:
+   - Make precise edits, not wholesale rewrites
+   - Preserve good content that doesn't need changing
+   - Only research what's necessary for the specific changes
+   - Don't over-engineer the updates
+
+3. **Be Thorough**:
+   - Read the entire existing plan before making changes
+   - Research code patterns if changes require new technical understanding
+   - Ensure updated sections maintain quality standards
+   - Verify success criteria are still measurable
+
+4. **Be Interactive**:
+   - Confirm understanding before making changes
+   - Show what you plan to change before doing it
+   - Allow course corrections
+   - Don't disappear into research without communicating
+
+5. **Track Progress**:
+   - Update todos as you complete research
+   - Mark tasks complete when done
+
+6. **No Open Questions**:
+   - If the requested change raises questions, ASK
+   - Research or get clarification immediately
+   - Do NOT update the plan with unresolved questions
+   - Every change must be complete and actionable
+
+## Success Criteria Guidelines
+
+When updating success criteria, always maintain the two-category structure:
+
+1. **Automated Verification** (can be run by execution agents):
+   - Commands that can be run: `make test`, `npm run lint`, etc.
+   - Specific files that should exist
+   - Code compilation/type checking
+
+2. **Manual Verification** (requires human testing):
+   - UI/UX functionality
+   - Performance under real conditions
+   - Edge cases that are hard to automate
+   - User acceptance criteria
+
+## Subagent Invocation Best Practices
+
+When spawning research agents:
+
+1. **Only spawn if truly needed** - don't research for simple changes
+2. **Parallel dispatch** — every `Agent(...)` call in the same assistant message (multiple tool_use blocks in one response), never one per turn. Call shape: `Agent({ subagent_type: "<agent-name>", description: "<3-5 word task label>", prompt: "<task>" })`.
+3. **Each agent should be focused** on a specific area
+4. **Provide detailed instructions** including:
+   - Exactly what to search for
+   - Which directories to focus on
+   - What information to extract
+   - Expected output format
+5. **Request specific file:line references** in responses
+6. **Wait for all agents to complete** before synthesizing
+7. **Verify agent results** - if something seems off, spawn follow-up agents
+
+## Example Interaction Flows
+
+**Scenario 1: User provides everything upfront**
+```
+User: /skill:revise thoughts/shared/plans/2025-10-16_09-00-00_feature.md - add phase for error handling
+Assistant: {Reads plan, researches error handling patterns, updates plan}
+```
+
+**Scenario 2: User provides just plan file**
+```
+User: /skill:revise thoughts/shared/plans/2025-10-16_09-00-00_feature.md
+Assistant: I've found the plan. What changes would you like to make?
+User: Split Phase 2 into two phases - one for backend, one for frontend
+Assistant: {Proceeds with update}
+```
+
+**Scenario 3: User provides no arguments**
+```
+User: /skill:revise
+Assistant: Which plan would you like to update? Please provide the path...
+User: thoughts/shared/plans/2025-10-16_09-00-00_feature.md
+Assistant: I've found the plan. What changes would you like to make?
+User: Add more specific success criteria
+Assistant: {Proceeds with update}
+```
+
+**Scenario 4: User passes a review artifact instead of a plan**
+```
+User: /skill:revise thoughts/shared/reviews/2025-10-16_10-00-00_feature.md
+Assistant: `revise` updates implementation plans, not review artifacts. Please provide the target plan path plus the changes to make.
+User: /skill:revise thoughts/shared/plans/2025-10-16_09-00-00_feature.md "Address the review findings by splitting Phase 2 and adding validation coverage"
+Assistant: {Proceeds with update}
+```
--- a/extensions/rpiv-pi/skills/validate/SKILL.md
+++ b/extensions/rpiv-pi/skills/validate/SKILL.md
@@ -0,0 +1,190 @@
+---
+name: validate
+description: Verify that an implementation plan was correctly executed by running each phase's success criteria against the working tree and producing a validation report. Use after the implement skill completes, when the user asks to "validate the plan", wants a post-implementation audit, or needs to confirm a feature is fully shipped per its plan.
+argument-hint: [plan-path]
+allowed-tools: Read, Bash(git *), Bash(make *), Glob, Grep, Agent
+---
+
+# Validate Plan
+
+You are tasked with validating that an implementation plan was correctly executed, verifying all success criteria and identifying any deviations or issues.
+
+## Initial Setup
+
+When invoked:
+1. **Determine context** - Are you in an existing conversation or starting fresh?
+   - If existing: Review what was implemented in this session
+   - If fresh: Need to discover what was done through git and codebase analysis
+
+2. **Locate the plan**:
+   - If plan path provided, use it
+   - Otherwise, search recent commits for plan references or ask user
+
+3. **Gather implementation evidence**:
+   ```bash
+   # Check recent commits
+   git log --oneline -n 20
+   git diff HEAD~N..HEAD  # Where N covers implementation commits
+
+   # Run comprehensive checks
+   cd $(git rev-parse --show-toplevel) && make check test
+   ```
+
+## Validation Process
+
+### Step 1: Context Discovery
+
+If starting fresh or need more context:
+
+   **If the injected git context shows "not a git repo":**
+   - Skip git-based evidence gathering (git log, git diff)
+   - Validate via file inspection, automated test commands, and plan checklist
+   - Note in report: "Git history unavailable — validation based on file inspection only"
+
+1. **Read the implementation plan** completely
+2. **Identify what should have changed**:
+   - List all files that should be modified
+   - Note all success criteria (automated and manual)
+   - Identify key functionality to verify
+
+3. **Spawn parallel research agents** to verify implementation:
+
+   Spawn the agents below in parallel using the Agent tool. Wait for ALL agents to complete before proceeding.
+   - **general-purpose** agent — Verify implementation details match plan specifications (analyzer role)
+   - **general-purpose** agent — Verify implementation follows established codebase patterns (pattern-finder role)
+
+   Example agent prompts:
+   - "Analyze {component} and verify it implements {plan requirement} correctly"
+   - "Find patterns similar to {new code} and check if conventions are followed"
+
+   Also gather evidence directly:
+   ```bash
+   # Check recent commits
+   git log --oneline -n 20
+   git diff HEAD~N..HEAD  # Where N covers implementation commits
+
+   # Run comprehensive checks
+   cd $(git rev-parse --show-toplevel) && make check test
+   ```
+
+### Step 2: Systematic Validation
+
+For each phase in the plan:
+
+1. **Check completion status**:
+   - Look for checkmarks in the plan (- [x])
+   - Verify the actual code matches claimed completion
+
+2. **Run automated verification**:
+   - Execute each command from "Automated Verification"
+   - Document pass/fail status
+   - If failures, investigate root cause
+
+3. **Assess manual criteria**:
+   - List what needs manual testing
+   - Provide clear steps for user verification
+
+4. **Think deeply about edge cases**:
+   - Were error conditions handled?
+   - Are there missing validations?
+   - Could the implementation break existing functionality?
+
+### Step 3: Generate Validation Report
+
+Create comprehensive validation summary:
+
+```markdown
+## Validation Report: {Plan Name}
+
+### Implementation Status
+✓ Phase 1: {Name} - Fully implemented
+✓ Phase 2: {Name} - Fully implemented
+⚠️ Phase 3: {Name} - Partially implemented (see issues)
+
+### Automated Verification Results
+✓ Build passes: `make build`
+✓ Tests pass: `make test`
+✗ Linting issues: `make lint` (3 warnings)
+
+### Code Review Findings
+
+#### Matches Plan:
+- Database migration correctly adds {table}
+- API endpoints implement specified methods
+- Error handling follows plan
+
+#### Deviations from Plan:
+- Used different variable names in {file:line}
+- Added extra validation in {file:line} (improvement)
+
+#### Potential Issues:
+- Missing index on foreign key could impact performance
+- No rollback handling in migration
+
+### Manual Testing Required:
+1. UI functionality:
+   - [ ] Verify {feature} appears correctly
+   - [ ] Test error states with invalid input
+
+2. Integration:
+   - [ ] Confirm works with existing {component}
+   - [ ] Check performance with large datasets
+
+### Recommendations:
+- Address linting warnings before merge
+- Consider adding integration test for {scenario}
+- Document new API endpoints
+
+---
+
+💬 Follow-up: if findings are localized, fix them and re-run `/skill:validate`. If findings imply plan-level changes, escalate to `/skill:revise <plan-path>` first.
+
+**Next step:** `/skill:commit` — group the validated changes into atomic commits (skip if status is `needs_changes` — fix the gaps first, then re-run `/skill:validate`).
+
+> 🆕 Tip: start a fresh session with `/new` first — chained skills work best with a clean context window.
+```
+
+## Handle Follow-ups
+
+- **Validate does not edit code or plans.** It produces a report. Fixes happen in implement; plan revisions happen in revise.
+- **Localized gaps.** If findings are small and localized, fix them in-place and re-run `/skill:validate` for a fresh report.
+- **Plan-level gaps.** If findings imply the plan itself is wrong (missing phases, wrong approach, untestable success criteria), escalate to `/skill:revise <plan-path>` first, then re-implement, then re-validate.
+- **No append mode.** Each validation run produces a fresh report — there is no `## Follow-up` append. The previous block's `Next step:` stays valid only when status is `complete`.
+
+## Working with Existing Context
+
+If you were part of the implementation:
+- Review the conversation history
+- Check your todo list for what was completed
+- Focus validation on work done in this session
+- Be honest about any shortcuts or incomplete items
+
+## Important Guidelines
+
+1. **Be thorough but practical** - Focus on what matters
+2. **Run all automated checks** - Don't skip verification commands
+3. **Document everything** - Both successes and issues
+4. **Think critically** - Question if the implementation truly solves the problem
+5. **Consider maintenance** - Will this be maintainable long-term?
+
+## Validation Checklist
+
+Always verify:
+- [ ] All phases marked complete are actually done
+- [ ] Automated tests pass
+- [ ] Code follows existing patterns
+- [ ] No regressions introduced
+- [ ] Error handling is robust
+- [ ] Documentation updated if needed
+- [ ] Manual test steps are clear
+
+## Relationship to Other Skills
+
+Recommended workflow:
+1. `/skill:implement` - Execute the implementation
+2. `/skill:commit` - Create atomic commits for changes
+3. `/skill:validate` - Verify implementation correctness
+
+The validation works best after commits are made, as it can analyze the git history to understand what was implemented.
+
+Remember: Good validation catches issues before they reach production. Be constructive but thorough in identifying gaps or improvements.
--- a/extensions/rpiv-pi/skills/write-test-cases/SKILL.md
+++ b/extensions/rpiv-pi/skills/write-test-cases/SKILL.md
@@ -0,0 +1,324 @@
+---
+name: write-test-cases
+description: Generate manual test-case specifications for a single feature by analyzing the implementing code in parallel, producing flow-based test cases plus a regression suite and project-wide coverage map under .rpiv/test-cases/{feature}/. Consumes an outline-test-cases _meta.md when available for warm-start. Use when the user wants test cases written for a specific feature, asks for QA specs, or has run outline-test-cases and is ready to flesh out a feature.
+argument-hint: "[feature name, component path, feature slug, or _meta.md path] [additional instructions]"
+---
+
+# Write Test Cases
+
+You are tasked with generating manual test case specifications for a single feature by analyzing code in parallel and producing flow-based test case documents for QA teams.
+
+## Initial Setup
+
+When this command is invoked, respond with:
+```
+I'll generate test cases for this feature. Let me discover the relevant code and analyze it.
+```
+
+Then proceed to Step 1.
+
+## Steps
+
+### Step 1: Determine Feature Scope
+
+Parse the user's input to determine the feature under test. Handle these input forms:
+
+1. **_meta.md path** (e.g., `.rpiv/test-cases/users/_meta.md`):
+   - Read the file. Extract `feature` from frontmatter. Mark as **has _meta.md**.
+
+2. **Feature folder or slug** (e.g., `.rpiv/test-cases/order-management/` or `order-management`):
+   - Check if `.rpiv/test-cases/{input}/_meta.md` exists
+   - If yes: read it, extract `feature`, mark as **has _meta.md**
+   - If no: treat as feature name
+
+3. **Source code path** (e.g., `src/orders/` or `src/api/controllers/OrdersController.ts`):
+   - Use the path directly as the starting point for analysis
+
+4. **Feature name with optional instructions** (e.g., `Order Management focus on refund edge cases`):
+   - Parse as `{feature identifier} [additional instructions]`
+   - Check if `.rpiv/test-cases/{slugified-name}/_meta.md` exists — if yes, read it and mark as **has _meta.md**
+   - Store additional instructions as supplemental context for agent prompts and checkpoint
+
+5. **No arguments provided**:
+   ```
+   I'll help you generate test cases. Please provide either:
+   1. A feature name: `/skill:write-test-cases Order Management`
+   2. A component path: `/skill:write-test-cases src/orders/`
+   3. A feature slug: `/skill:write-test-cases order-management`
+   4. A _meta.md path: `/skill:write-test-cases .rpiv/test-cases/orders/_meta.md`
+
+   Add instructions after the feature: `/skill:write-test-cases Order Management focus on refund edge cases`
+   ```
+   Then wait for input.
+
+#### Warm-Start from _meta.md
+
+When `_meta.md` is available, read it FULLY and extract:
+- **Identity**: `feature`, `module`, `portal`, `slug` from frontmatter
+- **Routes**: from `## Routes` section — route paths and component names
+- **Endpoints**: from `## Endpoints` section — HTTP methods and paths
+- **Scope decisions**: from `## Scope Decisions` section — in/out of scope items
+- **Domain context**: from `## Domain Context` section — business rules and intentional behaviors
+- **Checkpoint history**: from `## Checkpoint History` section — prior Q&A pairs
+
+Report:
+```
+[Warm-start]: Found _meta.md for "{feature}" ({module}, {portal}). {N} routes, {M} endpoints.
+```
+
+When no _meta.md, detect the project's technology stack before spawning agents: check `package.json` for framework indicators (see Framework Detection Reference at end of document). If no `package.json`, check for `.csproj`/`.sln` (.NET), `pyproject.toml`/`requirements.txt` (Python). Use the detected framework to adapt Agent A's prompt in Step 2.
+
+### Step 2: Discover Feature Code (parallel agents)
+
+Spawn the following agents in parallel using the Agent tool. Wait for ALL agents to complete before proceeding.
+
+**Agent A — Web Layer Discovery:**
+- subagent_type: `codebase-locator`
+- When _meta.md is available: "Validate these known Web Layer entry points for {feature name}: {routes and endpoints from _meta.md}. Check if they still exist and find any NEW entry points not in this list. Report: confirmed (still exists), removed (no longer found), new (not in the list)."
+- When no _meta.md: "Find all Web Layer entry points for the {feature name} feature{framework_hint}. Look for: controllers, route definitions, page components, form handlers, API endpoints. Search across all web layers (API, Admin, Customer Portal, Host, etc.). Also find frontend service files, HTTP clients, or API call sites that reference these endpoints — report which frontend pages call which backend URLs. For each entry point found, report: file path, HTTP method/route or page path, and a one-line description of what it does. Group by web layer."
+
+{framework_hint} is " in this {Framework} project" when a framework is detected (e.g., " in this Angular project"), or empty string if none detected. See Framework Detection Reference at end of document.
+
+**Agent B — Existing Test Cases:**
+- subagent_type: `test-case-locator`
+- Prompt: "Search for existing test cases related to {feature name} in .rpiv/test-cases/. Report any existing TCs with their IDs, titles, and priorities so we can avoid duplicates."
+
+Wait for both agents to complete before proceeding.
+
+### Step 3: Analyze Feature Code (parallel agents)
+
+Using the entry points discovered in Step 2 (validated against _meta.md when available), spawn analysis agents in parallel. When _meta.md is available, enrich prompts: append scope exclusions from `## Scope Decisions` as {scope_context}, domain rules from `## Domain Context` as {domain_context}, and endpoint list as {endpoint_scope}. When no _meta.md, omit these.
+
+**Agent C — Code Analysis:**
+- subagent_type: `codebase-analyzer`
+- Prompt: "Analyze the {feature name} feature implementation in detail. Read the controllers/route handlers at {discovered paths}. For each endpoint/action, determine: 1) What user input is accepted (request body, query params, form fields)? 2) What validation rules exist — report specific limits (max lengths, regex patterns, required vs optional)? 3) What business logic is executed? 4) What are the success/error responses? 5) What authorization/permissions are required? Focus on understanding USER FLOWS — sequences of actions a user would perform to accomplish a goal. ALSO read the frontend page components and templates at {discovered frontend paths}. Extract what a QA tester would actually see: exact button labels, form field labels/placeholders, navigation items, table column headers, success/error messages, and conditional UI (role- or state-dependent elements). Resolve any i18n translation keys to displayed text. Report UI elements per page/route alongside the backend analysis.{scope_context}{domain_context}"
+
+**Agent D — Postcondition Discovery:**
+- subagent_type: `integration-scanner`
+- Prompt: "Find all side effects triggered by {feature name} actions{endpoint_scope}. Look for: domain events published, message handlers invoked, email/notification triggers, external API calls, database cascades, cache invalidations, audit log entries, webhook dispatches. For each side effect, report: what triggers it (which action/endpoint) and where the handler code lives (file:line). Do NOT describe what the handler does — only locate it. These locations become postconditions in test cases.{scope_context}"
+
+Wait for ALL agents to complete before proceeding.
+
+### Step 4: Synthesize Findings
+
+Compile all agent results into a feature analysis:
+
+1. **Map user flows** — Group the discovered endpoints/pages into logical user journeys:
+   - Identify the natural sequence of actions (e.g., browse -> select -> configure -> checkout -> confirm)
+   - Each flow should represent a complete user goal, not isolated actions
+   - A feature typically produces 3-8 flows depending on complexity
+   - **When to separate**: If view and edit serve different user goals, keep them as separate flows. If a sub-operation (e.g., replace, export, bulk action) has its own trigger and confirmation, it deserves its own flow. If different user roles interact with the same entity differently, split by role.
+   - **Use real UI element names** from Agent C's frontend analysis — actual button labels, form field names, navigation text, displayed messages. Do not infer UI element names from backend action semantics.
+
+2. **Enrich with postconditions** — For each flow, attach the side effects discovered by the integration-scanner:
+   - Map domain events to specific flow steps
+   - Include cross-system effects (emails, webhooks, inventory changes)
+
+3. **Check for duplicates** — Cross-reference synthesized flows against existing TCs from test-case-locator:
+   - If an existing TC covers a flow, note it and skip that flow
+   - If partial overlap, note the gap to fill
+
+4. **Assign priorities**:
+   - **high**: Core happy path, payment/money flows, data integrity, security-critical
+   - **medium**: Alternative paths, common edge cases, permission boundaries
+   - **low**: Rare edge cases, cosmetic validation, error message wording
+
+5. **Determine test case IDs**:
+   - Module abbreviation: from _meta.md `module` field, or derive from feature name (e.g., Order Management -> ORD)
+   - Numbering: start at 001, or continue from highest existing TC ID if duplicates found
+   - Format: `TC-{MODULE}-{NNN}`
+
+**Do NOT write test cases yet** — proceed to the developer checkpoint first.
+
+### Step 5: Developer Checkpoint
+
+Present a flow summary, then ask grounded questions one at a time.
+
+**Flow summary** (under 20 lines):
+```
+## Feature: {Feature Name}
+
+Entry points: {N} endpoints across {M} web layers
+Postconditions: {K} side effects discovered
+Existing TCs: {X} found (will skip duplicates)
+
+### Proposed Test Cases:
+1. TC-{MOD}-001: {Flow title} (priority: high)
+   Steps: {brief flow summary — e.g., "browse -> add to cart -> checkout -> payment -> confirm"}
+2. TC-{MOD}-002: {Flow title} (priority: medium)
+   Steps: {brief flow summary}
+{etc.}
+
+Flows skipped (already covered): {list or "none"}
+```
+
+When _meta.md is available, prepend:
+```
+### Prior Scope Decisions (from outline):
+- {decision 1}
+- {decision 2}
+These are carried forward. I'll only ask about new findings.
+```
+
+Then ask grounded questions — **one at a time**. Use a **❓ Question:** prefix so the developer knows their input is needed. Each question must reference real findings with file:line evidence and pull NEW information from the developer. Focus on:
+
+- Missing flows the code analysis couldn't detect (e.g., "I found create/update/delete flows but no bulk import — is that a feature?")
+- Postconditions the integration-scanner might have missed (e.g., "No webhook found for order status changes — is there an external notification I'm not seeing?")
+- Priority overrides (e.g., "I marked refund flow as medium — should it be high given payment implications?")
+- User roles and permissions that affect test preconditions
+- Test data requirements not obvious from code
+
+When _meta.md is available: skip questions already answered in `## Checkpoint History`. Only ask about new findings not covered by prior decisions.
+
+**CRITICAL**: Ask ONE question at a time. Wait for the answer before asking the next. Lead with your most significant finding.
+
+**Choosing question format:**
+
+- **`ask_user_question` tool** — when your question has 2-4 concrete options from code analysis (pattern conflicts, integration choices, scope boundaries, priority overrides). The user can always pick "Other" for free-text. Example: Use the `ask_user_question` tool with the question "Found 2 mapping approaches — which should new code follow?". Options: "Manual mapping (Recommended)" (Used in OrderService (src/services/OrderService.ts:45) — 8 occurrences); "AutoMapper" (Used in UserService (src/services/UserService.ts:12) — 2 occurrences).
+
+- **Free-text with ❓ Question: prefix** — when the question is open-ended and options can't be predicted (discovery, "what am I missing?", corrections). Example:
+  "❓ Question: Integration scanner found no background job registration for this area. Is that expected, or is there async processing I'm not seeing?"
+
+**Batching**: When you have 2-4 independent questions (answers don't depend on each other), you MAY batch them in a single `ask_user_question` call. Keep dependent questions sequential.
+
+**Classify each response:**
+
+**Corrections** (e.g., "that flow doesn't exist", "wrong priority"):
+- Update flow list. Record in notes.
+
+**Missing flows** (e.g., "you missed the bulk export feature"):
+- Spawn targeted **codebase-analyzer** (max 1 agent) to analyze the missing area.
+- Add the flow to the list.
+
+**Scope adjustments** (e.g., "skip admin flows, focus on customer portal"):
+- Remove out-of-scope flows. Record the adjustment.
+
+**Confirmations** (e.g., "looks good", "yes proceed"):
+- Proceed to Step 6.
+
+### Step 6: Generate Test Case Documents
+
+Read the templates before writing:
+- Read the full test case template at `templates/test-case.md`
+- Read the full regression suite template at `templates/regression-suite.md`
+
+See `examples/order-placement-flow.md` (e-commerce order flow), `examples/customer-auth-flow.md` (authentication flow), and `examples/team-management-flow.md` (SaaS team management flow) for well-formed test case examples.
+
+What makes these examples good:
+- **Steps are user-centric** — "Navigate to...", "Click...", "Enter..." — not technical ("POST to /api/orders")
+- **Expected results are observable** — what the user SEES, not internal state changes
+- **Postconditions verify side effects** — email sent, inventory updated, audit logged
+- **Edge cases are separate bullets** — not crammed into steps
+- **Preconditions are specific** — exact user role, required test data, system state
+
+See `examples/order-management-suite.md` and `examples/team-management-suite.md` for well-formed regression suite examples.
+
+What makes these examples good:
+- **Smoke subset is minimal** — 2-4 high-priority TCs covering critical paths
+- **Priority ordering** — high -> medium -> low within the full regression table
+- **Coverage map** cross-references TCs against feature sub-areas
+- **Gaps section** flags known uncovered areas for future work
+
+**For each confirmed flow**, generate a test case document:
+- Follow the test-case.md template exactly
+- Write user-facing actions in Steps (what they click/type/navigate), not API calls
+- Use actual UI element names discovered by Agent C (button labels, form fields, navigation items, messages) — do NOT fabricate element names from backend semantics. If Agent C didn't find a specific label, describe the element generically (e.g., "submit button" not "Click 'Save Changes'")
+- Expected results describe what the user observes (success message, redirect, updated list)
+- Postconditions describe system-level side effects (from integration-scanner findings)
+- Edge cases list variant scenarios worth separate testing
+- Include preconditions: user role, required test data, system state
+- Include `commit` in frontmatter with current git commit hash
+
+**After all TCs**, generate the regression suite document:
+- Follow the regression-suite.md template
+- List all TCs with priority ordering (high -> medium -> low)
+- Mark smoke test subset (TCs that cover critical paths in minimal time)
+- Include coverage map cross-referencing TCs to feature sub-areas
+- Calculate total estimated execution time
+- Include `commit` in overview with current commit hash
+
+### Step 7: Write Files & Update Artifacts
+
+1. **Determine output directory**:
+   - Target: `.rpiv/test-cases/{feature-slug}/` in the current working directory
+   - Feature slug: from _meta.md (when available) or kebab-case from feature name
+   - Create the directory if it doesn't exist
+
+2. **Write all files at once** using the Write tool:
+   - Individual TC files: `TC-{MOD}-{NNN}_{flow-slug}.md`
+   - Regression suite: `_regression-suite.md` (underscore prefix sorts it first)
+   - Do NOT ask for confirmation before each file — batch mode
+
+3. **Update _meta.md** (when it exists):
+   - Set `tc_count` to the number of TCs written
+   - Set `status` to `generated`
+   - Update `date` to current date
+   - Append new checkpoint Q&A pairs to `## Checkpoint History` under a new date header — only if new Q&A occurred during Step 5
+
+4. **Rebuild root coverage map** at `.rpiv/test-cases/_coverage-map.md`:
+   - Read the coverage map template at `templates/coverage-map.md`
+   - Glob for all `_regression-suite.md` files across `.rpiv/test-cases/*/`
+   - Glob for all `_meta.md` files across `.rpiv/test-cases/*/`
+   - Read each file's key data (frontmatter, summary stats, coverage map, smoke subset)
+   - Aggregate into the coverage map template
+   - Write the file (if only one feature exists, the map shows just that feature — it grows over time)
+
+5. **Present summary**:
+   ```
+   ## Test Cases Written
+
+   | File | Priority | Flow |
+   |------|----------|------|
+   | TC-ORD-001_place-order.md | high | Place and confirm order |
+   | TC-ORD-002_cancel-order.md | medium | Cancel order before fulfillment |
+   | _regression-suite.md | — | Feature summary (N TCs, ~Xm execution) |
+   | _coverage-map.md | — | Project-wide coverage (N features, M TCs) |
+
+   Output: `.rpiv/test-cases/{feature-slug}/`
+   Total: {N} test cases + 1 regression suite + 1 coverage map
+
+   Review the generated test cases and let me know if you'd like adjustments.
+   ```
+
+### Step 8: Handle Follow-ups
+
+- **Append, never rewrite.** Edit specific TC files directly; preserve TC IDs (continue numbering from the highest existing ID when adding).
+- **Re-dispatch narrowly.** Spawn one targeted `codebase-analyzer` for missing flows. Do NOT re-run the full skill.
+- **Regenerate suites on any TC change.** Always regenerate `_regression-suite.md` and `_coverage-map.md` to keep them in sync.
+- **When to re-invoke instead.** Re-run `/skill:write-test-cases <feature>` for a different feature; for the same feature, prefer in-place edits. The previous block's `Next step:` stays valid.
+
+Skill-specific verbs:
+- **Add missing flows**: spawn targeted `codebase-analyzer`, generate new TCs, regenerate suites.
+- **Adjust priorities**: edit TC frontmatter, regenerate suites.
+- **Modify steps**: edit specific TC files directly.
+- **Delete TCs**: remove the file, regenerate suites.
+
+## Framework Detection Reference
+
+| Indicator | Framework | Detection |
+|-----------|-----------|-----------|
+| `@angular/core` | Angular | `package.json` dependencies |
+| `react-router-dom` / `react-router` / `@react-router` | React | `package.json` dependencies |
+| `next` | Next.js | `package.json` dependencies |
+| `vue-router` | Vue Router | `package.json` dependencies |
+| `nuxt` | Nuxt | `package.json` dependencies |
+| `.csproj` / `.sln` | .NET | File presence in project root |
+| `pyproject.toml` / `requirements.txt` with Django/Flask/FastAPI | Python | File presence + dependency check |
+| None found | Backend-only | Fallback — omit framework hint |
+
+## Important Notes
+
+- **Manual test cases for QA teams** — NOT automated test code. Write in natural language from the user's perspective.
+- **Flow-level granularity** — each TC covers a complete user journey, not a single endpoint.
+- **Postconditions are critical** — side effects from domain events are what distinguish a thorough TC from a superficial one.
+- **Never skip the developer checkpoint** — QA domain knowledge (which flows matter most, what edge cases exist in production) is the highest-value signal.
+- **_meta.md is warm start, not truth** — always validate against live code via Agent A, even with _meta.md available.
+- **File reading**: Always read templates FULLY (no limit/offset) before generating test cases.
+- **Critical ordering**: Follow the numbered steps exactly.
+  - ALWAYS wait for discovery agents (Step 2) before spawning analysis agents (Step 3)
+  - ALWAYS wait for ALL agents to complete before synthesizing (Step 4)
+  - ALWAYS resolve all checkpoint questions (Step 5) before generating TCs (Step 6)
+  - ALWAYS regenerate regression suite and coverage map after any TC writes (Step 7)
+  - NEVER write test case files with placeholder values
+- **Duplicate avoidance**: Always check existing TCs via test-case-locator before generating new ones.
+- **ID continuity**: If existing TCs exist for this module, continue numbering from the highest existing ID.
--- a/extensions/rpiv-pi/skills/write-test-cases/examples/customer-auth-flow.md
+++ b/extensions/rpiv-pi/skills/write-test-cases/examples/customer-auth-flow.md
@@ -0,0 +1,50 @@
+---
+id: TC-AUTH-001
+title: "Customer magic-link login"
+feature: "Customer Authentication"
+priority: high
+type: functional
+status: draft
+tags: ["auth", "login", "magic-link", "customer-portal", "happy-path"]
+commit: abc1234
+---
+
+# Customer magic-link login
+
+## Objective
+Verify that a customer can request a magic-link login email, click the link, and be authenticated into the Customer Portal with the correct session and permissions.
+
+## Preconditions
+- Customer account exists with email "test@example.com"
+- Email delivery service is configured in test mode
+- Customer is NOT currently logged in
+- No active sessions exist for this customer
+
+## Steps
+| # | Action | Expected Result |
+|---|--------|-----------------|
+| 1 | Navigate to Customer Portal login page | Login form displays with email field and "Send Magic Link" button |
+| 2 | Enter "test@example.com" in email field | Email field validates format, no error shown |
+| 3 | Click "Send Magic Link" | Success message: "Check your email for a login link". Button disabled for 60s |
+| 4 | Open email inbox and find magic-link email | Email received within 2 minutes with one-time login URL |
+| 5 | Click the magic-link URL in the email | Browser opens, brief loading state, redirects to Customer Portal dashboard |
+| 6 | Verify dashboard displays correctly | Customer name in header, recent orders listed, subscription status visible |
+| 7 | Refresh the page | Session persists — dashboard still shows, not redirected to login |
+
+## Postconditions
+- Session token created and stored (verify via browser cookies/localStorage)
+- Login event recorded in audit log with timestamp, IP address, and auth method "magic-link"
+- Magic link marked as used — clicking same link again shows "Link expired" page
+- Last login timestamp updated on customer record
+
+## Edge Cases
+- Expired magic link (>15 minutes old) — verify "Link expired, request a new one" message
+- Already-used magic link — verify "Link already used" message
+- Non-existent email address — verify same success message shown (no email enumeration)
+- Multiple magic links requested — verify only the most recent link works
+- Magic link opened in different browser/device — verify it still works
+
+## Notes
+- Related TCs: TC-AUTH-002 (logout), TC-AUTH-003 (session expiry)
+- Dependencies: Email delivery service in test mode, ability to inspect test emails
+- Known issues: Magic link emails may be delayed up to 2 minutes in test environments
--- a/extensions/rpiv-pi/skills/write-test-cases/examples/order-management-suite.md
+++ b/extensions/rpiv-pi/skills/write-test-cases/examples/order-management-suite.md
@@ -0,0 +1,57 @@
+# Order Management — Regression Suite
+
+## Overview
+- Feature: Order Management
+- Module: ORD
+- Total test cases: 6
+- Estimated execution: ~35 minutes
+- Last generated: 2026-03-31
+- Commit: abc1234
+
+## Smoke Test Subset
+| Priority | TC ID | Title | Est. Time |
+|----------|-------|-------|-----------|
+| high | TC-ORD-001 | Place order with physical products | ~5m |
+| high | TC-ORD-004 | Process full refund | ~5m |
+
+**Smoke total: ~10 minutes**
+
+## Full Regression
+
+### High Priority
+| TC ID | Title | Type | Est. Time |
+|-------|-------|------|-----------|
+| TC-ORD-001 | Place order with physical products | functional | ~5m |
+| TC-ORD-003 | Fulfill order and trigger shipping | functional | ~8m |
+| TC-ORD-004 | Process full refund | functional | ~5m |
+
+### Medium Priority
+| TC ID | Title | Type | Est. Time |
+|-------|-------|------|-----------|
+| TC-ORD-002 | Cancel order before fulfillment | functional | ~5m |
+| TC-ORD-005 | Admin edits order line items | functional | ~5m |
+
+### Low Priority
+| TC ID | Title | Type | Est. Time |
+|-------|-------|------|-----------|
+| TC-ORD-006 | Filter and search order list | regression | ~3m |
+
+**Full regression total: ~31 minutes**
+
+## Coverage Map
+| Area | TCs Covering |
+|------|-------------|
+| Order Creation | TC-ORD-001 |
+| Order Cancellation | TC-ORD-002 |
+| Fulfillment | TC-ORD-003 |
+| Refunds | TC-ORD-004 |
+| Order Editing | TC-ORD-005 |
+| Order Listing/Search | TC-ORD-006 |
+| Payment Processing | TC-ORD-001, TC-ORD-004 |
+| Email Notifications | TC-ORD-001, TC-ORD-003, TC-ORD-004 |
+| Inventory Updates | TC-ORD-001, TC-ORD-003, TC-ORD-004 |
+
+## Gaps
+- Bulk order import — no TC generated, feature not yet implemented
+- Partial refund flow — deferred, pending UX design for line-item selection
+- Order export to CSV — low priority, cosmetic feature
--- a/extensions/rpiv-pi/skills/write-test-cases/examples/order-placement-flow.md
+++ b/extensions/rpiv-pi/skills/write-test-cases/examples/order-placement-flow.md
@@ -0,0 +1,54 @@
+---
+id: TC-ORD-001
+title: "Place order with physical products"
+feature: "Order Management"
+priority: high
+type: functional
+status: draft
+tags: ["orders", "checkout", "payment", "happy-path"]
+commit: abc1234
+---
+
+# Place order with physical products
+
+## Objective
+Verify that a customer can browse products, add them to cart, complete checkout with a valid credit card, and receive an order confirmation. This is the primary revenue-generating flow.
+
+## Preconditions
+- Customer account exists with verified email
+- At least 2 physical products are published with available inventory
+- Stripe test mode is configured with valid API keys
+- Customer is logged into the Customer Portal
+
+## Steps
+| # | Action | Expected Result |
+|---|--------|-----------------|
+| 1 | Navigate to product catalog page | Product listing displays with prices and availability |
+| 2 | Click "Add to Cart" on first product | Cart badge updates to show 1 item, toast confirms addition |
+| 3 | Click "Add to Cart" on second product | Cart badge updates to 2 items |
+| 4 | Click cart icon in navigation header | Cart drawer slides open showing both products with quantities and subtotal |
+| 5 | Click "Proceed to Checkout" | Checkout page loads with shipping address form |
+| 6 | Enter valid shipping address and select shipping method | Shipping cost calculates and order total updates |
+| 7 | Enter valid test credit card (4242 4242 4242 4242) | Card field shows validated state with card brand icon |
+| 8 | Click "Place Order" | Loading spinner appears, then redirects to order confirmation page |
+| 9 | Verify order confirmation page | Order number displayed, line items match cart, total matches checkout |
+
+## Postconditions
+- Order record created in database with status "pending_fulfillment"
+- Order confirmation email sent to customer's email address
+- Inventory quantity decremented for both purchased products
+- Payment charge captured in Stripe (verify in Stripe dashboard)
+- Webhook dispatched to fulfillment service with order details
+- Audit log entry created with action "order.created" and customer ID
+
+## Edge Cases
+- Order with quantity > 1 of same product — verify inventory deducts correct amount
+- Order with product at maximum inventory — verify "last item" handling
+- Payment gateway timeout — verify order is not created, customer sees retry option
+- Browser back button during payment processing — verify no duplicate charges
+- Coupon code applied at checkout — verify discount reflected in total and payment
+
+## Notes
+- Related TCs: TC-ORD-002 (cancel order), TC-ORD-003 (refund order)
+- Dependencies: Stripe test environment, fulfillment webhook endpoint
+- Known issues: Intermittent Stripe webhook delay (up to 30s) may affect postcondition verification
--- a/extensions/rpiv-pi/skills/write-test-cases/examples/team-management-flow.md
+++ b/extensions/rpiv-pi/skills/write-test-cases/examples/team-management-flow.md
@@ -0,0 +1,56 @@
+---
+id: TC-TEAM-001
+title: "Invite and onboard new team member"
+feature: "Team Management"
+priority: high
+type: functional
+status: draft
+tags: ["team", "invitation", "onboarding", "roles", "happy-path"]
+commit: abc1234
+---
+
+# Invite and onboard new team member
+
+## Objective
+Verify that a workspace admin can invite a new team member by email, the invitee receives an invitation, and upon accepting they gain access to the workspace with the assigned role and permissions.
+
+## Preconditions
+- Workspace exists with at least 1 admin user
+- Admin user is logged into the workspace Settings area
+- Invitation email service is configured in test mode
+- Target email address ("newmember@example.com") is not already a workspace member
+- Workspace is not at member limit
+
+## Steps
+| # | Action | Expected Result |
+|---|--------|-----------------|
+| 1 | Navigate to Settings > Team Members page | Team members list displays with current members and their roles |
+| 2 | Click "Invite Member" button | Invitation form appears with email field and role dropdown |
+| 3 | Enter "newmember@example.com" in email field | Email field validates format, no error shown |
+| 4 | Select "Editor" from role dropdown | Role selection highlights "Editor" with permission summary tooltip |
+| 5 | Click "Send Invitation" | Success toast: "Invitation sent to newmember@example.com". Member appears in list with status "Invited" |
+| 6 | Open invitee's email inbox | Invitation email received with workspace name and "Accept Invitation" button |
+| 7 | Click "Accept Invitation" link in email | Browser opens account creation page (or login page if account exists) |
+| 8 | Complete account creation with name and password | Account created, redirects to workspace dashboard |
+| 9 | Verify workspace dashboard access | Dashboard loads with workspace content visible, "Editor" badge in profile menu |
+| 10 | Return to admin's Team Members page | New member shows status "Active" with role "Editor" |
+
+## Postconditions
+- Invitation record created with status "accepted" and acceptance timestamp
+- New user account linked to workspace with "Editor" role
+- Invitation email marked as used — re-clicking link shows "Already accepted" message
+- Audit log entry created with action "team.member_invited" (admin) and "team.invitation_accepted" (invitee)
+- Workspace member count incremented by 1
+- Welcome notification sent to new member (in-app)
+
+## Edge Cases
+- Invite email already associated with an existing account — verify login flow instead of signup
+- Invite with "Admin" role — verify admin permissions granted after acceptance
+- Re-invite after previous invitation expired — verify new invitation supersedes old
+- Invite when workspace is at member limit — verify error message shown before sending
+- Invited user closes browser mid-signup and returns via link later — verify flow resumes
+
+## Notes
+- Related TCs: TC-TEAM-002 (change member role), TC-TEAM-003 (deactivate member)
+- Dependencies: Email delivery service in test mode, invitation token service
+- Known issues: Invitation emails may take up to 1 minute in test environments
--- a/extensions/rpiv-pi/skills/write-test-cases/examples/team-management-suite.md
+++ b/extensions/rpiv-pi/skills/write-test-cases/examples/team-management-suite.md
@@ -0,0 +1,54 @@
+# Team Management — Regression Suite
+
+## Overview
+- Feature: Team Management
+- Module: TEAM
+- Total test cases: 5
+- Estimated execution: ~28 minutes
+- Last generated: 2026-04-01
+- Commit: abc1234
+
+## Smoke Test Subset
+| Priority | TC ID | Title | Est. Time |
+|----------|-------|-------|-----------|
+| high | TC-TEAM-001 | Invite and onboard new team member | ~5m |
+| high | TC-TEAM-003 | Deactivate team member | ~5m |
+
+**Smoke total: ~10 minutes**
+
+## Full Regression
+
+### High Priority
+| TC ID | Title | Type | Est. Time |
+|-------|-------|------|-----------|
+| TC-TEAM-001 | Invite and onboard new team member | functional | ~5m |
+| TC-TEAM-003 | Deactivate team member | functional | ~5m |
+
+### Medium Priority
+| TC ID | Title | Type | Est. Time |
+|-------|-------|------|-----------|
+| TC-TEAM-002 | Change member role | functional | ~5m |
+| TC-TEAM-004 | Manage team member permissions | functional | ~5m |
+
+### Low Priority
+| TC ID | Title | Type | Est. Time |
+|-------|-------|------|-----------|
+| TC-TEAM-005 | Filter and search team member list | regression | ~3m |
+
+**Full regression total: ~23 minutes**
+
+## Coverage Map
+| Area | TCs Covering |
+|------|-------------|
+| Invitation Flow | TC-TEAM-001 |
+| Role Management | TC-TEAM-002 |
+| Member Deactivation | TC-TEAM-003 |
+| Permission Configuration | TC-TEAM-002, TC-TEAM-004 |
+| Member Listing/Search | TC-TEAM-005 |
+| Audit Logging | TC-TEAM-001, TC-TEAM-003 |
+| Email Notifications | TC-TEAM-001, TC-TEAM-003 |
+
+## Gaps
+- Bulk member import via CSV — feature exists but UI is in beta, deferred
+- SSO/SAML integration — separate authentication feature, not team management
+- Member activity reporting — read-only dashboard, low testing value
--- a/extensions/rpiv-pi/skills/write-test-cases/templates/coverage-map.md
+++ b/extensions/rpiv-pi/skills/write-test-cases/templates/coverage-map.md
@@ -0,0 +1,64 @@
+```markdown
+# {Project Name} — Test Case Coverage Map
+
+## Overview
+- Project: {project name}
+- Total features: {N} covered
+- Total test cases: {N}
+- Estimated full regression: ~{X} minutes
+- Last updated: {YYYY-MM-DD}
+- Commit: {commit-hash}
+
+## Portal Summary
+
+### {Portal Name} ({N} features, {M} TCs, ~{X}m)
+| Feature | Module | TCs | High | Med | Low | Smoke | Est. Time |
+|---------|--------|-----|------|-----|-----|-------|-----------|
+| {Feature Name} | {MOD} | {N} | {h} | {m} | {l} | {smoke count} | ~{X}m |
+
+## Project-Wide Smoke Suite
+{Minimum TCs across ALL features for quick project-level sanity check}
+
+| Portal | TC ID | Feature | Title | Est. Time |
+|--------|-------|---------|-------|-----------|
+| {portal} | TC-{MOD}-{NNN} | {feature} | {title} | ~{N}m |
+
+**Project smoke total: ~{X} minutes**
+
+## Cross-Feature Coverage
+{Areas that span multiple features — verify these when cross-cutting changes are made}
+
+| Cross-Cutting Area | Features Involved | TCs Covering |
+|-------------------|-------------------|-------------|
+| {e.g., Payment Processing} | {Order Mgmt, Invoice Mgmt} | TC-ORD-001, TC-INV-003 |
+
+## Priority Distribution
+| Priority | Count | Percentage |
+|----------|-------|-----------|
+| High | {N} | {X}% |
+| Medium | {N} | {X}% |
+| Low | {N} | {X}% |
+
+## Uncovered Areas
+{Features or sub-areas without test cases — flagged for future work}
+- {uncovered area} — {reason: not yet generated / out of scope / deferred}
+
+## Phantom Features (Backend-Only)
+{Backend endpoints with no frontend exposure — skipped during generation. Populated from _meta.md data when available.}
+- {controller/endpoint group} — {reason: platform API / webhook / deprecated / sub-service}
+
+## Test Data Requirements
+{Aggregate test data needs across all features. Populated from _meta.md Test Data Requirements sections when available.}
+- {e.g., "Stripe test mode with valid API keys (Order Mgmt, Invoice Mgmt)"}
+- {e.g., "At least 2 published products with inventory (Order Mgmt, Product Mgmt)"}
+```
+
+**Portal Summary** groups features by application/portal for QA team assignment. Each portal section includes aggregate stats. Portal names come from `_meta.md` `portal` field when available, or default to "General" when features were generated in standalone mode.
+
+**Project-Wide Smoke Suite** selects the highest-priority TCs from each feature's smoke subset — typically 1-2 per feature. A QA tester should be able to run the project smoke suite in under 30 minutes.
+
+**Cross-Feature Coverage** identifies shared concerns (payment, email, auth, inventory) and which TCs from different features exercise them. Useful when a cross-cutting change is made — QA knows exactly which TCs to re-run. Built by scanning postconditions across all regression suites for shared keywords.
+
+**Phantom Features** documents what was NOT covered and why. Populated from `_meta.md` data (pipeline mode). In standalone mode, populated from phantom detection results. If no phantom data is available, omit this section.
+
+**Test Data Requirements** consolidates prerequisites across all features so QA can set up a test environment once. Populated from `_meta.md` `## Test Data Requirements` sections. If no _meta.md data is available, omit this section.
--- a/extensions/rpiv-pi/skills/write-test-cases/templates/regression-suite.md
+++ b/extensions/rpiv-pi/skills/write-test-cases/templates/regression-suite.md
@@ -0,0 +1,63 @@
+```markdown
+# {Feature Name} — Regression Suite
+
+## Overview
+- Feature: {feature name}
+- Module: {module abbreviation}
+- Total test cases: {N}
+- Estimated execution: ~{X} minutes
+- Last updated: {YYYY-MM-DD}
+- Commit: {commit-hash}
+
+## Smoke Test Subset
+{Minimum set of TCs that cover critical paths — run these for quick sanity checks}
+
+| Priority | TC ID | Title | Est. Time |
+|----------|-------|-------|-----------|
+| high | TC-{MOD}-{NNN} | {title} | ~{N}m |
+
+**Smoke total: ~{X} minutes**
+
+## Full Regression
+
+### High Priority
+| TC ID | Title | Type | Est. Time |
+|-------|-------|------|-----------|
+| TC-{MOD}-{NNN} | {title} | {type} | ~{N}m |
+
+### Medium Priority
+| TC ID | Title | Type | Est. Time |
+|-------|-------|------|-----------|
+| TC-{MOD}-{NNN} | {title} | {type} | ~{N}m |
+
+### Low Priority
+| TC ID | Title | Type | Est. Time |
+|-------|-------|------|-----------|
+| TC-{MOD}-{NNN} | {title} | {type} | ~{N}m |
+
+**Full regression total: ~{X} minutes**
+
+## Coverage Map
+{Which areas of the feature each TC exercises}
+
+| Area | TCs Covering |
+|------|-------------|
+| {sub-area} | TC-{MOD}-001, TC-{MOD}-003 |
+
+## Gaps
+{Areas of the feature NOT covered by any test case — flagged for future work}
+- {uncovered area — reason}
+```
+
+**Smoke test subset** picks TCs that cover the highest-risk paths in minimum time. Typically 2-4 TCs per feature. A QA tester should be able to run the smoke suite in under 15 minutes.
+
+**Execution time estimates** based on step count:
+- Simple flow (3-5 steps): ~3 minutes
+- Medium flow (6-10 steps): ~5 minutes
+- Complex flow (11+ steps): ~8-10 minutes
+
+**Coverage map** cross-references TCs against feature sub-areas. Helps QA identify which TCs to re-run when a specific area changes. Sub-areas are derived from Web Layer entry points discovered during analysis.
+
+**Gaps section** flags areas the skill identified but chose not to generate TCs for — either explicitly excluded during checkpoint or insufficient code detail for generation.
+
+**Commit** tracks which code version was analyzed. Compare against current HEAD to detect when regression suite may be stale.
--- a/extensions/rpiv-pi/skills/write-test-cases/templates/test-case.md
+++ b/extensions/rpiv-pi/skills/write-test-cases/templates/test-case.md
@@ -0,0 +1,65 @@
+```markdown
+---
+id: TC-{MODULE}-{NNN}
+title: "{flow description}"
+feature: "{feature name}"
+priority: high|medium|low
+type: functional|regression|smoke|e2e|edge-case
+status: draft
+tags: ["{tag1}", "{tag2}"]
+commit: {commit-hash}
+---
+
+# {Title}
+
+## Objective
+{What this test verifies — 1-2 sentences describing the user goal and what the test proves}
+
+## Preconditions
+- {User role and permissions required}
+- {System state required before starting — e.g., "at least one product exists in catalog"}
+- {Test data requirements — e.g., "valid credit card in Stripe test mode"}
+- {Navigation starting point — e.g., "user is logged into Admin portal"}
+
+## Steps
+| # | Action | Expected Result |
+|---|--------|-----------------|
+| 1 | {user action — Navigate to, Click, Enter, Select, Submit} | {observable outcome — page loads, form appears, message displays} |
+| 2 | {next user action} | {what user sees or what changes} |
+| 3 | {next user action} | {confirmation, redirect, updated state} |
+
+## Postconditions
+{Side effects to verify AFTER the flow completes — sourced from domain events and integration points}
+- {e.g., "Order confirmation email sent to customer email address"}
+- {e.g., "Inventory quantity decremented for purchased items"}
+- {e.g., "Audit log entry created with action 'order.created'"}
+
+## Edge Cases
+{Variant scenarios worth separate attention — each could become its own TC if important enough}
+- {e.g., "Order with mixed digital and physical products"}
+- {e.g., "Payment fails after order created — verify rollback"}
+
+## Notes
+- Related TCs: {cross-references to other TCs in this module}
+- Dependencies: {external system dependencies — payment gateway, email service}
+- Known issues: {documented bugs or limitations affecting this flow}
+```
+
+**Frontmatter fields** align with what `test-case-locator` greps for (`id`, `title`, `priority`, `status`, `type`, `tags`). Always populate all fields — the locator agent extracts them for coverage reporting. The `commit` field tracks which code version was analyzed to produce this TC — used for staleness detection on regeneration.
+
+**Steps table rules:**
+- Actions use imperative verbs from the user's perspective: Navigate, Click, Enter, Select, Submit, Drag, Upload, Scroll
+- Expected results describe what the user OBSERVES — visible UI changes, messages, redirects — not internal state
+- Keep each row to one atomic action. Multi-step actions (fill form -> submit) split into separate rows
+- Number steps sequentially — branching flows (if X then Y) become separate TCs
+
+**Postconditions sourced from:**
+- Domain events (e.g., `OrderCreatedEvent` -> "confirmation email sent")
+- Message handlers (e.g., `InventoryReservationHandler` -> "inventory reserved")
+- Webhook dispatches (e.g., `FulfillmentWebhookPublisher` -> "fulfillment notified")
+- Audit log entries, cache invalidations, CRM syncs
+
+**Priority definitions:**
+- **high**: Core happy path, payment/money flows, data integrity, security-critical
+- **medium**: Alternative paths, common edge cases, permission boundaries
+- **low**: Rare edge cases, cosmetic validation, error message wording