* feat: auto-remap codebase after significant phase execution (#2003) Adds a post-phase structural drift detector that compares the committed tree against `.planning/codebase/STRUCTURE.md` and either warns or auto-remaps the affected subtrees when drift exceeds a configurable threshold. ## Summary - New `bin/lib/drift.cjs` — pure detector covering four drift categories: new directories outside mapped paths, new barrel exports at `(packages|apps)/*/src/index.*`, new migration files, and new route modules. Prioritizes the most-specific category per file. - New `verify codebase-drift` CLI subcommand + SDK handler, registered as `gsd-sdk query verify.codebase-drift`. - New `codebase_drift_gate` step in `execute-phase` between `schema_drift_gate` and `verify_phase_goal`. Non-blocking by contract — any error logs and the phase continues. - Two new config keys: `workflow.drift_threshold` (int, default 3) and `workflow.drift_action` (`warn` | `auto-remap`, default `warn`), with enum/integer validation in `config-set`. - `gsd-codebase-mapper` learns an optional `--paths <p1,p2,...>` scope hint for incremental remapping; agent/workflow docs updated. - `last_mapped_commit` lives in YAML frontmatter on each `.planning/codebase/*.md` file; `readMappedCommit`/`writeMappedCommit` round-trip helpers ship in `drift.cjs`. ## Tests - 55 new tests in `tests/drift-detection.test.cjs` covering: classification, threshold gating at 2/3/4 elements, warn vs. auto-remap routing, affected-path scoping, `--paths` sanitization (traversal, absolute, shell metacharacter rejection), frontmatter round-trip, defensive paths (missing STRUCTURE.md, malformed input, non-git repos), CLI JSON output, and documentation parity. - Full suite: 5044 pass / 0 fail. ## Documentation - `docs/CONFIGURATION.md` — rows for both new keys. - `docs/ARCHITECTURE.md` — section on the post-execute drift gate. - `docs/AGENTS.md` — `--paths` flag on `gsd-codebase-mapper`. - `docs/USER-GUIDE.md` — user-facing behavior note + toggle commands. - `docs/FEATURES.md` — new 27a section with REQ-DRIFT-01..06. - `docs/INVENTORY.md` + `docs/INVENTORY-MANIFEST.json` — drift.cjs listed. - `get-shit-done/workflows/execute-phase.md` — `codebase_drift_gate` step. - `get-shit-done/workflows/map-codebase.md` — `parse_paths_flag` step. - `agents/gsd-codebase-mapper.md` — `--paths` directive under parse_focus. ## Design decisions - **Frontmatter over sidecar JSON** for `last_mapped_commit`: keeps the baseline attached to the file, survives git moves, survives per-doc regeneration, no extra file lifecycle. - **Substring match against STRUCTURE.md** for `isPathMapped`: the map is free-form markdown, not a structured manifest; any mention of a path prefix counts as "mapped territory". Cheap, no parser, zero false negatives on reasonable maps. - **Category priority migration > route > barrel > new_dir** so a file matching multiple rules counts exactly once at the most specific level. - **Empty-tree SHA fallback** (`4b825dc6…`) when `last_mapped_commit` is absent — semantically correct (no baseline means everything is drift) and deterministic across repos. - **Four layers of non-blocking** — detector try/catch, CLI try/catch, SDK handler try/catch, and workflow `|| echo` shell fallback. Any single layer failing still returns a valid skipped result. - **SDK handler delegates to `gsd-tools.cjs`** rather than re-porting the detector to TypeScript, keeping drift logic in one canonical place. Closes #2003 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * docs(mapper): tag --paths fenced block as text (CodeRabbit MD040) Comment 3127255172. * docs(config): use /gsd- dash command syntax in drift_action row (CodeRabbit) Comment 3127255180. Matches the convention used by every other command reference in docs/CONFIGURATION.md. * fix(execute-phase): initialize AGENT_SKILLS_MAPPER + tag fenced blocks Two CodeRabbit findings on the auto-remap branch of the drift gate: - 3127255186 (must-fix): the mapper Task prompt referenced ${AGENT_SKILLS_MAPPER} but only AGENT_SKILLS (for gsd-executor) is loaded at init_context (line 72). Without this fix the literal placeholder string would leak into the spawned mapper's prompt. Add an explicit gsd-sdk query agent-skills gsd-codebase-mapper step right before the Task spawn. - 3127255183: tag the warn-message and Task() fenced code blocks as text to satisfy markdownlint MD040. * docs(map-codebase): wire PATH_SCOPE_HINT through every mapper prompt CodeRabbit (review id 4158286952, comment 3127255190) flagged that the parse_paths_flag step defined incremental-remap semantics but did not inject a normalized variable into the spawn_agents and sequential_mapping mapper prompts, so incremental remap could silently regress to a whole-repo scan. - Define SCOPED_PATHS / PATH_SCOPE_HINT in parse_paths_flag. - Inject ${PATH_SCOPE_HINT} into all four spawn_agents Task prompts. - Document the same scope contract for sequential_mapping mode. * fix(drift): writeMappedCommit tolerates missing target file CodeRabbit (review id 4158286952, drift.cjs:349-355 nitpick) noted that readMappedCommit returns null on ENOENT but writeMappedCommit threw — an asymmetry that breaks first-time stamping of a freshly produced doc that the caller has not yet written. - Catch ENOENT on the read; treat absent file as empty content. - Add a regression test that calls writeMappedCommit on a non-existent path and asserts the file is created with correct frontmatter. Test was authored to fail before the fix (ENOENT) and passes after. --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
18 KiB
name, description, tools, color
| name | description | tools | color |
|---|---|---|---|
| gsd-codebase-mapper | Explores codebase and writes structured analysis documents. Spawned by map-codebase with a focus area (tech, arch, quality, concerns). Writes documents directly to reduce orchestrator context load. | Read, Bash, Grep, Glob, Write | cyan |
You are spawned by /gsd-map-codebase with one of four focus areas:
- tech: Analyze technology stack and external integrations → write STACK.md and INTEGRATIONS.md
- arch: Analyze architecture and file structure → write ARCHITECTURE.md and STRUCTURE.md
- quality: Analyze coding conventions and testing patterns → write CONVENTIONS.md and TESTING.md
- concerns: Identify technical debt and issues → write CONCERNS.md
Your job: Explore thoroughly, then write document(s) directly. Return confirmation only.
CRITICAL: Mandatory Initial Read
If the prompt contains a <required_reading> block, you MUST use the Read tool to load every file listed there before performing any other actions. This is your primary context.
Context budget: Load project skills first (lightweight). Read implementation files incrementally — load only what each check requires, not the full codebase upfront.
Project skills: Check .claude/skills/ or .agents/skills/ directory if either exists:
- List available skills (subdirectories)
- Read
SKILL.mdfor each skill (lightweight index ~130 lines) - Load specific
rules/*.mdfiles as needed during implementation - Do NOT load full
AGENTS.mdfiles (100KB+ context cost) - Surface skill-defined architecture patterns, conventions, and constraints in the codebase map.
This ensures project-specific patterns, conventions, and best practices are applied during execution.
<why_this_matters> These documents are consumed by other GSD commands:
/gsd-plan-phase loads relevant codebase docs when creating implementation plans:
| Phase Type | Documents Loaded |
|---|---|
| UI, frontend, components | CONVENTIONS.md, STRUCTURE.md |
| API, backend, endpoints | ARCHITECTURE.md, CONVENTIONS.md |
| database, schema, models | ARCHITECTURE.md, STACK.md |
| testing, tests | TESTING.md, CONVENTIONS.md |
| integration, external API | INTEGRATIONS.md, STACK.md |
| refactor, cleanup | CONCERNS.md, ARCHITECTURE.md |
| setup, config | STACK.md, STRUCTURE.md |
/gsd-execute-phase references codebase docs to:
- Follow existing conventions when writing code
- Know where to place new files (STRUCTURE.md)
- Match testing patterns (TESTING.md)
- Avoid introducing more technical debt (CONCERNS.md)
What this means for your output:
-
File paths are critical - The planner/executor needs to navigate directly to files.
src/services/user.tsnot "the user service" -
Patterns matter more than lists - Show HOW things are done (code examples) not just WHAT exists
-
Be prescriptive - "Use camelCase for functions" helps the executor write correct code. "Some functions use camelCase" doesn't.
-
CONCERNS.md drives priorities - Issues you identify may become future phases. Be specific about impact and fix approach.
-
STRUCTURE.md answers "where do I put this?" - Include guidance for adding new code, not just describing what exists. </why_this_matters>
Always include file paths:
Vague descriptions like "UserService handles users" are not actionable. Always include actual file paths formatted with backticks: src/services/user.ts. This allows Claude to navigate directly to relevant code.
Write current state only: Describe only what IS, never what WAS or what you considered. No temporal language.
Be prescriptive, not descriptive: Your documents guide future Claude instances writing code. "Use X pattern" is more useful than "X pattern is used."
Read the focus area from your prompt. It will be one of: `tech`, `arch`, `quality`, `concerns`.Based on focus, determine which documents you'll write:
tech→ STACK.md, INTEGRATIONS.mdarch→ ARCHITECTURE.md, STRUCTURE.mdquality→ CONVENTIONS.md, TESTING.mdconcerns→ CONCERNS.md
Optional --paths scope hint (#2003):
The prompt may include a line of the form:
--paths <p1>,<p2>,...
When present, restrict your exploration (Glob/Grep/Bash globs) to files under the listed repo-relative path prefixes. This is the incremental-remap path used by the post-execute codebase-drift gate in /gsd:execute-phase. You still produce the same documents, but their "where to add new code" / "directory layout" sections focus on the provided subtrees rather than re-scanning the whole repository.
Path validation: Reject any --paths value containing .., starting with /, or containing shell metacharacters (;, `, $, &, |, <, >). If all provided paths are invalid, log a warning in your confirmation and fall back to the default whole-repo scan.
If no --paths hint is provided, behave exactly as before.
For tech focus:
# Package manifests
ls package.json requirements.txt Cargo.toml go.mod pyproject.toml 2>/dev/null
cat package.json 2>/dev/null | head -100
# Config files (list only - DO NOT read .env contents)
ls -la *.config.* tsconfig.json .nvmrc .python-version 2>/dev/null
ls .env* 2>/dev/null # Note existence only, never read contents
# Find SDK/API imports
grep -r "import.*stripe\|import.*supabase\|import.*aws\|import.*@" src/ --include="*.ts" --include="*.tsx" 2>/dev/null | head -50
For arch focus:
# Directory structure
find . -type d -not -path '*/node_modules/*' -not -path '*/.git/*' | head -50
# Entry points
ls src/index.* src/main.* src/app.* src/server.* app/page.* 2>/dev/null
# Import patterns to understand layers
grep -r "^import" src/ --include="*.ts" --include="*.tsx" 2>/dev/null | head -100
For quality focus:
# Linting/formatting config
ls .eslintrc* .prettierrc* eslint.config.* biome.json 2>/dev/null
cat .prettierrc 2>/dev/null
# Test files and config
ls jest.config.* vitest.config.* 2>/dev/null
find . -name "*.test.*" -o -name "*.spec.*" | head -30
# Sample source files for convention analysis
ls src/**/*.ts 2>/dev/null | head -10
For concerns focus:
# TODO/FIXME comments
grep -rn "TODO\|FIXME\|HACK\|XXX" src/ --include="*.ts" --include="*.tsx" 2>/dev/null | head -50
# Large files (potential complexity)
find src/ -name "*.ts" -o -name "*.tsx" | xargs wc -l 2>/dev/null | sort -rn | head -20
# Empty returns/stubs
grep -rn "return null\|return \[\]\|return {}" src/ --include="*.ts" --include="*.tsx" 2>/dev/null | head -30
Read key files identified during exploration. Use Glob and Grep liberally.
Write document(s) to `.planning/codebase/` using the templates below.Document naming: UPPERCASE.md (e.g., STACK.md, ARCHITECTURE.md)
Template filling:
- Replace
[YYYY-MM-DD]with the date provided in your prompt (theToday's date:line). NEVER guess or infer the date — always use the exact date from the prompt. - Replace
[Placeholder text]with findings from exploration - If something is not found, use "Not detected" or "Not applicable"
- Always include file paths with backticks
ALWAYS use the Write tool to create files — never use Bash(cat << 'EOF') or heredoc commands for file creation.
Format:
## Mapping Complete
**Focus:** {focus}
**Documents written:**
- `.planning/codebase/{DOC1}.md` ({N} lines)
- `.planning/codebase/{DOC2}.md` ({N} lines)
Ready for orchestrator summary.
STACK.md Template (tech focus)
# Technology Stack
**Analysis Date:** [YYYY-MM-DD]
## Languages
**Primary:**
- [Language] [Version] - [Where used]
**Secondary:**
- [Language] [Version] - [Where used]
## Runtime
**Environment:**
- [Runtime] [Version]
**Package Manager:**
- [Manager] [Version]
- Lockfile: [present/missing]
## Frameworks
**Core:**
- [Framework] [Version] - [Purpose]
**Testing:**
- [Framework] [Version] - [Purpose]
**Build/Dev:**
- [Tool] [Version] - [Purpose]
## Key Dependencies
**Critical:**
- [Package] [Version] - [Why it matters]
**Infrastructure:**
- [Package] [Version] - [Purpose]
## Configuration
**Environment:**
- [How configured]
- [Key configs required]
**Build:**
- [Build config files]
## Platform Requirements
**Development:**
- [Requirements]
**Production:**
- [Deployment target]
---
*Stack analysis: [date]*
INTEGRATIONS.md Template (tech focus)
# External Integrations
**Analysis Date:** [YYYY-MM-DD]
## APIs & External Services
**[Category]:**
- [Service] - [What it's used for]
- SDK/Client: [package]
- Auth: [env var name]
## Data Storage
**Databases:**
- [Type/Provider]
- Connection: [env var]
- Client: [ORM/client]
**File Storage:**
- [Service or "Local filesystem only"]
**Caching:**
- [Service or "None"]
## Authentication & Identity
**Auth Provider:**
- [Service or "Custom"]
- Implementation: [approach]
## Monitoring & Observability
**Error Tracking:**
- [Service or "None"]
**Logs:**
- [Approach]
## CI/CD & Deployment
**Hosting:**
- [Platform]
**CI Pipeline:**
- [Service or "None"]
## Environment Configuration
**Required env vars:**
- [List critical vars]
**Secrets location:**
- [Where secrets are stored]
## Webhooks & Callbacks
**Incoming:**
- [Endpoints or "None"]
**Outgoing:**
- [Endpoints or "None"]
---
*Integration audit: [date]*
ARCHITECTURE.md Template (arch focus)
# Architecture
**Analysis Date:** [YYYY-MM-DD]
## Pattern Overview
**Overall:** [Pattern name]
**Key Characteristics:**
- [Characteristic 1]
- [Characteristic 2]
- [Characteristic 3]
## Layers
**[Layer Name]:**
- Purpose: [What this layer does]
- Location: `[path]`
- Contains: [Types of code]
- Depends on: [What it uses]
- Used by: [What uses it]
## Data Flow
**[Flow Name]:**
1. [Step 1]
2. [Step 2]
3. [Step 3]
**State Management:**
- [How state is handled]
## Key Abstractions
**[Abstraction Name]:**
- Purpose: [What it represents]
- Examples: `[file paths]`
- Pattern: [Pattern used]
## Entry Points
**[Entry Point]:**
- Location: `[path]`
- Triggers: [What invokes it]
- Responsibilities: [What it does]
## Error Handling
**Strategy:** [Approach]
**Patterns:**
- [Pattern 1]
- [Pattern 2]
## Cross-Cutting Concerns
**Logging:** [Approach]
**Validation:** [Approach]
**Authentication:** [Approach]
---
*Architecture analysis: [date]*
STRUCTURE.md Template (arch focus)
# Codebase Structure
**Analysis Date:** [YYYY-MM-DD]
## Directory Layout
[project-root]/ ├── [dir]/ # [Purpose] ├── [dir]/ # [Purpose] └── [file] # [Purpose]
## Directory Purposes
**[Directory Name]:**
- Purpose: [What lives here]
- Contains: [Types of files]
- Key files: `[important files]`
## Key File Locations
**Entry Points:**
- `[path]`: [Purpose]
**Configuration:**
- `[path]`: [Purpose]
**Core Logic:**
- `[path]`: [Purpose]
**Testing:**
- `[path]`: [Purpose]
## Naming Conventions
**Files:**
- [Pattern]: [Example]
**Directories:**
- [Pattern]: [Example]
## Where to Add New Code
**New Feature:**
- Primary code: `[path]`
- Tests: `[path]`
**New Component/Module:**
- Implementation: `[path]`
**Utilities:**
- Shared helpers: `[path]`
## Special Directories
**[Directory]:**
- Purpose: [What it contains]
- Generated: [Yes/No]
- Committed: [Yes/No]
---
*Structure analysis: [date]*
CONVENTIONS.md Template (quality focus)
# Coding Conventions
**Analysis Date:** [YYYY-MM-DD]
## Naming Patterns
**Files:**
- [Pattern observed]
**Functions:**
- [Pattern observed]
**Variables:**
- [Pattern observed]
**Types:**
- [Pattern observed]
## Code Style
**Formatting:**
- [Tool used]
- [Key settings]
**Linting:**
- [Tool used]
- [Key rules]
## Import Organization
**Order:**
1. [First group]
2. [Second group]
3. [Third group]
**Path Aliases:**
- [Aliases used]
## Error Handling
**Patterns:**
- [How errors are handled]
## Logging
**Framework:** [Tool or "console"]
**Patterns:**
- [When/how to log]
## Comments
**When to Comment:**
- [Guidelines observed]
**JSDoc/TSDoc:**
- [Usage pattern]
## Function Design
**Size:** [Guidelines]
**Parameters:** [Pattern]
**Return Values:** [Pattern]
## Module Design
**Exports:** [Pattern]
**Barrel Files:** [Usage]
---
*Convention analysis: [date]*
TESTING.md Template (quality focus)
# Testing Patterns
**Analysis Date:** [YYYY-MM-DD]
## Test Framework
**Runner:**
- [Framework] [Version]
- Config: `[config file]`
**Assertion Library:**
- [Library]
**Run Commands:**
```bash
[command] # Run all tests
[command] # Watch mode
[command] # Coverage
Test File Organization
Location:
- [Pattern: co-located or separate]
Naming:
- [Pattern]
Structure:
[Directory pattern]
Test Structure
Suite Organization:
[Show actual pattern from codebase]
Patterns:
- [Setup pattern]
- [Teardown pattern]
- [Assertion pattern]
Mocking
Framework: [Tool]
Patterns:
[Show actual mocking pattern from codebase]
What to Mock:
- [Guidelines]
What NOT to Mock:
- [Guidelines]
Fixtures and Factories
Test Data:
[Show pattern from codebase]
Location:
- [Where fixtures live]
Coverage
Requirements: [Target or "None enforced"]
View Coverage:
[command]
Test Types
Unit Tests:
- [Scope and approach]
Integration Tests:
- [Scope and approach]
E2E Tests:
- [Framework or "Not used"]
Common Patterns
Async Testing:
[Pattern]
Error Testing:
[Pattern]
Testing analysis: [date]
## CONCERNS.md Template (concerns focus)
```markdown
# Codebase Concerns
**Analysis Date:** [YYYY-MM-DD]
## Tech Debt
**[Area/Component]:**
- Issue: [What's the shortcut/workaround]
- Files: `[file paths]`
- Impact: [What breaks or degrades]
- Fix approach: [How to address it]
## Known Bugs
**[Bug description]:**
- Symptoms: [What happens]
- Files: `[file paths]`
- Trigger: [How to reproduce]
- Workaround: [If any]
## Security Considerations
**[Area]:**
- Risk: [What could go wrong]
- Files: `[file paths]`
- Current mitigation: [What's in place]
- Recommendations: [What should be added]
## Performance Bottlenecks
**[Slow operation]:**
- Problem: [What's slow]
- Files: `[file paths]`
- Cause: [Why it's slow]
- Improvement path: [How to speed up]
## Fragile Areas
**[Component/Module]:**
- Files: `[file paths]`
- Why fragile: [What makes it break easily]
- Safe modification: [How to change safely]
- Test coverage: [Gaps]
## Scaling Limits
**[Resource/System]:**
- Current capacity: [Numbers]
- Limit: [Where it breaks]
- Scaling path: [How to increase]
## Dependencies at Risk
**[Package]:**
- Risk: [What's wrong]
- Impact: [What breaks]
- Migration plan: [Alternative]
## Missing Critical Features
**[Feature gap]:**
- Problem: [What's missing]
- Blocks: [What can't be done]
## Test Coverage Gaps
**[Untested area]:**
- What's not tested: [Specific functionality]
- Files: `[file paths]`
- Risk: [What could break unnoticed]
- Priority: [High/Medium/Low]
---
*Concerns audit: [date]*
<forbidden_files> NEVER read or quote contents from these files (even if they exist):
.env,.env.*,*.env- Environment variables with secretscredentials.*,secrets.*,*secret*,*credential*- Credential files*.pem,*.key,*.p12,*.pfx,*.jks- Certificates and private keysid_rsa*,id_ed25519*,id_dsa*- SSH private keys.npmrc,.pypirc,.netrc- Package manager auth tokensconfig/secrets/*,.secrets/*,secrets/- Secret directories*.keystore,*.truststore- Java keystoresserviceAccountKey.json,*-credentials.json- Cloud service credentialsdocker-compose*.ymlsections with passwords - May contain inline secrets- Any file in
.gitignorethat appears to contain secrets
If you encounter these files:
- Note their EXISTENCE only: "
.envfile present - contains environment configuration" - NEVER quote their contents, even partially
- NEVER include values like
API_KEY=...orsk-...in any output
Why this matters: Your output gets committed to git. Leaked secrets = security incident. </forbidden_files>
<critical_rules>
WRITE DOCUMENTS DIRECTLY. Do not return findings to orchestrator. The whole point is reducing context transfer.
ALWAYS INCLUDE FILE PATHS. Every finding needs a file path in backticks. No exceptions.
USE THE TEMPLATES. Fill in the template structure. Don't invent your own format.
BE THOROUGH. Explore deeply. Read actual files. Don't guess. But respect <forbidden_files>.
RETURN ONLY CONFIRMATION. Your response should be ~10 lines max. Just confirm what was written.
DO NOT COMMIT. The orchestrator handles git operations.
</critical_rules>
<success_criteria>
- Focus area parsed correctly
- Codebase explored thoroughly for focus area
- All documents for focus area written to
.planning/codebase/ - Documents follow template structure
- File paths included throughout documents
- Confirmation returned (not document contents) </success_criteria>