mirror of https://github.com/glittercowboy/get-shit-done synced 2026-04-25 17:25:23 +02:00

Files

Rezolv 86fb9c85c3 docs(sdk): registry docs and gsd-sdk query call sites (#2302 Track B) (#2340 )

* feat(sdk): golden parity harness and query handler CJS alignment (#2302 Track A)

Golden/read-only parity tests and registry alignment, query handler fixes
(check-completion, state-mutation, commit, validate, summary, etc.), and
WAITING.json dual-write for .gsd/.planning readers.

Refs gsd-build/get-shit-done#2341

* fix(sdk): getMilestoneInfo matches GSD ROADMAP (🟡, last bold, STATE fallback)

- Recognize in-flight 🟡 milestone bullets like 🚧.
- Derive from last **vX.Y Title** before ## Phases when emoji absent.
- Fall back to STATE.md milestone when ROADMAP is missing; use last bare vX.Y
  in cleaned text instead of first (avoids v1.0 from shipped list).
- Fixes init.execute-phase milestone_version and buildStateFrontmatter after
  state.begin-phase (syncStateFrontmatter).

* feat(sdk): phase list, plan task structure, requirements extract handlers

- Register phase.list-plans, phase.list-artifacts, plan.task-structure,
  requirements.extract-from-plans (SDK-only; golden-policy exceptions).
- Add unit tests; document in QUERY-HANDLERS.md.
- writeProfile: honor --output, render dimensions, return profile_path and dimensions_scored.

* feat(sdk): centralize getGsdAgentsDir in query helpers

Extract agent directory resolution to helpers (GSD_AGENTS_DIR, primary
~/.claude/agents, legacy path). Use from init and docs-init init bundles.

docs(15): add 15-CONTEXT for autonomous phase-15 run.

* feat(sdk): query CLI CJS fallback and session correlation

- createRegistry(eventStream, sessionId) threads correlation into mutation events
- gsd-sdk query falls back to gsd-tools.cjs when no native handler matches
  (disable with GSD_QUERY_FALLBACK=off); stderr bridge warnings
- Export createRegistry from @gsd-build/sdk; add sdk/README.md
- Update QUERY-HANDLERS.md and registry module docs for fallback + sessionId
- Agents: prefer node dist/cli.js query over cat/grep for STATE and plans

* fix(sdk): init phase_found parity, docs-init agents path, state field extract

- Normalize findPhase not-found to null before roadmap fallback (matches findPhaseInternal)

- docs-init: use detectRuntime + resolveAgentsDir for checkAgentsInstalled

- state.cjs stateExtractField: horizontal whitespace only after colon (YAML progress guard)

- Tests: commit_docs default true; config-get golden uses temp config; golden integration green

Refs: #2302

* refactor(sdk): share SessionJsonlRecord in profile-extract-messages

CodeRabbit nit: dedupe JSONL record shape for isGenuineUserMessage and streamExtractMessages.

* fix(sdk): address CodeRabbit major threads (paths, gates, audit, verify)

- Resolve @file: and CLI JSON indirection relative to projectDir; guard empty normalized query command

- plan.task-structure + intel extract/patch-meta: resolvePathUnderProject containment

- check.config-gates: safe string booleans; plan_checker alias precedence over plan_check default

- state.validate/sync: phaseTokenMatches + comparePhaseNum ordering

- verify.schema-drift: token match phase dirs; files_modified from parsed frontmatter

- audit-open: has_scan_errors, unreadable rows, human report when scans fail

- requirements PLANNED key PLAN for root PLAN.md; gsd-tools timeout note

- ingest-docs: repo-root path containment; classifier output slug-hash

Golden parity test strips has_scan_errors until CJS adds field.

* fix: Resolve CodeRabbit security and quality findings
- Secure intel.ts and cli.ts against path traversal
- Catch and validate git add status in commit.ts
- Expand roadmap milestone marker extraction
- Fix parsing array-of-objects in frontmatter YAML
- Fix unhandled config evaluations
- Improve coverage test parity mapping

* docs(sdk): registry docs and gsd-sdk query call sites (#2302 Track B)

Update CHANGELOG, architecture and user guides, workflow call sites, and read-guard tests for gsd-sdk query; sync ARCHITECTURE.md command/workflow counts and directory-tree totals with the repo (80 commands, 77 workflows).

Address CodeRabbit: fix markdown tables and emphasis; align CLI-TOOLS GSDTools and state.read docs with implementation; correct roadmap handler name in universal-anti-patterns; resolve settings workflow config path without relying on config_path from state.load.

Refs gsd-build/get-shit-done#2340

* test: raise planner character extraction limit to 48K

* fix(sdk): resolve build TS error and doc conflict markers

2026-04-20 18:09:21 -04:00

34 KiB

Raw Blame History

GSD Architecture

System architecture for contributors and advanced users. For user-facing documentation, see Feature Reference or User Guide.

System Overview
Design Principles
Component Architecture
Agent Model
Data Flow
File System Layout
Installer Architecture
Hook System
CLI Tools Layer
Runtime Abstraction

System Overview

GSD is a meta-prompting framework that sits between the user and AI coding agents (Claude Code, Gemini CLI, OpenCode, Kilo, Codex, Copilot, Antigravity, Trae, Cline, Augment Code). It provides:

Context engineering — Structured artifacts that give the AI everything it needs per task
Multi-agent orchestration — Thin orchestrators that spawn specialized agents with fresh context windows
Spec-driven development — Requirements → research → plans → execution → verification pipeline
State management — Persistent project memory across sessions and context resets

┌──────────────────────────────────────────────────────┐
│                      USER                            │
│            /gsd-command [args]                        │
└─────────────────────┬────────────────────────────────┘
                      │
┌─────────────────────▼────────────────────────────────┐
│              COMMAND LAYER                            │
│   commands/gsd/*.md — Prompt-based command files      │
│   (Claude Code custom commands / Codex skills)        │
└─────────────────────┬────────────────────────────────┘
                      │
┌─────────────────────▼────────────────────────────────┐
│              WORKFLOW LAYER                           │
│   get-shit-done/workflows/*.md — Orchestration logic  │
│   (Reads references, spawns agents, manages state)    │
└──────┬──────────────┬─────────────────┬──────────────┘
       │              │                 │
┌──────▼──────┐ ┌─────▼─────┐ ┌────────▼───────┐
│  AGENT      │ │  AGENT    │ │  AGENT         │
│  (fresh     │ │  (fresh   │ │  (fresh        │
│   context)  │ │   context)│ │   context)     │
└──────┬──────┘ └─────┬─────┘ └────────┬───────┘
       │              │                 │
┌──────▼──────────────▼─────────────────▼──────────────┐
│              CLI TOOLS LAYER                          │
│   gsd-sdk query (sdk/src/query) + gsd-tools.cjs       │
│   (State, config, phase, roadmap, verify, templates)  │
└──────────────────────┬───────────────────────────────┘
                       │
┌──────────────────────▼───────────────────────────────┐
│              FILE SYSTEM (.planning/)                 │
│   PROJECT.md | REQUIREMENTS.md | ROADMAP.md          │
│   STATE.md | config.json | phases/ | research/       │
└──────────────────────────────────────────────────────┘

Design Principles

1. Fresh Context Per Agent

Every agent spawned by an orchestrator gets a clean context window (up to 200K tokens). This eliminates context rot — the quality degradation that happens as an AI fills its context window with accumulated conversation.

2. Thin Orchestrators

Workflow files (get-shit-done/workflows/*.md) never do heavy lifting. They:

Load context via gsd-sdk query init.<workflow> (or legacy gsd-tools.cjs init <workflow>)
Spawn specialized agents with focused prompts
Collect results and route to the next step
Update state between steps

3. File-Based State

All state lives in .planning/ as human-readable Markdown and JSON. No database, no server, no external dependencies. This means:

State survives context resets (/clear)
State is inspectable by both humans and agents
State can be committed to git for team visibility

4. Absent = Enabled

Workflow feature flags follow the absent = enabled pattern. If a key is missing from config.json, it defaults to true. Users explicitly disable features; they don't need to enable defaults.

5. Defense in Depth

Multiple layers prevent common failure modes:

Plans are verified before execution (plan-checker agent)
Execution produces atomic commits per task
Post-execution verification checks against phase goals
UAT provides human verification as final gate

Component Architecture

Commands (`commands/gsd/*.md`)

User-facing entry points. Each file contains YAML frontmatter (name, description, allowed-tools) and a prompt body that bootstraps the workflow. Commands are installed as:

Claude Code: Custom slash commands (/gsd-command-name)
OpenCode / Kilo: Slash commands (/gsd-command-name)
Codex: Skills ($gsd-command-name)
Copilot: Slash commands (/gsd-command-name)
Antigravity: Skills

Total commands: see docs/INVENTORY.md for the authoritative count and full roster.

Workflows (`get-shit-done/workflows/*.md`)

Orchestration logic that commands reference. Contains the step-by-step process including:

Context loading via gsd-sdk query init handlers (or legacy gsd-tools.cjs init)
Agent spawn instructions with model resolution
Gate/checkpoint definitions
State update patterns
Error handling and recovery

Total workflows: see docs/INVENTORY.md for the authoritative count and full roster.

Agents (`agents/*.md`)

Specialized agent definitions with frontmatter specifying:

name — Agent identifier
description — Role and purpose
tools — Allowed tool access (Read, Write, Edit, Bash, Grep, Glob, WebSearch, etc.)
color — Terminal output color for visual distinction

Total agents: 33

References (`get-shit-done/references/*.md`)

Shared knowledge documents that workflows and agents @-reference (see docs/INVENTORY.md for the authoritative count and full roster):

Core references:

checkpoints.md — Checkpoint type definitions and interaction patterns
gates.md — 4 canonical gate types (Confirm, Quality, Safety, Transition) wired into plan-checker and verifier
model-profiles.md — Per-agent model tier assignments
model-profile-resolution.md — Model resolution algorithm documentation
verification-patterns.md — How to verify different artifact types
verification-overrides.md — Per-artifact verification override rules
planning-config.md — Full config schema and behavior
git-integration.md — Git commit, branching, and history patterns
git-planning-commit.md — Planning directory commit conventions
questioning.md — Dream extraction philosophy for project initialization
tdd.md — Test-driven development integration patterns
ui-brand.md — Visual output formatting patterns
common-bug-patterns.md — Common bug patterns for code review and verification

Workflow references:

agent-contracts.md — Formal interface between orchestrators and agents
context-budget.md — Context window budget allocation rules
continuation-format.md — Session continuation/resume format
domain-probes.md — Domain-specific probing questions for discuss-phase
gate-prompts.md — Gate/checkpoint prompt templates
revision-loop.md — Plan revision iteration patterns
universal-anti-patterns.md — Common anti-patterns to detect and avoid
artifact-types.md — Planning artifact type definitions
phase-argument-parsing.md — Phase argument parsing conventions
decimal-phase-calculation.md — Decimal sub-phase numbering rules
workstream-flag.md — Workstream active pointer conventions
user-profiling.md — User behavioral profiling methodology
thinking-partner.md — Conditional thinking partner activation at decision points

Thinking model references:

References for integrating thinking-class models (o3, o4-mini, Gemini 2.5 Pro) into GSD workflows:

thinking-models-debug.md — Thinking model patterns for debugging workflows
thinking-models-execution.md — Thinking model patterns for execution agents
thinking-models-planning.md — Thinking model patterns for planning agents
thinking-models-research.md — Thinking model patterns for research agents
thinking-models-verification.md — Thinking model patterns for verification agents

Modular planner decomposition:

The planner agent (agents/gsd-planner.md) was decomposed from a single monolithic file into a core agent plus reference modules to stay under the 50K character limit imposed by some runtimes:

planner-gap-closure.md — Gap closure mode behavior (reads VERIFICATION.md, targeted replanning)
planner-reviews.md — Cross-AI review integration (reads REVIEWS.md from /gsd-review)
planner-revision.md — Plan revision patterns for iterative refinement

Templates (`get-shit-done/templates/`)

Markdown templates for all planning artifacts. Used by gsd-sdk query template.fill / phase.scaffold (and legacy gsd-tools.cjs template fill / top-level scaffold) to create pre-structured files:

project.md, requirements.md, roadmap.md, state.md — Core project files
phase-prompt.md — Phase execution prompt template
summary.md (+ summary-minimal.md, summary-standard.md, summary-complex.md) — Granularity-aware summary templates
DEBUG.md — Debug session tracking template
UI-SPEC.md, UAT.md, VALIDATION.md — Specialized verification templates
discussion-log.md — Discussion audit trail template
codebase/ — Brownfield mapping templates (stack, architecture, conventions, concerns, structure, testing, integrations)
research-project/ — Research output templates (SUMMARY, STACK, FEATURES, ARCHITECTURE, PITFALLS)

Hooks (`hooks/`)

Runtime hooks that integrate with the host AI agent:

Hook	Event	Purpose
`gsd-statusline.js`	`statusLine`	Displays model, task, directory, and context usage bar
`gsd-context-monitor.js`	`PostToolUse` / `AfterTool`	Injects agent-facing context warnings at 35%/25% remaining
`gsd-check-update.js`	`SessionStart`	Foreground trigger for the background update check
`gsd-check-update-worker.js`	(helper)	Background worker spawned by `gsd-check-update.js`; no direct event registration
`gsd-prompt-guard.js`	`PreToolUse`	Scans `.planning/` writes for prompt injection patterns (advisory)
`gsd-read-injection-scanner.js`	`PostToolUse`	Scans Read tool output for injected instructions in untrusted content
`gsd-workflow-guard.js`	`PreToolUse`	Detects file edits outside GSD workflow context (advisory, opt-in via `hooks.workflow_guard`)
`gsd-read-guard.js`	`PreToolUse`	Advisory guard preventing Edit/Write on files not yet read in the session
`gsd-session-state.sh`	`PostToolUse`	Session state tracking for shell-based runtimes
`gsd-validate-commit.sh`	`PostToolUse`	Commit validation for conventional commit enforcement
`gsd-phase-boundary.sh`	`PostToolUse`	Phase boundary detection for workflow transitions

See docs/INVENTORY.md for the authoritative 11-hook roster.

CLI Tools (`get-shit-done/bin/`)

Node.js CLI utility (gsd-tools.cjs) with domain modules split across get-shit-done/bin/lib/ (see docs/INVENTORY.md for the authoritative roster):

Module	Responsibility
`core.cjs`	Error handling, output formatting, shared utilities
`state.cjs`	STATE.md parsing, updating, progression, metrics
`phase.cjs`	Phase directory operations, decimal numbering, plan indexing
`roadmap.cjs`	ROADMAP.md parsing, phase extraction, plan progress
`config.cjs`	config.json read/write, section initialization
`verify.cjs`	Plan structure, phase completeness, reference, commit validation
`template.cjs`	Template selection and filling with variable substitution
`frontmatter.cjs`	YAML frontmatter CRUD operations
`init.cjs`	Compound context loading for each workflow type
`milestone.cjs`	Milestone archival, requirements marking
`commands.cjs`	Misc commands (slug, timestamp, todos, scaffolding, stats)
`model-profiles.cjs`	Model profile resolution table
`security.cjs`	Path traversal prevention, prompt injection detection, safe JSON parsing, shell argument validation
`uat.cjs`	UAT file parsing, verification debt tracking, audit-uat support
`docs.cjs`	Docs-update workflow init, Markdown scanning, monorepo detection
`workstream.cjs`	Workstream CRUD, migration, session-scoped active pointer
`schema-detect.cjs`	Schema-drift detection for ORM patterns (Prisma, Drizzle, etc.)
`profile-pipeline.cjs`	User behavioral profiling data pipeline, session file scanning
`profile-output.cjs`	Profile rendering, USER-PROFILE.md and dev-preferences.md generation

Agent Model

Orchestrator → Agent Pattern

Orchestrator (workflow .md)
    │
    ├── Load context: gsd-sdk query init.<workflow> <phase> (or legacy gsd-tools.cjs init)
    │   Returns JSON with: project info, config, state, phase details
    │
    ├── Resolve model: gsd-sdk query resolve-model <agent-name>
    │   Returns: opus | sonnet | haiku | inherit
    │
    ├── Spawn Agent (Task/SubAgent call)
    │   ├── Agent prompt (agents/*.md)
    │   ├── Context payload (init JSON)
    │   ├── Model assignment
    │   └── Tool permissions
    │
    ├── Collect result
    │
    └── Update state: gsd-sdk query state.update / state.patch / state.advance-plan (or legacy gsd-tools.cjs)

Primary Agent Spawn Categories

Conceptual spawn-pattern taxonomy for the 21 primary agents. For the authoritative 31-agent roster (including the 10 advanced/specialized agents such as gsd-pattern-mapper, gsd-code-reviewer, gsd-code-fixer, gsd-ai-researcher, gsd-domain-researcher, gsd-eval-planner, gsd-eval-auditor, gsd-framework-selector, gsd-debug-session-manager, gsd-intel-updater), see docs/INVENTORY.md.

Category	Agents	Parallelism
Researchers	gsd-project-researcher, gsd-phase-researcher, gsd-ui-researcher, gsd-advisor-researcher	4 parallel (stack, features, architecture, pitfalls); advisor spawns during discuss-phase
Synthesizers	gsd-research-synthesizer	Sequential (after researchers complete)
Planners	gsd-planner, gsd-roadmapper	Sequential
Checkers	gsd-plan-checker, gsd-integration-checker, gsd-ui-checker, gsd-nyquist-auditor	Sequential (verification loop, max 3 iterations)
Executors	gsd-executor	Parallel within waves, sequential across waves
Verifiers	gsd-verifier	Sequential (after all executors complete)
Mappers	gsd-codebase-mapper	4 parallel (tech, arch, quality, concerns)
Debuggers	gsd-debugger	Sequential (interactive)
Auditors	gsd-ui-auditor, gsd-security-auditor	Sequential
Doc Writers	gsd-doc-writer, gsd-doc-verifier	Sequential (writer then verifier)
Profilers	gsd-user-profiler	Sequential
Analyzers	gsd-assumptions-analyzer	Sequential (during discuss-phase)

Wave Execution Model

During execute-phase, plans are grouped into dependency waves:

Wave Analysis:
  Plan 01 (no deps)      ─┐
  Plan 02 (no deps)      ─┤── Wave 1 (parallel)
  Plan 03 (depends: 01)  ─┤── Wave 2 (waits for Wave 1)
  Plan 04 (depends: 02)  ─┘
  Plan 05 (depends: 03,04) ── Wave 3 (waits for Wave 2)

Each executor gets:

Fresh 200K context window (or up to 1M for models that support it)
The specific PLAN.md to execute
Project context (PROJECT.md, STATE.md)
Phase context (CONTEXT.md, RESEARCH.md if available)

Adaptive Context Enrichment (1M Models)

When the context window is 500K+ tokens (1M-class models like Opus 4.6, Sonnet 4.6), subagent prompts are automatically enriched with additional context that would not fit in standard 200K windows:

Executor agents receive prior wave SUMMARY.md files and the phase CONTEXT.md/RESEARCH.md, enabling cross-plan awareness within a phase
Verifier agents receive all PLAN.md, SUMMARY.md, CONTEXT.md files plus REQUIREMENTS.md, enabling history-aware verification

The orchestrator reads context_window from config (gsd-sdk query config-get context_window, or legacy gsd-tools.cjs config-get) and conditionally includes richer context when the value is >= 500,000. For standard 200K windows, prompts use truncated versions with cache-friendly ordering to maximize context efficiency.

Parallel Commit Safety

When multiple executors run within the same wave, two mechanisms prevent conflicts:

--no-verify commits — Parallel agents skip pre-commit hooks (which can cause build lock contention, e.g., cargo lock fights in Rust projects). The orchestrator runs git hook run pre-commit once after each wave completes.
STATE.md file locking — All writeStateMd() calls use lockfile-based mutual exclusion (STATE.md.lock with O_EXCL atomic creation). This prevents the read-modify-write race condition where two agents read STATE.md, modify different fields, and the last writer overwrites the other's changes. Includes stale lock detection (10s timeout) and spin-wait with jitter.

Data Flow

New Project Flow

User input (idea description)
    │
    ▼
Questions (questioning.md philosophy)
    │
    ▼
4x Project Researchers (parallel)
    ├── Stack → STACK.md
    ├── Features → FEATURES.md
    ├── Architecture → ARCHITECTURE.md
    └── Pitfalls → PITFALLS.md
    │
    ▼
Research Synthesizer → SUMMARY.md
    │
    ▼
Requirements extraction → REQUIREMENTS.md
    │
    ▼
Roadmapper → ROADMAP.md
    │
    ▼
User approval → STATE.md initialized

Phase Execution Flow

discuss-phase → CONTEXT.md (user preferences)
    │
    ▼
ui-phase → UI-SPEC.md (design contract, optional)
    │
    ▼
plan-phase
    ├── Research gate (blocks if RESEARCH.md has unresolved open questions)
    ├── Phase Researcher → RESEARCH.md
    ├── Planner (with reachability check) → PLAN.md files
    └── Plan Checker → Verify loop (max 3x)
    │
    ▼
state planned-phase → STATE.md (Planned/Ready to execute)
    │
    ▼
execute-phase (context reduction: truncated prompts, cache-friendly ordering)
    ├── Wave analysis (dependency grouping)
    ├── Executor per plan → code + atomic commits
    ├── SUMMARY.md per plan
    └── Verifier → VERIFICATION.md
    │
    ▼
verify-work → UAT.md (user acceptance testing)
    │
    ▼
ui-review → UI-REVIEW.md (visual audit, optional)

Context Propagation

Each workflow stage produces artifacts that feed into subsequent stages:

PROJECT.md ────────────────────────────────────────────► All agents
REQUIREMENTS.md ───────────────────────────────────────► Planner, Verifier, Auditor
ROADMAP.md ────────────────────────────────────────────► Orchestrators
STATE.md ──────────────────────────────────────────────► All agents (decisions, blockers)
CONTEXT.md (per phase) ────────────────────────────────► Researcher, Planner, Executor
RESEARCH.md (per phase) ───────────────────────────────► Planner, Plan Checker
PLAN.md (per plan) ────────────────────────────────────► Executor, Plan Checker
SUMMARY.md (per plan) ─────────────────────────────────► Verifier, State tracking
UI-SPEC.md (per phase) ────────────────────────────────► Executor, UI Auditor

File System Layout

Installation Files

~/.claude/                          # Claude Code (global install)
├── commands/gsd/*.md               # Slash commands (authoritative roster: docs/INVENTORY.md)
├── get-shit-done/
│   ├── bin/gsd-tools.cjs           # CLI utility
│   ├── bin/lib/*.cjs               # Domain modules (authoritative roster: docs/INVENTORY.md)
│   ├── workflows/*.md              # Workflow definitions (authoritative roster: docs/INVENTORY.md)
│   ├── references/*.md             # Shared reference docs (authoritative roster: docs/INVENTORY.md)
│   └── templates/                  # Planning artifact templates
├── agents/*.md                     # Agent definitions (authoritative roster: docs/INVENTORY.md)
├── hooks/*.js                      # Node.js hooks (statusline, guards, monitors, update check)
├── hooks/*.sh                      # Shell hooks (session state, commit validation, phase boundary)
├── settings.json                   # Hook registrations
└── VERSION                         # Installed version number

Equivalent paths for other runtimes:

OpenCode: ~/.config/opencode/ or ~/.opencode/
Kilo: ~/.config/kilo/ or ~/.kilo/
Gemini CLI: ~/.gemini/
Codex: ~/.codex/ (uses skills instead of commands)
Copilot: ~/.github/
Antigravity: ~/.gemini/antigravity/ (global) or ./.agent/ (local)

Project Files (`.planning/`)

.planning/
├── PROJECT.md              # Project vision, constraints, decisions, evolution rules
├── REQUIREMENTS.md         # Scoped requirements (v1/v2/out-of-scope)
├── ROADMAP.md              # Phase breakdown with status tracking
├── STATE.md                # Living memory: position, decisions, blockers, metrics
├── config.json             # Workflow configuration
├── MILESTONES.md           # Completed milestone archive
├── research/               # Domain research from /gsd-new-project
│   ├── SUMMARY.md
│   ├── STACK.md
│   ├── FEATURES.md
│   ├── ARCHITECTURE.md
│   └── PITFALLS.md
├── codebase/               # Brownfield mapping (from /gsd-map-codebase)
│   ├── STACK.md
│   ├── ARCHITECTURE.md
│   ├── CONVENTIONS.md
│   ├── CONCERNS.md
│   ├── STRUCTURE.md
│   ├── TESTING.md
│   └── INTEGRATIONS.md
├── phases/
│   └── XX-phase-name/
│       ├── XX-CONTEXT.md       # User preferences (from discuss-phase)
│       ├── XX-RESEARCH.md      # Ecosystem research (from plan-phase)
│       ├── XX-YY-PLAN.md       # Execution plans
│       ├── XX-YY-SUMMARY.md    # Execution outcomes
│       ├── XX-VERIFICATION.md  # Post-execution verification
│       ├── XX-VALIDATION.md    # Nyquist test coverage mapping
│       ├── XX-UI-SPEC.md       # UI design contract (from ui-phase)
│       ├── XX-UI-REVIEW.md     # Visual audit scores (from ui-review)
│       └── XX-UAT.md           # User acceptance test results
├── quick/                  # Quick task tracking
│   └── YYMMDD-xxx-slug/
│       ├── PLAN.md
│       └── SUMMARY.md
├── todos/
│   ├── pending/            # Captured ideas
│   └── done/               # Completed todos
├── threads/               # Persistent context threads (from /gsd-thread)
├── seeds/                 # Forward-looking ideas (from /gsd-plant-seed)
├── debug/                  # Active debug sessions
│   ├── *.md                # Active sessions
│   ├── resolved/           # Archived sessions
│   └── knowledge-base.md   # Persistent debug learnings
├── ui-reviews/             # Screenshots from /gsd-ui-review (gitignored)
└── continue-here.md        # Context handoff (from pause-work)

Installer Architecture

The installer (bin/install.js, ~3,000 lines) handles:

Runtime detection — Interactive prompt or CLI flags (--claude, --opencode, --gemini, --kilo, --codex, --copilot, --antigravity, --cursor, --windsurf, --trae, --cline, --augment, --all)
Location selection — Global (--global) or local (--local)
File deployment — Copies commands, workflows, references, templates, agents, hooks
Runtime adaptation — Transforms file content per runtime:

Claude Code: Uses as-is
OpenCode: Converts commands/agents to OpenCode-compatible flat command + subagent format
Kilo: Reuses the OpenCode conversion pipeline with Kilo config paths
Codex: Generates TOML config + skills from commands
Copilot: Maps tool names (Read→read, Bash→execute, etc.)
Gemini: Adjusts hook event names (AfterTool instead of PostToolUse)
Antigravity: Skills-first with Google model equivalents
Trae: Skills-first install to ~/.trae / ./.trae with no settings.json or hook integration
Cline: Writes .clinerules for rule-based integration
Augment Code: Skills-first with full skill conversion and config management

Path normalization — Replaces ~/.claude/ paths with runtime-specific paths
Settings integration — Registers hooks in runtime's settings.json
Patch backup — Since v1.17, backs up locally modified files to gsd-local-patches/ for /gsd-reapply-patches
Manifest tracking — Writes gsd-file-manifest.json for clean uninstall
Uninstall mode — --uninstall removes all GSD files, hooks, and settings

Platform Handling

Windows: windowsHide on child processes, EPERM/EACCES protection on protected directories, path separator normalization
WSL: Detects Windows Node.js running on WSL and warns about path mismatches
Docker/CI: Supports CLAUDE_CONFIG_DIR env var for custom config directory locations

Hook System

Architecture

Runtime Engine (Claude Code / Gemini CLI)
    │
    ├── statusLine event ──► gsd-statusline.js
    │   Reads: stdin (session JSON)
    │   Writes: stdout (formatted status), /tmp/claude-ctx-{session}.json (bridge)
    │
    ├── PostToolUse/AfterTool event ──► gsd-context-monitor.js
    │   Reads: stdin (tool event JSON), /tmp/claude-ctx-{session}.json (bridge)
    │   Writes: stdout (hookSpecificOutput with additionalContext warning)
    │
    └── SessionStart event ──► gsd-check-update.js
        Reads: VERSION file
        Writes: ~/.claude/cache/gsd-update-check.json (spawns background process)

Context Monitor Thresholds

Remaining Context	Level	Agent Behavior
> 35%	Normal	No warning injected
≤ 35%	WARNING	"Avoid starting new complex work"
≤ 25%	CRITICAL	"Context nearly exhausted, inform user"

Debounce: 5 tool uses between repeated warnings. Severity escalation (WARNING→CRITICAL) bypasses debounce.

Safety Properties

All hooks wrap in try/catch, exit silently on error
stdin timeout guard (3s) prevents hanging on pipe issues
Stale metrics (>60s old) are ignored
Missing bridge files handled gracefully (subagents, fresh sessions)
Context monitor is advisory — never issues imperative commands that override user preferences

Security Hooks (v1.27)

Prompt Guard (gsd-prompt-guard.js):

Triggers on Write/Edit to .planning/ files
Scans content for prompt injection patterns (role override, instruction bypass, system tag injection)
Advisory-only — logs detection, does not block
Patterns are inlined (subset of security.cjs) for hook independence

Workflow Guard (gsd-workflow-guard.js):

Triggers on Write/Edit to non-.planning/ files
Detects edits outside GSD workflow context (no active /gsd- command or Task subagent)
Advises using /gsd-quick or /gsd-fast for state-tracked changes
Opt-in via hooks.workflow_guard: true (default: false)

Runtime Abstraction

GSD supports multiple AI coding runtimes through a unified command/workflow architecture:

Runtime	Command Format	Agent System	Config Location
Claude Code	`/gsd-command`	Task spawning	`~/.claude/`
OpenCode	`/gsd-command`	Subagent mode	`~/.config/opencode/`
Kilo	`/gsd-command`	Subagent mode	`~/.config/kilo/`
Gemini CLI	`/gsd-command`	Task spawning	`~/.gemini/`
Codex	`$gsd-command`	Skills	`~/.codex/`
Copilot	`/gsd-command`	Agent delegation	`~/.github/`
Antigravity	Skills	Skills	`~/.gemini/antigravity/`
Trae	Skills	Skills	`~/.trae/`
Cline	Rules	Rules	`.clinerules`
Augment Code	Skills	Skills	Augment config

Abstraction Points

Tool name mapping — Each runtime has its own tool names (e.g., Claude's Bash → Copilot's execute)
Hook event names — Claude uses PostToolUse, Gemini uses AfterTool
Agent frontmatter — Each runtime has its own agent definition format
Path conventions — Each runtime stores config in different directories
Model references — inherit profile lets GSD defer to runtime's model selection

The installer handles all translation at install time. Workflows and agents are written in Claude Code's native format and transformed during deployment.

34 KiB Raw Blame History