get-shit-done

mirror of https://github.com/glittercowboy/get-shit-done synced 2026-04-25 17:25:23 +02:00

Author	SHA1	Message	Date
TÂCHES	4a34745950	feat(skills): normalize skill discovery contract across runtimes (#2261 )	2026-04-15 07:39:48 -06:00
Tom Boucher	c051e71851	test(docs): add command-count sync test; fix ARCHITECTURE.md drift (#2257 ) (#2259 ) Add tests/command-count-sync.test.cjs which programmatically counts .md files in commands/gsd/ and compares against the two count occurrences in docs/ARCHITECTURE.md ("Total commands: N" prose line and "# N slash commands" directory-tree comment). Counts are extracted from the doc at runtime — never hardcoded — so future drift is caught immediately in CI regardless of whether the doc or the filesystem moves. Fix the current drift: ARCHITECTURE.md said 69 commands; the actual committed count is 73. Both occurrences updated. Closes #2257 Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-15 08:58:13 -04:00
Tom Boucher	62b5278040	fix(installer): restore detect-custom-files and backup_custom_files lost in release drift (#1997 ) (#2233 ) PR #2038 added detect-custom-files to gsd-tools.cjs and the backup_custom_files step to update.md, but commit 7bfb11b6 is not an ancestor of v1.36.0: main was rebuilt after the merge, orphaning the change. Users on 1.36.0 running /gsd-update silently lose any locally-authored files inside GSD-managed directories. Root cause: git merge-base 7bfb11b6 HEAD returns `aa3e9cf` (Cline runtime, PR #2032), 117 commits before the release tag. The "merged" GitHub state reflects the PR merge event, not reachability from the default branch. Fix: re-apply the three changes from 7bfb11b6 onto current main: - Add detect-custom-files subcommand to gsd-tools.cjs (walk managed dirs, compare against gsd-file-manifest.json keys via path.relative(), return JSON list) - Add 'detect-custom-files' to SKIP_ROOT_RESOLUTION set - Restore backup_custom_files step in update.md before run_update - Restore tests/update-custom-backup.test.cjs (7 tests, all passing) Closes #2229 Closes #1997 Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-14 18:50:53 -04:00
Tom Boucher	50f61bfd9a	fix(hooks): complete stale-hooks false-positive fix — stamp .sh version headers + fix detector regex (#2224 ) * fix(hooks): stamp gsd-hook-version in .sh hooks and fix stale detection regex (#2136, #2206) Three-part fix for the persistent "⚠ stale hooks — run /gsd-update" false positive that appeared on every session after a fresh install. Root cause: the stale-hook detector (gsd-check-update.js) could only match the JS comment syntax // in its version regex — never the bash # syntax used in .sh hooks. And the bash hooks had no version header at all, so they always landed in the "unknown / stale" branch regardless. Neither partial fix (PR #2207 regex only, PR #2215 install stamping only) was sufficient alone: - Regex fix without install stamping: hooks install with literal "{{GSD_VERSION}}", the {{-guard silently skips them, bash hook staleness permanently undetectable after future updates. - Install stamping without regex fix: hooks are stamped correctly with "# gsd-hook-version: 1.36.0" but the detector's // regex can't read it; still falls to the unknown/stale branch on every session. Fix: 1. Add "# gsd-hook-version: {{GSD_VERSION}}" header to gsd-phase-boundary.sh, gsd-session-state.sh, gsd-validate-commit.sh 2. Extend install.js (both bundled and Codex paths) to substitute {{GSD_VERSION}} in .sh files at install time (same as .js hooks) 3. Extend gsd-check-update.js versionMatch regex to handle bash "#" comment syntax: /(?:\/\/\|#) gsd-hook-version:\s(.+)/ Tests: 11 new assertions across 5 describe blocks covering all three fix parts independently plus an E2E install+detect round-trip. 3885/3885 pass. Approach credit: PR #2207 (j2h4u / Maxim Brashenko) for the regex fix; PR #2215 (nitsan2dots) for the install.js substitution approach. Closes #2136, #2206, #2209, #2210, #2212 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> refactor(hooks): extract check-update worker to dedicated file, eliminating template-literal regex escaping Move stale-hook detection logic from inline `node -e '<template literal>'` subprocess to a standalone gsd-check-update-worker.js. Benefits: - Regex is plain JS with no double-escaping (root cause of the (?:\\/\\/\|#) confusion) - Worker is independently testable and can be read directly by tests - Uses execFileSync (array args) to satisfy security hook that blocks execSync - MANAGED_HOOKS now includes gsd-check-update-worker.js itself Update tests to read worker file instead of main hook for regex/configDir assertions. All 3886 tests pass. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-14 17:57:38 -04:00
Lex Christopherson	201b8f1a05	1.36.0 v1.36.0	2026-04-14 08:26:26 -06:00
Lex Christopherson	73c7281a36	docs: update changelog and README for v1.36.0 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 08:26:17 -06:00
Gabriel Rodrigues Garcia	e6e33602c3	fix(init): ignore archived phases from prior milestones sharing a phase number (#2186 ) When a new milestone reuses a phase number that exists in an archived milestone (e.g., v2.0 Phase 2 while v1.0-phases/02-old-feature exists), findPhaseInternal falls through to the archive and returns the old phase. init plan-phase and init execute-phase then emitted archived values for phase_dir, phase_slug, has_context, has_research, and *_path fields, while phase_req_ids came from the current ROADMAP — producing a silent inconsistency that pointed downstream agents at a shipped phase from a previous milestone. cmdInitPhaseOp already guarded against this (see lines 617-642); apply the same guard in cmdInitPlanPhase, cmdInitExecutePhase, and cmdInitVerifyWork: if findPhaseInternal returns an archived match and the current ROADMAP.md has the phase, discard the archived phaseInfo so the ROADMAP fallback path produces clean values. Adds three regression tests covering plan-phase, execute-phase, and verify-work under the shared-number scenario.	2026-04-13 10:59:11 -04:00
pingchesu	c11ec05554	feat: /gsd-graphify integration — knowledge graph for planning agents (#2164 ) * feat(01-01): create graphify.cjs library module with config gate, subprocess helper, presence detection, and version check - isGraphifyEnabled() gates on config.graphify.enabled in .planning/config.json - disabledResponse() returns structured disabled message with enable instructions - execGraphify() wraps spawnSync with PYTHONUNBUFFERED=1, 30s timeout, ENOENT/SIGTERM handling - checkGraphifyInstalled() detects missing binary via --help probe - checkGraphifyVersion() uses python3 importlib.metadata, validates >=0.4.0,<1.0 range * feat(01-01): register graphify.enabled in VALID_CONFIG_KEYS - Added graphify.enabled after intel.enabled in config.cjs VALID_CONFIG_KEYS Set - Enables gsd-tools config-set graphify.enabled true without key rejection * test(01-02): add comprehensive unit tests for graphify.cjs module - 23 tests covering all 5 exported functions across 5 describe blocks - Config gate tests: enabled/disabled/missing/malformed scenarios (TEST-03, FOUND-01) - Subprocess tests: success, ENOENT, timeout, env vars, timeout override (FOUND-04) - Presence tests: --help detection, install instructions (FOUND-02, TEST-04) - Version tests: compatible/incompatible/unparseable/missing (FOUND-03, TEST-04) - Fix graphify.cjs to use childProcess.spawnSync (not destructured) for testability * feat(02-01): add graphifyQuery, graphifyStatus, graphifyDiff to graphify.cjs - safeReadJson wraps JSON.parse in try/catch, returns null on failure - buildAdjacencyMap creates bidirectional adjacency map from graph nodes/edges - seedAndExpand matches on label+description (case-insensitive), BFS-expands up to maxHops - applyBudget uses chars/4 token estimation, drops AMBIGUOUS then INFERRED edges - graphifyQuery gates on config, reads graph.json, supports --budget option - graphifyStatus returns exists/last_build/counts/staleness or no-graph message - graphifyDiff compares current graph.json against .last-build-snapshot.json * feat(02-01): add case 'graphify' routing block to gsd-tools.cjs - Routes query/status/diff/build subcommands to graphify.cjs handlers - Query supports --budget flag via args.indexOf parsing - Build returns Phase 3 placeholder error message - Unknown subcommand lists all 4 available options * feat(02-01): create commands/gsd/graphify.md command definition - YAML frontmatter with name, description, argument-hint, allowed-tools - Config gate reads .planning/config.json directly (not gsd-tools config get-value) - Inline CLI calls for query/status/diff subcommands - Agent spawn placeholder for build subcommand - Anti-read warning and anti-patterns section * test(02-02): add Phase 2 test scaffolding with fixture helpers and describe blocks - Import 7 Phase 2 exports (graphifyQuery, graphifyStatus, graphifyDiff, safeReadJson, buildAdjacencyMap, seedAndExpand, applyBudget) - Add writeGraphJson and writeSnapshotJson fixture helpers - Add SAMPLE_GRAPH constant with 5 nodes, 5 edges across all confidence tiers - Scaffold 7 new describe blocks for Phase 2 functions * test(02-02): add comprehensive unit tests for all Phase 2 graphify.cjs functions - safeReadJson: valid JSON, malformed JSON, missing file (3 tests) - buildAdjacencyMap: bidirectional entries, orphan nodes, edge objects (3 tests) - seedAndExpand: label match, description match, BFS depth, empty results, maxHops (5 tests) - applyBudget: no budget passthrough, AMBIGUOUS drop, INFERRED drop, trimmed footer (4 tests) - graphifyQuery: disabled gate, no graph, valid query, confidence tiers, budget, counts (6 tests) - graphifyStatus: disabled gate, no graph, counts with graph, hyperedge count (4 tests) - graphifyDiff: disabled gate, no baseline, no graph, added/removed, changed (5 tests) - Requirements: TEST-01, QUERY-01..03, STAT-01..02, DIFF-01..02 - Full suite: 53 graphify tests pass, 3666 total tests pass (0 regressions) * feat(03-01): add graphifyBuild() pre-flight, writeSnapshot(), and build_timeout config key - Add graphifyBuild(cwd) returning spawn_agent JSON with graphs_dir, timeout, version - Add writeSnapshot(cwd) reading graph.json and writing atomic .last-build-snapshot.json - Register graphify.build_timeout in VALID_CONFIG_KEYS - Import atomicWriteFileSync from core.cjs for crash-safe snapshot writes * feat(03-01): wire build routing in gsd-tools and flesh out builder agent prompt - Replace Phase 3 placeholder with graphifyBuild() and writeSnapshot() dispatch - Route 'graphify build snapshot' to writeSnapshot(), 'graphify build' to graphifyBuild() - Expand Step 3 builder agent prompt with 5-step workflow: invoke, validate, copy, snapshot, summary - Include error handling guidance: non-zero exit preserves prior .planning/graphs/ * test(03-02): add graphifyBuild test suite with 6 tests - Disabled config returns disabled response - Missing CLI returns error with install instructions - Successful pre-flight returns spawn_agent action with correct shape - Creates .planning/graphs/ directory if missing - Reads graphify.build_timeout from config (custom 600s) - Version warning included when outside tested range * test(03-02): add writeSnapshot test suite with 6 tests - Writes snapshot from existing graph.json with correct structure - Returns error when graph.json does not exist - Returns error when graph.json is invalid JSON - Handles empty nodes and edges arrays - Handles missing nodes/edges keys gracefully - Overwrites existing snapshot on incremental rebuild * feat(04-01): add load_graph_context step to gsd-planner agent - Detects .planning/graphs/graph.json via ls check - Checks graph staleness via graphify status CLI call - Queries phase-relevant context with single --budget 2000 query - Silent no-op when graph.json absent (AGENT-01) * feat(04-01): add Step 1.3 Load Graph Context to gsd-phase-researcher agent - Detects .planning/graphs/graph.json via ls check - Checks graph staleness via graphify status CLI call - Queries 2-3 capability keywords with --budget 1500 each - Silent no-op when graph.json absent (AGENT-02) * test(04-01): add AGENT-03 graceful degradation tests - 3 AGENT-03 tests: absent-graph query, status, multi-term handling - 2 D-12 integration tests: known-graph query and status structure - All 5 tests pass with existing helpers and imports	2026-04-12 18:17:18 -04:00
Rezolv	6f79b1dd5e	feat(sdk): Phase 1 typed query foundation (gsd-sdk query) (#2118 ) * feat(sdk): add typed query foundation and gsd-sdk query (Phase 1) Add sdk/src/query registry and handlers with tests, GSDQueryError, CLI query wiring, and supporting type/tool-scoping hooks. Update CHANGELOG. Vitest 4 constructor mock fixes in milestone-runner tests. Made-with: Cursor * chore: gitignore .cursor for local-only Cursor assets Made-with: Cursor * fix(sdk): harden query layer for PR review (paths, locks, CLI, ReDoS) - resolvePathUnderProject: realpath + relative containment for frontmatter and key_links - commitToSubrepo: path checks + sanitizeCommitMessage - statePlannedPhase: readModifyWriteStateMd (lock); MUTATION_COMMANDS + events - key_links: regexForKeyLinkPattern length/ReDoS guard; phase dirs: reject .. and separators - gsd-sdk: strip --pick before parseArgs; strict parser; QueryRegistry.commands() - progress: static GSDError import; tests updated Made-with: Cursor * feat(sdk): query follow-up — tests, QUERY-HANDLERS, registry, locks, intel depth Made-with: Cursor * docs(sdk): use ASCII punctuation in QUERY-HANDLERS.md Made-with: Cursor	2026-04-12 18:15:04 -04:00
Tibsfox	66a5f939b0	feat(health): detect stale and orphan worktrees in validate-health (W017) (#2175 ) Add W017 warning to cmdValidateHealth that detects linked git worktrees that are stale (older than 1 hour, likely from crashed agents) or orphaned (path no longer exists on disk). Parses git worktree list --porcelain output, skips the main worktree, and provides actionable fix suggestions. Gracefully degrades if git worktree is unavailable. Closes #2167 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 17:56:39 -04:00
Tibsfox	67f5c6fd1d	docs(agents): standardize required_reading patterns across agent specs (#2176 ) Closes #2168 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 17:56:19 -04:00
Tibsfox	b2febdec2f	feat(workflow): scan planted seeds during new-milestone step 2.5 (#2177 ) Closes #2169 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 17:56:00 -04:00
Tom Boucher	990b87abd4	feat(discuss-phase): adapt gray area language for non-technical owners via USER-PROFILE.md (#2125 ) (#2173 ) When USER-PROFILE.md signals a non-technical product owner (learning_style: guided, jargon in frustration_triggers, or high-level explanation_depth), discuss-phase now reframes gray area labels and advisor_research rationale paragraphs in product-outcome language. Same technical decisions, translated framing so product owners can participate meaningfully without needing implementation vocabulary. Closes #2125 Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-12 16:45:29 -04:00
Tom Boucher	6d50974943	fix: remove head -5 truncation from UAT file listing in verify-work (#2172 ) Projects with more than 5 phases had active UAT sessions silently dropped from the verify-work listing. Only the first 5 *-UAT.md files were shown, causing /gsd-verify-work to report incomplete results. Remove the \| head -5 pipe so all UAT files are listed regardless of phase count. Closes #2171 Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-12 16:06:17 -04:00
Bhaskoro Muthohar	5a802e4fd2	feat: add flow diagram directive to phase researcher agent (#2139 ) (#2147 ) Architecture diagrams generated by gsd-phase-researcher now enforce data-flow style (conceptual components with arrows) instead of file-listing style. The directive is language-agnostic and applies to all project types. Changes: - agents/gsd-phase-researcher.md: add System Architecture Diagram subsection in Architecture Patterns output template - get-shit-done/templates/research.md: add matching directive in both architecture_patterns template sections - tests/phase-researcher-flow-diagram.test.cjs: 8 tests validating directive presence, content, and ordering in agent and template Closes #2139	2026-04-12 15:56:20 -04:00
Andreas Brauchli	72af8cd0f7	fix: display relative time in intel status output (#2132 ) * fix: display relative time instead of UTC in intel status output The `updated_at` timestamps in `gsd-tools intel status` were displayed as raw ISO/UTC strings, making them appear to show the wrong time in non-UTC timezones. Replace with fuzzy relative times ("5 minutes ago", "1 day ago") which are timezone-agnostic and more useful for freshness. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test: add regression tests for timeAgo utility Covers boundary values (seconds/minutes/hours/days/months/years), singular vs plural formatting, and future-date edge case. Addresses review feedback on #2132. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 15:54:17 -04:00
Tom Boucher	b896db6f91	fix: copy hook files to Codex install target (#2153 ) (#2166 ) Codex install registered gsd-check-update.js in config.toml but never copied the hook file to ~/.codex/hooks/. The hook-copy block in install() was gated by !isCodex, leaving a broken reference on every fresh Codex global install. Adds a dedicated hook-copy step inside the isCodex branch that mirrors the existing copy logic (template substitution, chmod). Adds a regression test that verifies the hook file physically exists after install. Closes #2153 Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-12 15:52:57 -04:00
Tom Boucher	4bf3b02bec	fix: add phase add-batch command to prevent duplicate phase numbers on parallel invocations (#2165 ) (#2170 ) Parallel `phase add` invocations each read disk state before any write completes, causing all processes to calculate the same next phase number and produce duplicate directories and ROADMAP entries. The new `add-batch` subcommand accepts a JSON array of phase descriptions and performs all directory creation and ROADMAP appends within a single `withPlanningLock()` call, incrementing `maxPhase` within the lock for each entry. This guarantees sequential numbering regardless of call concurrency patterns. Closes #2165 Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-12 15:52:33 -04:00
Tom Boucher	c5801e1613	fix: show contextual warning for dev installs with stale hooks (#2162 ) When a user manually installs a dev branch where VERSION > npm latest, gsd-check-update detects hooks as "stale" and the statusline showed the red "⚠ stale hooks — run /gsd-update" message. Running /gsd-update would incorrectly downgrade the dev install to the npm release. Fix: detect dev install (cache.installed > cache.latest) in the statusline and show an amber "⚠ dev install — re-run installer to sync hooks" message instead, with /gsd-update reserved for normal upgrades. Also expand the update.md workflow's installed > latest branch to explain the situation and give the correct remediation command (node bin/install.js --global --claude, not /gsd-update). Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-12 11:52:21 -04:00
Tom Boucher	f0a20e4dd7	feat: open artifact audit gate for milestone close and phase verify (#2157 , #2158 ) (#2160 ) * feat(2158): add audit.cjs open artifact scanner with security-hardened path handling - Scans 8 .planning/ artifact categories for unresolved state - Debug sessions, quick tasks, threads, todos, seeds, UAT gaps, verification gaps, CONTEXT open questions - requireSafePath with allowAbsolute:true on all file reads - sanitizeForDisplay on all output strings - Graceful per-category error handling, never throws - formatAuditReport returns human-readable report with emoji indicators * feat(2158): add audit-open CLI command to gsd-tools.cjs + Deferred Items to state template - Add audit-open [--json] case to switch router - Add audit-open entry to header comment block - Add Deferred Items section to state.md template for milestone carry-forward * feat(2157): add phase artifact scan step to verify-work workflow - scan_phase_artifacts step runs audit-open --json after UAT completion - Surfaces UAT gaps, VERIFICATION gaps, and CONTEXT open questions for current phase - Prompts user to confirm or decline before marking phase verified - Records acknowledged gaps in VERIFICATION.md Acknowledged Gaps section - SECURITY note: file paths validated, content truncated and sanitized before display * feat(2158): add pre-close artifact audit gate to complete-milestone workflow - pre_close_artifact_audit step runs before verify_readiness - Displays full audit report when open items exist - Three-way choice: Resolve, Acknowledge all, or Cancel - Acknowledge path writes deferred items table to STATE.md - Records deferred count in MILESTONES.md entry - Adds three new success criteria checklist items - SECURITY note on sanitizing all STATE.md writes * test(2157,2158): add milestone audit gate tests - 6 tests for audit.cjs: structured result, graceful missing dirs, open debug detection, resolved session exclusion, formatAuditReport header, all-clear message - 3 tests for complete-milestone.md: pre_close_artifact_audit step, Deferred Items, security note presence - 2 tests for verify-work.md: scan_phase_artifacts step, user prompt for gaps - 1 test for state.md template: Deferred Items section	2026-04-12 10:06:42 -04:00
Tom Boucher	7b07dde150	feat: add list/status/resume/close subcommands to /gsd-quick and /gsd-thread (#2159 ) * feat(2155): add list/status/resume subcommands and security hardening to /gsd-quick - Add SUBCMD routing (list/status/resume/run) before quick workflow delegation - LIST subcommand scans .planning/quick/ dirs, reads SUMMARY.md frontmatter status - STATUS subcommand shows plan description and current status for a slug - RESUME subcommand finds task by slug, prints context, then resumes quick workflow - Slug sanitization: only [a-z0-9-], max 60 chars, reject ".." and "/" - Directory name sanitization for display (strip non-printable + ANSI sequences) - Add security_notes section documenting all input handling guarantees * feat(2156): formalize thread status frontmatter, add list/close/status subcommands, remove heredoc injection risk - Replace heredoc (cat << 'EOF') with Write tool instruction — eliminates shell injection risk - Thread template now uses YAML frontmatter (slug, title, status, created, updated fields) - Add subcommand routing: list / list --open / list --resolved / close <slug> / status <slug> - LIST mode reads status from frontmatter, falls back to ## Status heading - CLOSE mode updates frontmatter status to resolved via frontmatter set, then commits - STATUS mode displays thread summary (title, status, goal, next steps) without spawning - RESUME mode updates status from open → in_progress via frontmatter set - Slug sanitization for close/status: only [a-z0-9-], max 60 chars, reject ".." and "/" - Add security_notes section documenting all input handling guarantees * test(2155,2156): add quick and thread session management tests - quick-session-management.test.cjs: verifies list/status/resume routing, slug sanitization, directory sanitization, frontmatter get usage, security_notes - thread-session-management.test.cjs: verifies list filters (--open/--resolved), close/status subcommands, no heredoc, frontmatter fields, Write tool usage, slug sanitization, security_notes	2026-04-12 10:05:17 -04:00
Tom Boucher	1aa89b8ae2	feat: debug skill dispatch and session manager sub-orchestrator (#2154 ) * feat(2148): add specialist_hint to ROOT CAUSE FOUND and skill dispatch to /gsd-debug - Add specialist_hint field to ROOT CAUSE FOUND return format in gsd-debugger structured_returns section - Add derivation guidance in return_diagnosis step (file extensions → hint mapping) - Add Step 4.5 specialist skill dispatch block to debug.md with security-hardened DATA_START/DATA_END prompt - Map specialist_hint values to skills: typescript-expert, swift-concurrency, python-expert-best-practices-code-review, ios-debugger-agent, engineering:debug - Session manager now handles specialist dispatch internally; debug.md documents delegation intent Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * feat(2151): add gsd-debug-session-manager agent and refactor debug command as thin bootstrap - Create agents/gsd-debug-session-manager.md: handles full checkpoint/continuation loop in isolated context - Agent spawns gsd-debugger, handles ROOT CAUSE FOUND/TDD CHECKPOINT/DEBUG COMPLETE/CHECKPOINT REACHED/INVESTIGATION INCONCLUSIVE returns - Specialist dispatch via AskUserQuestion before fix options; user responses wrapped in DATA_START/DATA_END - Returns compact ≤2K DEBUG SESSION COMPLETE summary to keep main context lean - Refactor commands/gsd/debug.md: Steps 3-5 replaced with thin bootstrap that spawns session manager - Update available_agent_types to include gsd-debug-session-manager - Continue subcommand also delegates to session manager Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * test(2148,2151): add tests for skill dispatch and session manager - Add 8 new tests in debug-session-management.test.cjs covering specialist_hint field, skill dispatch mapping in debug.md, DATA_START/DATA_END security boundaries, session manager tools, compact summary format, anti-heredoc rule, and delegation check - Update copilot-install.test.cjs expected agent list to include gsd-debug-session-manager Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-12 09:40:36 -04:00
Tom Boucher	20fe395064	feat(2149,2150): add project skills awareness to 9 GSD agents (#2152 ) - gsd-debugger: add Project skills block after required_reading - gsd-integration-checker, gsd-security-auditor, gsd-nyquist-auditor, gsd-codebase-mapper, gsd-roadmapper, gsd-eval-auditor, gsd-intel-updater, gsd-doc-writer: add Project skills block at context-load step - Add context budget note to 8 quality/audit agents - gsd-doc-writer: add security note for user-supplied doc_assignment content - Add tests/agent-skills-awareness.test.cjs validation suite	2026-04-12 09:40:20 -04:00
Tom Boucher	c17209f902	feat(2145): /gsd-debug session management, TDD gate, reasoning checkpoint, security hardening (#2146 ) * feat(2145): add list/continue/status subcommands and surface next_action in /gsd-debug - Parse SUBCMD from \$ARGUMENTS before active-session check (list/status/continue/debug) - Step 1a: list subcommand prints formatted table of all active sessions - Step 1b: status subcommand prints full session summary without spawning agent - Step 1c: continue subcommand surfaces Current Focus then spawns continuation agent - Surface [debug] Session/Status/Hypothesis/Next before every agent spawn - Read TDD_MODE from config in Step 0 (used in Step 4) - Slug sanitization: strip path traversal chars, enforce ^[a-z0-9][a-z0-9-]$ pattern feat(2145): add TDD mode, delta debugging, reasoning checkpoint to gsd-debugger - Security note in <role>: DATA_START/DATA_END markers are data-only, never instructions - Delta Debugging technique added to investigation_techniques (binary search over change sets) - Structured Reasoning Checkpoint technique: mandatory five-field block before any fix - fix_and_verify step 0: mandatory reasoning_checkpoint before implementing fix - TDD mode block in <modes>: red/green cycle, tdd_checkpoint tracking, TDD CHECKPOINT return - TDD CHECKPOINT structured return format added to <structured_returns> - next_action concreteness guidance added to <debug_file_protocol> * feat(2145): update DEBUG.md template and docs for debug enhancements - DEBUG.md template: add reasoning_checkpoint and tdd_checkpoint fields to Current Focus - DEBUG.md section_rules: document next_action concreteness requirement and new fields - docs/COMMANDS.md: document list/status/continue subcommands and TDD mode flag - tests/debug-session-management.test.cjs: 12 content-validation tests (all pass)	2026-04-12 09:00:23 -04:00
Tom Boucher	002bcf2a8a	fix(2137): skip worktree isolation when .gitmodules detected (#2144 ) * feat(sdk): add typed query foundation and gsd-sdk query (Phase 1) Add sdk/src/query registry and handlers with tests, GSDQueryError, CLI query wiring, and supporting type/tool-scoping hooks. Update CHANGELOG. Vitest 4 constructor mock fixes in milestone-runner tests. Made-with: Cursor * fix(2137): skip worktree isolation when .gitmodules detected When a project contains git submodules, worktree isolation cannot correctly handle submodule commits — three separate gaps exist in worktree setup, executor commit protocol, and merge-back. Rather than patch each gap individually, detect .gitmodules at phase start and fall back to sequential execution, which handles submodules transparently (Option B). Affected workflows: execute-phase.md, quick.md --------- Co-authored-by: David Sienkowski <dave@sienkowski.com>	2026-04-12 08:33:04 -04:00
Tom Boucher	58632e0718	fix(2095): use cp instead of git-show for worktree STATE.md backup (#2143 ) Replace `git show HEAD:.planning/STATE.md` with `cp .planning/STATE.md` in the worktree merge-back protection logic of execute-phase.md and quick.md. The git show approach exits 128 when STATE.md has uncommitted changes or is not yet in HEAD's committed tree, leaving an empty backup and causing the post-merge restore guard to silently skip — zeroing or staling the file. Using cp reads the actual working-tree file (including orchestrator updates that haven't been committed yet), which is exactly what "main always wins" should protect.	2026-04-12 08:26:57 -04:00
Tom Boucher	a91f04bc82	fix(2136): add missing bash hooks to MANAGED_HOOKS staleness check (#2141 ) * test(2136): add failing test for MANAGED_HOOKS missing bash hooks Asserts that every gsd-.js and gsd-.sh file shipped in hooks/ appears in the MANAGED_HOOKS array inside gsd-check-update.js. The three bash hooks (gsd-phase-boundary.sh, gsd-session-state.sh, gsd-validate-commit.sh) were absent, causing this test to fail before the fix. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(2136): add gsd-phase-boundary.sh, gsd-session-state.sh, gsd-validate-commit.sh to MANAGED_HOOKS The MANAGED_HOOKS array in gsd-check-update.js only listed the 6 JS hooks. The 3 bash hooks were never checked for staleness after a GSD update, meaning users could run stale shell hooks indefinitely without any warning. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-12 08:10:56 -04:00
Tom Boucher	86dd9e1b09	fix(2134): fix code-review SUMMARY.md parser section-reset for top-level keys (#2142 ) * test(2134): add failing test for code-review SUMMARY.md YAML parser section reset Demonstrates bug #2134: the section-reset regex in the inline node parser in get-shit-done/workflows/code-review.md uses \s+ (requires leading whitespace), so top-level YAML keys at column 0 (decisions:, metrics:, tags:) never reset inSection, causing their list items to be mis-classified as key_files.modified entries. RED test asserts that the buggy parser contaminates the file list with decision strings. GREEN test and additional tests verify correct behaviour with the fix. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(2134): fix YAML parser section reset to handle top-level keys (\s* not \s+) The inline node parser in compute_file_scope (Tier 2) used \s+ in the section-reset regex, requiring leading whitespace. Top-level YAML keys at column 0 (decisions:, metrics:, tags:) never matched, so inSection was never cleared and their list items were mis-classified as key_files.modified entries. Fix: change \s+ to \s* in both the reset check and its dash-guard companion so any key at any indentation level (including column 0) resets inSection. Before: /^\s+\w+:/.test(line) && !/^\s+-/.test(line) After: /^\s\w+:/.test(line) && !/^\s-/.test(line) Closes #2134 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-12 08:10:30 -04:00
Tibsfox	ae8c0e6b26	docs(sdk): recommend 1-hour cache TTL for system prompts (#2055 ) * docs(sdk): recommend 1-hour cache TTL for system prompts (#1980) Add sdk/docs/caching.md with prompt caching best practices for API users building on GSD patterns. Recommends 1-hour TTL for executor, planner, and verifier system prompts which are large and stable across requests within a session. The default 5-minute TTL expires during human review pauses between phases. 1-hour TTL costs 2x on cache miss but pays for itself after 3 hits — GSD phases typically involve dozens of requests per hour. Closes #1980 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs(sdk): fix ttl type to string per Anthropic API spec The Anthropic extended caching API requires ttl as a string ('1h'), not an integer (3600). Corrects both code examples in caching.md. Review feedback on #2055 from @trek-e. * docs(sdk): fix second ttl value in direct-api example to string '1h' Follow-up to trek-e's re-review on #2055. The first fix corrected the Agent SDK integration example (line 16) but missed the second code block (line 60) that shows the direct Claude API call. Both now use ttl: '1h' (string) as the Anthropic extended caching API requires — integer forms like ttl: 3600 are silently ignored by the API and the cache never activates. Closes #1980 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-12 08:09:44 -04:00
Tom Boucher	eb03ba3dd8	fix(2129): exclude 999.x backlog phases from next-phase and all_complete (#2135 ) * test(2129): add failing tests for 999.x backlog phase exclusion Bug A: phase complete reports 999.1 as next phase instead of 3 Bug B: init manager returns all_complete:false when only 999.x is incomplete Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(2129): exclude 999.x backlog phases from next-phase scan and all_complete check In cmdPhaseComplete, backlog phases (999.x) on disk were picked as the next phase when intervening milestone phases had no directory yet. Now the filesystem scan skips any directory whose phase number starts with 999. In cmdInitManager, all_complete compared completed count against the full phase list including 999.x stubs, making it impossible to reach true when backlog items existed. Now the check uses only non-backlog phases. Closes #2129 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-11 23:50:25 -04:00
Tom Boucher	637daa831b	fix(2130): anchor extractFrontmatter regex to file start (#2133 ) * test(2130): add failing tests for frontmatter body --- sequence mis-parse Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(2130): anchor extractFrontmatter regex to file start, preventing body --- mis-parse Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-11 23:47:50 -04:00
Tom Boucher	553d9db56e	ci: upgrade GitHub Actions to Node 22+ runtimes (#2128 ) - actions/checkout v4.2.2 → v6.0.2 (pr-gate, auto-branch) - actions/github-script v7.0.1/v8 → v9.0.0 (all workflows) - actions/stale v9.0.0 → v10.2.0 Eliminates Node.js 20 deprecation warnings. Node 20 actions will be forced to Node 24 on June 2, 2026 and removed Sept 16, 2026. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-11 16:28:18 -04:00
Tom Boucher	8009b67e3e	feat: expose tdd_mode in init JSON and add --tdd flag override (#2124 ) * test(2123): add failing tests for TDD init JSON exposure and --tdd flag Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat(2123): expose tdd_mode in init JSON and add --tdd flag override Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-11 15:39:50 -04:00
Tom Boucher	6b7b6a0ae8	ci: fix release pipeline — update actions, add GH releases, extend CI triggers (#1956 ) - Update actions/checkout and actions/setup-node to v6 in release.yml and hotfix.yml (Node.js 24 compat, prevents June 2026 breakage) - Add GitHub Release creation to release finalize, release RC, and hotfix finalize steps (populates Releases page automatically) - Extend test.yml push triggers to release/ and hotfix/ branches - Extend security-scan.yml PR triggers to release/ and hotfix/ branches Closes #1955 Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-11 15:10:12 -04:00
Tom Boucher	177cb544cb	chore(ci): add branch-cleanup workflow — auto-delete on merge + weekly sweep (#2051 ) Adds .github/workflows/branch-cleanup.yml with two jobs: - delete-merged-branch: fires on pull_request closed+merged, immediately deletes the head branch. Belt-and-suspenders alongside the repo's delete_branch_on_merge setting (see issue for the one-line owner action). - sweep-orphaned-branches: runs weekly (Sunday 4am UTC) and on workflow_dispatch. Paginates all branches, deletes any whose only closed PRs are merged — cleans up branches that pre-date the setting change. Both jobs use the pinned actions/github-script hash already used across the repo. Protected branches (main, develop, release) are never touched. 422 responses (branch already gone) are treated as success. Closes #2050 Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-11 15:10:09 -04:00
Tom Boucher	3d096cb83c	Merge pull request #2078 from gsd-build/release/1.35.0 chore: merge release v1.35.0 to main	2026-04-11 15:10:02 -04:00
Tom Boucher	805696bd03	feat(state): add metrics table pruning and auto-prune on phase complete (#2087 ) (#2120 ) - Extend cmdStatePrune to prune Performance Metrics table rows older than cutoff - Add workflow.auto_prune_state config key (default: false) - Call cmdStatePrune automatically in cmdPhaseComplete when enabled - Document workflow.auto_prune_state in planning-config.md reference - Add silent option to cmdStatePrune for programmatic use without stdout Closes #2087 Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-11 15:02:55 -04:00
Tom Boucher	e24cb18b72	feat(workflow): add opt-in TDD pipeline mode (#2119 ) * feat(workflow): add opt-in TDD pipeline mode (workflow.tdd_mode) Add workflow.tdd_mode config key (default: false) that enables red-green-refactor as a first-class phase execution mode. When enabled, the planner aggressively applies type: tdd to eligible tasks and the executor enforces RED/GREEN/REFACTOR gate sequence with fail-fast on unexpected GREEN before RED. An end-of-phase collaborative review checkpoint verifies gate compliance. Closes #1871 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(test): allowlist plan-phase.md in prompt injection scan plan-phase.md exceeds 50K chars after TDD mode integration. This is legitimate orchestration complexity, not prompt stuffing. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * ci: trigger CI run * ci: trigger CI run --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-11 14:42:01 -04:00
Tom Boucher	d19b61a158	Merge pull request #2121 from gsd-build/feat/1861-pattern-mapper feat: add gsd-pattern-mapper agent for codebase pattern analysis	2026-04-11 14:37:03 -04:00
Tom Boucher	29f8bfeead	fix(test): allowlist plan-phase.md in prompt injection scan plan-phase.md exceeds 50K chars after pattern mapper step addition. This is legitimate orchestration complexity, not prompt stuffing. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-11 14:34:13 -04:00
Tom Boucher	d59d635560	feat: add gsd-pattern-mapper agent for codebase pattern analysis (#1861 ) Add a new pattern mapper agent that analyzes the codebase for existing patterns before planning, producing PATTERNS.md with per-file analog assignments and code excerpts. Integrated into plan-phase workflow as Step 7.8 (between research and planning), controlled by the workflow.pattern_mapper config key (default: true). Changes: - New agent: agents/gsd-pattern-mapper.md - New config key: workflow.pattern_mapper in VALID_CONFIG_KEYS and CONFIG_DEFAULTS - init plan-phase: patterns_path field in JSON output - plan-phase.md: Step 7.8 spawns pattern mapper, PATTERNS_PATH in planner files_to_read - gsd-plan-checker.md: Dimension 12 (Pattern Compliance) - model-profiles.cjs: gsd-pattern-mapper profile entry - Tests: tests/pattern-mapper.test.cjs (5 tests) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-11 14:25:02 -04:00
Tom Boucher	ce1bb1f9ca	Merge pull request #2062 from Tibsfox/fix/global-skills-1992 feat(config): support global skills from ~/.claude/skills/ in agent_skills	2026-04-11 13:57:08 -04:00
Tom Boucher	121839e039	Merge pull request #2059 from Tibsfox/fix/context-exhaustion-record-1974 feat(hooks): auto-record session state on context exhaustion	2026-04-11 13:56:43 -04:00
Tom Boucher	6b643b37f4	Merge pull request #2061 from Tibsfox/fix/inline-small-plans-1979 perf(workflow): default to inline execution for 1-2 task plans	2026-04-11 13:56:35 -04:00
Tom Boucher	50be9321e3	Merge pull request #2058 from Tibsfox/fix/limit-prior-context-1969 perf(workflow): limit prior-phase context to 3 most recent phases	2026-04-11 13:56:27 -04:00
Tom Boucher	190804fc73	Merge pull request #2063 from Tibsfox/feat/state-prune-1970 feat(state): add state prune command for unbounded section growth	2026-04-11 13:56:19 -04:00
Tom Boucher	0c266958e4	Merge pull request #2054 from Tibsfox/fix/cache-state-frontmatter-1967 perf(state): cache buildStateFrontmatter disk scan per process	2026-04-11 13:55:43 -04:00
Tom Boucher	d8e7a1166b	Merge pull request #2053 from Tibsfox/fix/merge-readdir-health-1973 perf(health): merge four readdirSync passes into one in cmdValidateHealth	2026-04-11 13:55:26 -04:00
Tom Boucher	3e14904afe	Merge pull request #2056 from Tibsfox/fix/atomic-writes-1972 fix(core): extend atomicWriteFileSync to milestone, phase, and frontmatter	2026-04-11 13:54:55 -04:00
Tom Boucher	6d590dfe19	Merge pull request #2116 from gsd-build/fix/qwen-claude-reference-leaks fix(install): eliminate Claude reference leaks in Qwen install paths	2026-04-11 11:21:40 -04:00

1 2 3 4 5 ...

1854 Commits