# Changelog All notable changes to `@vyuhlabs/dxkit` are documented in this file. The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/), and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html). ## [Unreleased] ## [2.23.0] - 2026-07-04 ### Added — the correctness floor (a loop-safety liveness gate) The guardrail proves "no net-new findings" (secrets, CVEs, SAST, coverage). It does not prove the code still **compiles and its affected tests still pass** — so an autonomous agent loop can satisfy the finding gate while shipping code that does not build, and even a broken test that lifts coverage gets rewarded. The correctness floor closes that gap: a liveness check that asks "does this still build, and do the tests it affects still pass?" before an agent may declare "done". A failing floor is a pass/fail signal, not a fingerprinted, grandfathered finding (there is no "grandfather a syntax error"), so it sits outside the baseline and allowlist. - **Pack-declared, runner-executed** (CLAUDE.md Rule 15). Each language pack declares two pure command builders — `syntaxCheck` (the cheap "does it compile/parse" check) and `affectedTests` (run the tests the change reaches). One runner owns the load-bearing policy in a single place: fail-CLOSED on a real failure (a non-zero exit blocks), fail-OPEN on infrastructure (a missing toolchain or a timeout is a skip, never a block — a slow or un-installed toolchain is not broken code; CI is the backstop). A pack never shells out itself. - **All 8 packs, verified against real toolchains**, at the affected granularity each ecosystem natively supports: | Pack | Compile | Affected-test granularity | | --- | --- | --- | | TypeScript / JavaScript | `tsc --noEmit` (or the project's typecheck script) | per-file (`vitest related` / `jest --findRelatedTests`) | | Python | `py_compile` | per changed test file (pytest) | | Go | `go build ./...` | changed package(s) | | Rust | `cargo check` | changed crate(s) (`-p`) | | C# / .NET | `dotnet build` | changed test project (`dotnet test `) | | Java | Maven `test-compile` / Gradle `testClasses` | changed build module (Maven `-pl -am`, Gradle `:mod:test`) | | Kotlin | Maven `test-compile` / Gradle `testClasses` | changed build module | | Ruby | `ruby -c` per changed file | changed spec/test file (RSpec / minitest) | A change whose dependents live in another package/module/crate is caught at full/CI scope, not the fast affected surface. The two JVM packs share one `jvm-build.ts` provider (Rule 2). - **Three surfaces, one adaptive resolver.** The loop Stop-gate runs the floor by default (an agent must not stop on broken code), scoped to what the loop introduced via a testmon-style entry snapshot, so a pre-existing failure never blocks. The pre-push and CI surfaces are adaptive: when the repo already runs its tests in its own CI, the floor defaults to opt-in there; when no test-CI is detected it runs by default; when a CI exists but its test step is opaque it fails toward on. Precedence: an explicit flag, then a `DXKIT_FLOOR_` env, then `.dxkit/policy.json` `correctness.surfaces.`, then the adaptive default. - **`vyuh-dxkit floor check [--surface pre-push|ci] [--base ] [--correctness | --no-correctness]`** — the entry point the pre-push hook and CI workflow call. It is baseline-independent, so a brand-new repo with no baseline still gets push-time liveness. The installed pre-push hook runs it before the baseline check; the CI guardrail workflow runs it (full scope) after the finding gate. ### Changed - `LanguageSupport.correctness` is now a required field: the capability shipped optional and tightened once all eight packs declared it, so a new pack that omits it fails to compile (not just at test time). The `new-lang` scaffold wires a dormant provider so a fresh pack still compiles, with TODOs for the real commands. ## [2.22.0] - 2026-07-02 ### Added — the flow feature becomes agent-operable (setup, diagnose, publish, repair) The UI→API integration gate now has the surfaces an agent (or a person) needs to configure, diagnose, and repair it — folded into the commands you already run so the CLI stays small. - **Setup folds into `init`.** There is no standalone `flow init`. When `init` detects a UI→API surface (client calls and/or server routes) it offers the integration gate and asks for the posture — `warn` (default), `block`, or `off` — with a one-line description of each; a repo with no such surface stays silent. `--flow` forces it on with `warn`; `--no-flow` skips it. The dominant base-URL helper to strip and any multiple backend services are surfaced as confirm prompts. - **`.dxkit/workspace.json`** — a new top-level participants primitive naming the repos/services of a multi-repo system (path, optional git ref, base URLs). - **Diagnose folds into `doctor`.** There is no standalone `flow doctor`. When the repo has a UI→API surface, `doctor` reports a flow-contract diagnosis — the unresolved client calls (each with a reason and a suggested next step), the served routes nobody consumes, and how the served side is resolved — and `doctor --json` carries the whole `flow` object for an agent to read. - **`flow publish`** — the multi-repo handshake. Reads `workspace.json` and unions every participant's served routes into this repo's `served.json`, so a consumer repo gates its calls against a provider it does not co-locate. Participants are gathered from a local path, optionally pinned at a git ref; fail-open per participant. A content-hash on the snapshot lets a consumer detect drift. - **`dxkit-flow` skill** — the operator surface: setup, diagnose, fix (repair a net-new broken integration a guardrail flagged — never suppress it), and the cross-repo handshake. Thin orchestration over the CLI; `--flow` installs it. ## [2.21.2] - 2026-07-02 ### Fixed — the flow gate now runs in committed-baseline modes too The flow integration gate previously ran only in ref-based mode; in committed-full / committed-sanitized it skipped, so a repo pinned to `committed-full` (the default for private repos) got no flow gating unless it switched to `baseline.mode: ref-based`. The gate needs only a base *commit* to diff HEAD against, not a committed prior flow side — so it now uses the committed baseline's recorded anchor commit (`repo.commitSha`) as the base and gathers that side's flow model fresh from a worktree, exactly as ref-based mode gathers from its ref. Net-new broken integrations now block (or warn) the same way in every baseline mode. Fail-open, trigger-skip, no-served-truth self-skip, and per-finding allowlist suppression are all unchanged. When no base commit is resolvable at all (no ref and no baseline anchor), the gate skips as before. ## [2.21.1] - 2026-07-01 ### Fixed — the flow gate now honors the per-finding allowlist A net-new broken integration could not be accepted per-finding: the `flow-binding` allowlist category and identity existed, but the guardrail's flow pass ran outside the matcher-pair suppression path, so an allowlist entry never actually waived a flow block. The only escape hatch was the global `flow.mode`. Now an active `flow-binding` allowlist entry (matched by fingerprint, kind-guarded, expiry-respected) waives the block exactly like any other finding kind — the finding is still surfaced as "suppressed by allowlist" in the console, `--json`, and PR-comment markdown, but no longer fails the build. Expired entries do not waive; the finding re-blocks. ### Known limitation (tracked) The flow gate runs in **ref-based mode only**. In committed-baseline modes it skips (no committed prior flow side to diff against), so a repo pinned to `committed-full` is not yet flow-gated unless it sets `baseline.mode: ref-based`. A fix that recomputes the base flow model from the baseline's commit is planned. ## [2.21.0] - 2026-07-01 ### Added — the integration gate (`flow refresh` + guardrail flow pass) The Flow feature's third slice turns UI→API traceability into a guardrail: a PR that net-new breaks an integration — a frontend call to an endpoint no backend serves, or a backend route removal a consumer still binds to — fails the check, the same way a net-new secret or CVE does. - **Net-new broken-integration gate.** The guardrail check now runs an additive, fail-open flow pass over its ref-based `base↔HEAD` comparison. One algorithm covers both directions (dead frontend call / removed backend route), because both reduce to "a consumed binding whose `(method, path)` is not served." Pre-existing breakage is grandfathered — only what the diff *newly* breaks is surfaced. It never touches the existing finding matcher. - **Confidence-gated, false-positive-safe.** An exact, fully specified binding blocks; a placeholder-only path (`/{var}`) warns. The gate self-skips when it has no served-side truth to check against (a pure frontend with no committed contract), so it can never false-block a repo it can't fully see, and it fails open on any error. - **`vyuh-dxkit flow refresh`** writes the cross-repo contract snapshots (`.dxkit/flow/served.json` + `consumed.json`). A backend publishes what it serves; a frontend commits the counterpart's snapshot and gates against it — so a split-repo setup needs a cross-repo fetch only at refresh time, never on a developer's machine or in the per-check gate. A monorepo gates live against its own routes and needs no snapshot. - **Posture.** `.dxkit/policy.json:flow.mode` (`block` / `warn` / `off`, default `block`) governs the verdict; the loop preset overrides it (`security-only` warns, `full-debt` blocks) so an unattended loop can't wedge on a cross-repo false positive. Broken integrations render in the console, `--json`, and PR-comment markdown, and count toward the verdict banner. - Runs only in ref-based mode (committed mode has no base flow model to diff); a diff that touches no client call, route, or spec is skipped up front. Binding identity is line-independent and environment-independent, so a committed contract keeps matching when the check moves from a laptop to CI. ## [2.20.0] - 2026-07-01 ### Added — native flow map + blast radius (`flow`, `flow trace`) The Flow feature's second slice makes UI→API traceability a first-class part of the code graph, so a change's cross-boundary blast radius is a query, not a guess. - **Graph schema v2 — the endpoint overlay.** `graph.json` now carries `http-endpoint` nodes (one per served `(method, path)`) and `calls-endpoint` edges (a UI call site → the endpoint it hits) — the cross-boundary join the structural graph could not previously express. The overlay is purely additive: a v1 artifact migrates forward to an empty overlay, and every pre-flow query is untouched. The consuming call site's coordinates ride on the edge, so the map works even where graphify never ran (a pure-frontend repo) — the flow layer stays graphify-independent. - **`vyuh-dxkit flow`** writes the overlay and prints every endpoint with its consuming UI surfaces, plus the served-but-unconsumed set (a dead-route or cross-repo-consumer candidate — surfaced, not flagged as a defect). - **`vyuh-dxkit flow trace " "`** shows one endpoint's handler, every UI call site, and the change blast radius — direct consumers extended transitively through the structural call graph. - Both take `--json`. New pure queries (`endpointCallers`, `flowTrace`, `flowBlastRadius`, `flowMapQuery`) live in the canonical query module; the overlay is regenerated each run (never accumulated). ### Added — advisory file-size budget in the pre-commit slop check A warn-only, diff-scoped 500-LoC budget nudges when a changed source file sprawls, exempting the modules that are large by architectural mandate (language packs, the canonical query/registry modules, the CLI dispatch). It never blocks. ## [2.19.0] - 2026-06-30 ### Added — application-flow extraction (`flow extract`) `vyuh-dxkit flow extract` statically maps a frontend's HTTP calls to a backend's routes and writes the result as CSVs. It is the first slice of the Flow feature (UI to API traceability). - **AST-based, not regex.** A new in-process, graphify-independent tree-sitter layer (`src/ast/`, web-tree-sitter/wasm) parses source; a per-language `httpFlow` descriptor declares which constructs are HTTP clients and routes, so no framework literal is hardcoded in the analyzer. - **Both sides plus the join.** It extracts outbound client calls (fetch/axios/wrappers) and inbound routes (LoopBack/NestJS decorators, Express app/router), canonicalizes URLs and route paths to a common shape, and binds each call to the route it targets with a confidence score. - **OpenAPI when present.** An existing OpenAPI spec is consumed as the authoritative served side (`--specs`), unioned with static extraction (spec for authority, static for recall, since generated specs are often incomplete). - **Usage:** `flow extract [--frontend ] [--backend ] [--specs a.json,b.json] [--out ]`; writes `api_calls.csv`, `routes.csv`, `api_route_mapping.csv`. TypeScript and JavaScript in this release; more languages follow. ## [2.18.1] - 2026-06-23 ### Fixed — next-step hints now use a resolvable invocation Actionable "run this next" hints printed by the CLI (after `init`, in `doctor`, `tools`, the loop preflight, and elsewhere) referenced the bare `vyuh-dxkit` binary. After the common devDependency install (`npm init @vyuhlabs/dxkit`), that binary is not on the global PATH, so copy-pasting the hint failed with "command not found". Every actionable hint now routes through the canonical `npx vyuh-dxkit` invocation, which resolves a project-local devDependency or a global install, so the suggested command runs as-is. The demo's conversion CTA is fixed the same way: it shows `npm init @vyuhlabs/dxkit` to bootstrap a repo with no dxkit yet, and the `npx` form for the follow-up commands. ## [2.18.0] - 2026-06-23 ### Added — `demo loop-guardrail` converts into setup After the offline walkthrough, `vyuh-dxkit demo loop-guardrail` now shows a next step tailored to where it was run, and offers to wire the Stop-gate in for you: - A **context-aware call to action.** A git repo with no dxkit yet gets the wire-up sequence (`init --claude-loop` then `baseline create` then `loop doctor`); an already-set-up repo gets just `loop doctor`; a non-repo points you at your project. No more one-size-fits-all hint. - An **interactive opt-in**, offered only when it is safe to ask: in a git repo with no dxkit yet, on a TTY. It defaults to **no**, never prompts in a piped or CI run (so `npx ... | cat` and CI steps cannot block), and never touches a repo that already has dxkit. On yes it runs the additive, reversible `init --claude-loop` and stops there. You run `baseline create` yourself when you are ready to grandfather today's debt. A `demo` still never silently changes your repo: the opt-in is the only thing that writes, and only with explicit consent. ## [2.17.0] - 2026-06-23 ### Changed — the guardrail gate skips dependency *remediation* enrichment `guardrail check` no longer runs the Tier-2 dependency **remediation** step — the structured `upgradePlan` produced by `osv-scanner fix`. That step exists only to suggest "upgrade X to Y" in the **reports** (`vulnerabilities`, `bom`, `health`); the gate never reads it, and finding identity explicitly excludes it (`fingerprint.ts`), so it cannot affect a verdict or a baseline match. Why it matters: `osv-scanner fix` resolves the dependency tree by running the **package manager** (`npm install`) on the scanned code. On a vulnerability-laden manifest that dominated latency — a 2-file PR on a small sample app took **191s**, and OWASP NodeGoat **~250s**, both timing out the hosted PR-gate. Profiling isolated it precisely: `npm audit` and `osv-scanner scan` are ~1–5s; the `osv-scanner fix` remediation (with `npm install`) was the ~120s/side cost. The same NodeGoat PR now gates in **~14s**, and a clean dep-change PR in **~6s**, with an **identical verdict** (still blocks net-new critical/high dependency vulns). This also closes a real **security** concern for hosted/agent scenarios: running `npm install` on untrusted PR code can execute arbitrary install scripts. Skipping the remediation step removes that from every guardrail path — including the **loop Stop-gate**, which previously ran `npm install` on stops that touched JS/TS dependencies. - **Scope:** affects only the guardrail/gate gather path (the new `skipRemediation` flow, set by `runGuardrailCheck`). `health`, `vulnerabilities`, `bom`, and `baseline create` keep the full remediation enrichment. The remediation step is TS/JS-only (`osv-scanner fix`); other packs derive `upgradePlan` from their own audit output without installing. - npm-audit's free-text "upgrade X to Y" advice (no install) still rides along in the gate's repair hint. ### Added — `guardrail check --untrusted` for hosted gates on attacker-controlled source A hosted PR gate scans code it does not control. Dependency audits must never **execute** that code. `--untrusted` enforces that: the Python pack stops using `pip-audit .` (project mode), whose PEP 517 build backend can run arbitrary project code, and instead audits a `requirements.txt` (which never builds) or reports the dependency audit unavailable with a "run dxkit locally" message — rather than building untrusted source. npm-audit and osv-scanner `scan` (TS/Java/Kotlin/Go/Rust/Ruby) are already read-only, so they're unaffected. - **Opt-in and off by default.** Without `--untrusted`, `buildPipAuditCommand` is byte-identical to before, so reports, `baseline create`, CI, and the loop Stop-gate on your own (trusted) repo keep full project-mode coverage. - Trade-off: in `--untrusted` mode a Python project with only a `pyproject.toml`/`setup.py` (no `requirements.txt`) reports its dep audit unavailable instead of building — a deliberate safety-over-coverage choice for untrusted input, surfaced in the message. ## [2.16.0] - 2026-06-23 ### Changed — `--incremental` skips the dependency audit when no manifest changed `vyuh-dxkit guardrail check --incremental` now, in ref-based mode, **skips the OSV dependency-vulnerability audit entirely when the change touched no dependency manifest or lockfile.** This is the dominant latency win for the incremental path: profiling a 4-file documentation PR on this repo showed the dep audit accounting for ~100s of a ~119s scan (gitleaks was 0.4s; the rest of a secrets+deps scoped gather was sub-second) — and that audit ran twice (base and head) over an unchanged dependency set, so it could not surface anything net-new. The same end-to-end check now completes in **~7s**. The skip is **sound and verdict-preserving in ref-based mode only**: a net-new dependency vulnerability requires a manifest/lockfile change, and ref-based audits both sides against the *same* advisory snapshot, so an unchanged dependency is identical on both sides and never net-new. It deliberately does **not** apply to committed modes, where the baseline is an older snapshot and a newly-disclosed CVE on an unchanged dependency genuinely *is* net-new and must still surface. When the change *does* touch a manifest, the audit runs as before and net-new critical/high dependency vulnerabilities block normally. - **Manifest patterns are now a pack-declared fact** (CLAUDE.md Rule 6): each language pack's `depVulns` capability declares a **required** `manifestPatterns` field (its manifests + lockfiles). The skip consults the active packs' union via `allDependencyManifestPatterns` / `changedFilesTouchDependencyManifest`, so adding a language auto-extends the skip's awareness. A pack that adds dependency auditing but omits the patterns fails to compile **and** fails `test/languages-contract.test.ts`; `test/recipe-playbook.test.ts` proves the union stays pack-driven via a synthetic pack. Without `--incremental`, behavior is byte-identical to 2.15. `health`, `vulnerabilities`, and `committed-full`/`committed-sanitized` guardrail checks are unaffected. ## [2.15.0] - 2026-06-22 ### Fixed — ref-based guardrail no longer false-blocks on `secret-hmac` In ref-based mode (the default for public repos), dxkit mints a locator-less `secret-hmac` companion alongside each located `secret` for cross-file relocation matching. On a fresh or shallow checkout the two sides of the diff can derive different salts, so the companions never match and read as net-new — a **false block**, even though the located `secret` twins match correctly. `secret-hmac` now joins `duplication` and `test-gap` in the set of kinds excluded from the ref-based diff (they can't be gathered comparably across a detached worktree). The located `secret` kind still gates net-new credentials; only the redundant companion is dropped. **Committed modes are unaffected.** ### Added — opt-in `--incremental` for `guardrail check` `vyuh-dxkit guardrail check --incremental` scopes the gather to the analyzers the active policy can actually block on (reusing the loop Stop-gate's `scopeForPolicy`) and, in ref-based mode, scopes semgrep to the changed files on both sides. Same verdict, far less work — the check scales with PR size rather than repo size. **Opt-in and verdict-preserving:** without the flag, behavior is byte-identical to 2.14; it falls back to a full scan whenever the changed set can't be computed completely. The CLI flag exposes what the loop Stop-gate already did internally, so a ref-based CI guardrail (or a hosted PR-gate) can run the fast path too. ### Changed — positioning: two pillars (context + gate) - README hero, package description, and `--help` tagline now lead with both pillars: **"a deterministic stop condition and code-graph context layer for AI coding agents."** The README opening + "What dxkit does" foreground the code graph (callers, callees, blast radius) the agent uses *while making a change*, then the deterministic stop-gate that blocks net-new regressions *before it exits* — so the graph is no longer undersold as a footnote. ## [2.14.0] - 2026-06-22 ### Changed — the loop Stop-gate gathers far less work per stop After 2.13.3 made the gate cache-aware, two further optimizations cut what an unattended loop's `security-only` Stop-gate actually scans on every stop. Both are **opt-in** (only the loop Stop-gate enables them) and **verdict- preserving** — CI `baseline check`, `createBaseline`, and the `health` report are byte-identical and still render every warning. End-to-end on a 3,748-file repo with a realistic 2-file loop diff (separate processes), the security-only gather dropped from **42.8s → 11.3s (74%)** while still blocking the same net-new finding with identical blocking pairs. - **Preset-scoped gather.** A guardrail can only block on the finding kinds its policy escalates, so a `security-only` posture no longer runs the analyzers that feed only non-blockable kinds (jscpd, lint, coverage, cloc, test-gaps, graphify, licenses). The scope is derived declaratively from the policy (`scopeForPolicy`): a `full-debt` posture, CI, and `health` all resolve to the full scan. A scoped result is partial by construction and never enters the shared `AnalysisResult` cache. (~42.8s → 24.0s.) - **Incremental file-scoped scanning.** The current side's semgrep now scans only the files that changed vs the baseline commit. Sound because semgrep is intraprocedural — a net-new code finding can only appear in a file the diff touched. The changed-file set is computed completely or falls back to a full scan on any uncertainty (base unreachable, not a git repo), so a scan can only ever over-cover, never under-cover. The ref/baseline side stays full. (~24.0s → 11.3s.) Both behaviors are confined to the loop Stop-gate; nothing about the CI guardrail or the standalone reports changes. ## [2.13.3] - 2026-06-22 ### Fixed — the loop Stop-gate no longer pays a full re-scan on every stop The Stop hook fires on **every** Claude Code stop, not only autonomous-loop turns, and it re-ran the full guardrail gather each time — including re-scanning an unchanged `origin/main` in ref-based mode. On a large repo that took long enough to surface as a Claude Code "Stop hook error" (a timeout), and it made interactive sessions in a loop-initialized repo slow. Two content-addressed caches and a timeout fix this, with no change to the gate's verdict: - **Tree-signature verdict cache.** When the working tree is byte-identical to the last gather (an interactive Q&A turn, or a re-stop after a block with no edit), the gate replays the previous verdict instead of re-gathering. The signature captures HEAD, the comparison base, every tracked change vs HEAD, and the contents of every untracked file, so a cache hit is only ever a genuinely identical tree — the cache can never skip a real net-new finding. Bypass with `DXKIT_LOOP_NO_CACHE=1`. - **Ref-side scan cache.** The `origin/main` (ref-based) gather is cached, keyed on `(ref commit, dxkit version, identity scheme, salt)`, so an unchanged ref is not re-scanned on every stop. Lives under the already- gitignored `.dxkit/cache/`. Bypass with `DXKIT_NO_REF_CACHE=1`. - **Generous Stop-hook timeout.** `init --claude-loop` now installs the Stop hook with a 600s timeout, so a cold first gather on a large repo finishes instead of being killed and reported as an error. ### Changed — the Stop-gate is now loop-scoped The Stop hook fires on every Claude Code stop, including interactive turns (when the agent stops to ask a question), so the gate ran on work where a human is already reviewing. The gate is for **unattended** loops, so it now no-ops instantly on a stop unless the run is unattended: - **Auto-detected, no config.** When Claude Code reports an unattended `permission_mode` on the hook payload (`bypassPermissions`, what `--dangerously-skip-permissions` / `--permission-mode bypassPermissions` resolve to — the canonical way to run a headless loop), the gate activates automatically. Interactive modes (`default` / `plan` / `acceptEdits`) never trigger it. - **Explicit override.** Because `permission_mode` is not guaranteed on every event, a loop that wants a hard gating guarantee sets `DXKIT_LOOP_ACTIVE=1` in the launching environment, or drops a `.dxkit/loop/active` sentinel file. - Absent all of these, the Stop hook is an instant no-op allow — interactive sessions are never slowed. The CI guardrail still gates the branch, so interactive work is not left unprotected. This, together with the caches above, is what makes the Stop-gate unobtrusive: interactive turns do nothing, and an active unattended loop only re-gathers when the tree actually changed. ### Changed — CI guardrail surfaces the block reason in the job log The CI guardrail workflow (`dxkit-guardrails.yml`) wrote its report to a PR comment but the job log showed only `exit 1`. It now prints the blocking findings into the log on failure — a collapsible group plus a GitHub error annotation — so a blocked PR is diagnosable from the Actions run itself, not only the comment. ## [2.13.2] - 2026-06-22 ### Fixed - `init` and `update` now gitignore `.dxkit/loop/`, so the loop pack's runtime output (the ledger and the last-guardrail snapshot) is no longer committed by repos that use the Stop-gate. Latent since the loop pack shipped in 2.13.0. ### Changed - `loop doctor` now validates the **actual** registered Stop hook command. It understands both the `npx vyuh-dxkit` / installed-binary form (the binary must resolve) and a local-build `node