nixcfg

Author	SHA1	Message	Date
Harald Hoyer	3e515d54eb	feat(opencode): allow agents to read external Rust crate source @make, @test, @check often need to inspect dependency source (trait definitions, impl details, test patterns) to inform implementation or verify findings. Opencode applies a CWD check on tool access, so reads outside the worktree previously prompted for each access. - Add permission.read/grep/glob path allowlists for the three locations cargo deps live: ~/.cargo/registry/src/, ~/.cargo/git/checkouts/, and /nix/store/-vendor-/ for crane / buildRustPackage projects. - Document the discovery pattern in each agent: `cargo metadata --format-version 1` returns absolute paths via packages[].manifest_path. - Cross-reference the registry paths from the permission.bash allowlist comment so future readers see the bash inspection commands (rg/ls) intentionally accept paths outside CWD. - @check gets its first permission block (was tools-only before). Path-pattern syntax for read/grep/glob isn't fully documented; if opencode rejects it, fall back to `permission: { external_directory: allow }` at the project config level.	2026-05-08 13:24:30 +02:00
Harald Hoyer	4dc3cffba6	refactor(opencode): allow @test inside #[cfg(test)] mod blocks, drop file gate The previous design routed Rust unit tests to NOT_TESTABLE: Rust unit-only because @test was forbidden from touching src/, which forced @make to write both the production code and the inline #[cfg(test)] mod tests in one dispatch — losing TDD's RED→GREEN separation. But Rust module tests inside #[cfg(test)] mod tests { ... } are the canonical unit-testing idiom, not an edge case. @test's File Constraint now allows modifying src/*/.rs, but strictly inside #[cfg(test)] mod <name> { ... } blocks. Every line outside such a block stays read-only — adding pub, importing crates, declaring siblings, or any other production change is forbidden. Integration tests at tests/*/.rs continue to work as before. The Phase 6 post-step file gate (git status snapshot + comm -23 diff against test-pattern globs) is removed. With @test legitimately writing inside src/, a path-based gate proves nothing — production edits and cfg(test) edits live in the same files. The boundary is enforced by the prompt rule and Phase 8 reviewer scrutiny. Phase 5 test-file guidance updated to distinguish module vs integration tests for Rust, with stub-first TDD applying to both when symbols don't yet exist. The "Rust integration TDD: stub-first" section is renamed to "Rust stub-first TDD" and now covers module tests too. NOT_TESTABLE's "Rust unit-only" reason is replaced with "Missing testability seam" for cases where the production code needs a small change before tests can be authored.	2026-05-08 10:20:16 +02:00
Harald Hoyer	8373e32f34	fix(opencode): forbid RED-state references in test names A workflow run produced test names like move_enemies_following_path_ panics_on_todo, path_types_randomly_assigned, and spawn_enemies_ special_stage_panics_on_todo. The first and third leak the stub-first RED mechanic into the test name; once @make's body pass turns them GREEN, the name lies. The middle one is too vague to describe a contract. Adds a Test Naming subsection to @test's Test Philosophy stating the TDD survival principle — the name describes the contract under test, not the current state, and must remain accurate after the body pass. Bans ..._panics_on_todo / ..._fails_red / ..._stub_works / generic placeholders / vague verbs / implementation-detail leakage. Requires action + observable outcome and shows bad-to-good rewrites of the three names from this run.	2026-05-08 10:20:16 +02:00
Harald Hoyer	91ba5bd272	fix(opencode): close two false-green test loopholes and the orchestrator-as-implementer escape hatch A workflow run on a Bevy weaving feature exposed two compounding failures: 1. @test wrote 8 structural-only Rust tests that never invoked weave_enemies or trigger_weaving. Every test passed against the stub-first @make pre-pass because none of them called the stubbed symbols, so todo!() never fired. The body-pass committed code that "passed" the suite and silently broke trigger_weaving in special stages. 2. @check found the trigger_weaving regression at Phase 8 (final review) and the orchestrator decided to "fix them directly" rather than dispatching @make — taking the license offered by the existing review-loop wording. Test-quality fixes: - Phase 3 Test Design now requires each behavior to be expressed as an action + observable outcome. Structural facts ("enum has 3 variants", "struct has these fields") are explicitly disqualified. - Phase 6 stub-first flow gains a mandatory Panic-coverage check: after @test returns, the orchestrator re-runs the test command and rejects the output unless every test panics on todo!() (i.e. every test exercises at least one stubbed symbol). Any passing test is structural-only and routes back to @test. - Phase 6 decision table gets a "Stub-first run: tests pass with zero todo!() panics" row covering the same case. - @test's Test Philosophy gains an explicit Do-NOT-write list of structural-only patterns (variant_count, type ascriptions, Box::new(my_fn), struct-literal-only flows, all-pass-on-stubs) plus a positive rule: every test must call a function and assert on observable outcome, or return NOT_TESTABLE rather than pad the suite. Orchestrator boundary fix: - Phase 8 review loop replaces "fix them directly (no need to re-dispatch @make for small fixes)" with the principle "the orchestrator does not write production code; @make does". BLOCK, behavioral, correctness, and test-quality findings round-trip through @make. Only AST-preserving cosmetic edits (typos in comments, trailing newlines) may be applied directly. Compiler- detected issues (unused imports, dead code) go through @make.	2026-05-08 10:20:16 +02:00
Harald Hoyer	5b5c59aa84	feat(opencode): mandate stub-first @make pre-pass for Rust integration TDD Rust integration tests live in a separate test crate that imports from lib.rs, so any test referencing not-yet-existing public API can only RED at build time. The build error masks assertion diagnostics and makes the RED state opaque — no stack trace, no left/right values. For Rust tasks whose @test step writes an integration test against public API that does not yet exist, the orchestrator now dispatches a stub-first @make pass before @test runs: 1. @make adds the planned public API as todo!()-bodied stubs in lib.rs and any new src/<module>.rs. Signatures lifted verbatim from the Phase 5 task spec. Acceptance criterion is cargo check only — no test command runs. 2. @test writes the integration test, which now compiles and panics at todo!() with a stack trace — a clean MISSING_BEHAVIOR RED. 3. Phase 7 dispatches @make again to replace the todo!() bodies with real implementations. Two atomic commits per task: scaffold then implement. Phase 5's Rust test-path guidance now flags the two-dispatch requirement up front. test.md's Rust failure-classification hints recognize todo!() / unimplemented!() panics as MISSING_BEHAVIOR with a pointer to the workflow's stub-first section.	2026-05-07 05:42:16 +02:00
Harald Hoyer	d5d90d8b9f	fix(opencode): reject Rust src/tests/ paths as a wrong task spec A workflow run on a Bevy/Rust project produced the test-file path `src/tests/test_<feature>.rs`, which @test correctly flagged as contradictory: it isn't a valid Rust test location (would require declaring `mod tests;` in production source, which @test cannot do) yet the file-gate glob `/tests//*.rs` accidentally matched it. Phase 5 now gives language-aware Test File guidance: Python uses colocated or top-level `tests/`, Rust uses crate-level `tests/<feature>.rs`, and Rust unit-only tasks are routed to NOT_TESTABLE for @make to handle inline. Phase 6's file gate gains an explicit anti-pattern clause discarding any new file under `src/` even when the glob matches. @test's own File Constraint mirrors the anti-pattern so the agent rejects the bad path with BLOCKED before the orchestrator's gate even runs — defense in depth on both sides of the dispatch boundary.	2026-05-06 18:31:14 +02:00
Harald Hoyer	8fcf7e5d34	feat(opencode): make @make and @test polyglot (Python, Rust, nix devshell) Both agents previously hardcoded the Python/uv toolchain. They now detect the language from marker files (pyproject.toml, Cargo.toml, flake.nix) and run the appropriate test/lint/format/type-check commands for Python, Rust, or both. When a flake.nix devshell is present, every toolchain command is wrapped in `nix develop -c …`. @make's permission allowlist gains `cargo ` and `nix develop -c `, plus matching denies for cargo add/remove/install/publish. The Verification Tiers and Baseline Verification sections are rewritten as per-language bullets, and output/TDD-evidence examples are now language-neutral. Generalised the "no Kubernetes deployments" constraint to cover any deploy/publish. @test gains the same devshell + cargo allows (scoped to test, check, clippy, fmt only — no build/run/install). Its file constraint adds `tests/*/.rs` for Rust integration tests, with an explicit note that Rust unit tests stay with @make because they live inside production source files. Failure-classification hints add Rust compiler-error mappings, and the NOT_TESTABLE table gets a "Rust unit-only" row.	2026-05-06 17:09:34 +02:00
Harald Hoyer	37be2d9505	fix(opencode): remove agent models and temperature	2026-05-06 15:33:11 +02:00
Harald Hoyer	4ec1561af4	feat(opencode): add multi-agent workflow agents and commands Adds @check, @simplify, @test, @make, @pm subagents and the /workflow and /review slash commands from the autonomous multi-agent workflow gist by ppries. @pm is rewritten to manage issues in a local ./TODO.md file instead of Linear (file-only access, documented schema, structured JSON output). /workflow is adapted: TODO.md-based issue context, generic worktree paths (no hardcoded ~/repos/veo/sunstone), generic branch examples, and a Phase 1 guard that verifies origin is on GitHub before any work begins.	2026-05-06 14:56:42 +02:00

9 commits