R76.H2 Contract — `colibri-verification` SKILL.md behavioral contract

Task: R76.H2. Artifact under contract: .agents/skills/colibri-verification/SKILL.md (canon) + .claude/skills/colibri-verification/SKILL.md (mirror, byte-identical to canon after this task).

Purpose (unchanged)

The skill provides verification checklists for every stage of a Colibri execution session: pre-execution readiness, execution-time safety, post-execution completion, and per-phase acceptance criteria for GATHER / ANALYZE / PLAN / EXECUTE / VERIFY / CLOSE. It is referenced from every T3 executor chain and from colibri-pm / colibri-executor / colibri-tier1-chains.

The behavioral purpose is not modified by R76.H2. Only the tool names that the checklists call out are rewritten to the Phase 0 shipped 14-tool surface.

Allowed tool set (Phase 0, 14 tools)

Every tool citation on the live path of the rewritten SKILL.md must come from this set — or must drop to a narrative / file-read / git / npm reference:

Family	Tool	Purpose
β (Task Pipeline)	`task_create`	Create a task
β	`task_list`	List tasks
β	`task_get`	Read one task
β	`task_update`	Update fields (including `status`, which routes via internal state machine)
β	`task_next_actions`	Unblocked-task query
ζ (Decision Trail)	`audit_session_start`	Start an audit session (includes optional context)
ζ	`thought_record`	Record reasoning / plan / decision / reflection (single Phase 0 reasoning-capture tool)
ζ	`thought_record_list`	Query recorded thoughts
ζ	`audit_verify_chain`	Verify hash chain integrity
η (Proof Store)	`merkle_finalize`	Finalize the Merkle tree; seals the session
η	`merkle_root`	Retrieve the root hash (Phase 0 attestation primitive)
ε (Skill Registry)	`skill_list`	List registered skills
system	`server_ping`	Liveness check
system	`server_health`	Health/status check

Forbidden tool families (all struck or relabelled HERITAGE)

No live-path reference to any of the following is allowed outside an explicit > HERITAGE: block:

roadmap_* (entire family) — not in Phase 0; roadmap is a narrative doc
unified_* (entire family: unified_init, unified_set_project, unified_vitals, unified_backup, unified_metrics, etc.)
context_* (entire family: context_create, context_verify, context_ensure, context_diff)
memory_* (entire family: memory_pack, memory_bundle, memory_verify, memory_gc, memory_smart_store, memory_tier_migrate, memory_cross_session, memory_semantic_search, memory_map)
gsd_* (entire family: gsd_plan, gsd_verify_comprehensive, gsd_workflow_create, gsd_agent_pool_create, gsd_agent_spawn)
task_eisenhower, task_deps, task_depends_on, task_link_roadmap, task_create_batch, task_transition, task_delete — donor-era task-edge tools (Phase 0 β is exactly 5 β tools)
thought_plan, thought_decide — collapsed into thought_record in Phase 0
audit_session_set_context — merged into audit_session_start
audit_session_end — not in Phase 0; session seals at merkle_finalize
merkle_attest — not in Phase 0; attestation = merkle_root hash
analysis_* (incl. analysis_recommend, analysis_rag_index, analysis_rag_search, rag_gc)
ams_session_resume
server_info, server_shutdown — never shipped in Phase 0

Behavioral contract of the rewritten skill

Quick Reference table (line 13-25): identical content; no rewrite.
Pre-Execution Readiness Checklist:
- Roadmap Validation subsection: renamed to Task Context and rewritten to reference docs/guides/implementation/task-breakdown.md (file read, not a tool call), the colibri-docs-check skill for doc integrity, and task_get for task lookup.
- Task Validation: rewritten to use task_list + task_get only. unified_set_project row dropped (no project-switching in Phase 0). task_deps row rewritten to “dependencies noted in task description field”.
- Environment Validation: unified_init row dropped (auto-init on MCP handshake). unified_vitals replaced with server_health. Worktree + branch rows kept unchanged (they reference git, not MCP tools).
- Audit Session: audit_session_set_context row rewritten to “context arg passed inline to audit_session_start”.
Execution Safety Checklist:
- Dry-Run First: kept (it is a pattern, not a tool).
- Worktree Isolation: kept (git-only).
- Incremental Progress: context_create row dropped. Other rows (task_update, thought_record, error handling) kept.
- Rollback Preparation: unified_backup / context_create row dropped and replaced with “use git reset + worktree discard; seal prior state via thought_record before the destructive step”.
Post-Execution Completion Checklist:
- Colibri Writeback: kept (task_update).
- Thought Record: kept (thought_record).
- Memory Pack: entire subsection dropped. No memory surface in Phase 0.
- Merkle Proof: merkle_attest row replaced with “attestation hash retrieved via merkle_root”.
- Git Checkpoint: kept (git-only).
Phase Acceptance Criteria tables:
- GATHER: roadmap_list / roadmap_nodes rows dropped. context_create row replaced with thought_record. Other rows kept.
- ANALYZE: task_eisenhower, task_deps, roadmap_prerequisites_check, analysis_recommend, gsd_plan rows dropped. thought_plan and thought_decide rows collapsed into thought_record.
- PLAN: task_create_batch, task_link_roadmap, task_deps/task_depends_on, gsd_workflow_create, gsd_agent_pool_create rows cleaned. Only task_create and thought_record remain in PLAN table with narrative cells.
- EXECUTE: context_create row replaced with thought_record. Other rows kept.
- VERIFY: roadmap_validate, gsd_verify_comprehensive, context_verify rows dropped. Replaced with audit_verify_chain row and the npm run build && npm run lint && npm test gate as a narrative row (per CLAUDE.md §5).
- CLOSE: memory_pack row dropped. audit_session_end mention reworded. All rows otherwise kept and cite only the 14-tool set.

Verification Tools Quick Reference (JS example): rewritten to the CLAUDE.md §7 6-call chain:

audit_session_start(...)
// ... work ...
audit_verify_chain(session_id)
thought_record(session_id, thought_type: "reflection", ...)
merkle_finalize(session_id)
merkle_root(session_id)

With a comment noting the ordering rule (final thought_record MUST precede merkle_finalize).

Common Verification Failures table: Roadmap validation fails, Context verify fails, Memory pack empty rows dropped. Replaced with Phase 0-shaped rows: “Audit chain verify fails”, “thought_record ordering wrong”, “Merkle root missing”.
Integration with Tier-1 Chains section: kept — it only references skill names, not tools.
See Also section: kept. colibri-tier1-chains, colibri-pm, colibri-audit-proof all exist. data/tool-contracts/ reference replaced with docs/reference/mcp-tools-phase-0.md (the live Phase 0 tool contract doc).

Frontmatter contract

name: colibri-verification — unchanged.
description: — kept quoted (YAML safety per Wave H memory lesson on gray-matter; contains colons in Greek: η (eta) segment). Wording lightly tightened to drop donor tool hints but keep η + π Greek-letter labels.

Reality-stamp blockquote at line 9

Replaced with a cleaner R76.H2 stamp:

Phase 0 reality stamp (R76 — 2026-04-18). This checklist skill runs on the Phase 0 shipped 14-tool surface: task_create, task_list, task_get, task_update, task_next_actions, audit_session_start, thought_record, thought_record_list, audit_verify_chain, merkle_finalize, merkle_root, skill_list, server_ping, server_health. Donor-era tool families (roadmap_*, unified_*, context_*, memory_*, gsd_*, task_eisenhower/deps/link_roadmap, thought_plan/decide, audit_session_set_context/end, merkle_attest) are either deferred to later phases or collapsed into the 14-tool set. See ADR-004 R75 Wave H amendment and CLAUDE.md §7 for the canonical verification chain.

Mirror resync rule

After canon is clean, the mirror at .claude/skills/colibri-verification/SKILL.md is overwritten from canon byte-for-byte. This discharges one of the Wave H carry-over drift items for this one skill (other skills remain drifted and are out of scope for R76.H2).

Acceptance gates

rg -n "gsd_|unified_|roadmap_|memory_pack|context_|thought_plan|thought_decide|audit_session_set_context|merkle_attest|task_eisenhower|task_deps|task_link_roadmap|task_create_batch|task_depends_on|audit_session_end|task_transition|server_info|server_shutdown" .agents/skills/colibri-verification/SKILL.md returns zero matches.
diff .agents/skills/colibri-verification/SKILL.md .claude/skills/colibri-verification/SKILL.md is empty.
4 chain docs exist: audit / contract / packet / verification under docs/…/r76-h2-verification-rewrite-*.md.
npm run build && npm run lint && npm test all green.

Out-of-scope confirmations

No edit to any other skill under .agents/skills/ or .claude/skills/.
No rename or move.
No src/ touch.
No new tools invented.
No docs-sync to Obsidian vault.
No new frontmatter fields.
No new dependencies.

R76.H2 Contract — colibri-verification SKILL.md behavioral contract