sgClaw Scene Skill Real Sample Validation Roadmap Plan
Status: Draft
Date: 2026-04-18
Author: Codex
Upstream Spec: 2026-04-18-scene-skill-real-sample-validation-roadmap-design.md
Plan Intent
This plan starts after the post-roadmap execution board and first-round validation layer are in place.
Its purpose is to:
- execute selected real samples for G2, G1-E, and G3
- use validation outcomes to decide the next bounded implementation scope
- avoid drifting back into fixture-first or asset-first work
Scope Guardrails
- Do not reopen completed repo-local baseline implementation for G1/G2/G3.
- Do not create new board-only assets unless they unblock current validation execution.
- Do not open G4/G5 implementation before formal entry decisions are documented.
- Do not pull G6/G7/G8 into the next build round without explicit validation pressure.
Workstreams
- WS1: Mainline Real Sample Execution
- WS2: Validation Result Triage
- WS3: Boundary Runtime Entry Decision
- WS4: Deferred Family Entry Decision
Phase 0: Execute Mainline Real Samples
Objective
Convert selected G2, G1-E, and G3 anchors into executed real-sample records.
Tasks
- Execute G2 anchor validation updates from the current mismatch baseline.
- Keep the G1-E real pass anchor as the current positive baseline.
- Execute the pending G3 real sample.
- Write all outcomes into the validation record layer.
Deliverables
- updated real-sample validation records
- updated mismatch taxonomy usage
- updated execution-board validation statuses
Acceptance Criteria
- G2, G1-E, and G3 each have executed real-sample records
- selected-not-yet-run no longer remains for current mainline anchors
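The Phase 0 acceptance criteria could be checked mechanically against the validation record layer. The sketch below is illustrative only: the `ValidationRecord` shape, the anchor identifiers, and every status value other than `selected-not-yet-run` are assumptions, not part of the existing record format.

```python
from dataclasses import dataclass

# Only "selected-not-yet-run" comes from this plan; other statuses are hypothetical.
SELECTED_NOT_YET_RUN = "selected-not-yet-run"
MAINLINE_FAMILIES = ("G2", "G1-E", "G3")


@dataclass(frozen=True)
class ValidationRecord:
    family: str  # mainline family, e.g. "G2"
    anchor: str  # hypothetical anchor identifier
    status: str  # "selected-not-yet-run" or an executed outcome


def phase0_accepted(records):
    """Return (ok, problems) for the two Phase 0 acceptance criteria."""
    problems = []
    for family in MAINLINE_FAMILIES:
        family_records = [r for r in records if r.family == family]
        executed = [r for r in family_records if r.status != SELECTED_NOT_YET_RUN]
        # Criterion 1: each mainline family has an executed real-sample record.
        if not executed:
            problems.append(f"{family}: no executed real-sample record")
        # Criterion 2: no current mainline anchor is still selected-not-yet-run.
        for r in family_records:
            if r.status == SELECTED_NOT_YET_RUN:
                problems.append(f"{family}/{r.anchor}: still {SELECTED_NOT_YET_RUN}")
    return (not problems, problems)
```

A check like this would report a pending G3 anchor as a problem until its real sample has been executed and recorded.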
Phase 1: Triage Results Into Scope Decisions
Objective
Use validation results, not fixture status, to choose the next bounded scope.
Tasks
- classify each mainline family result as stable, mismatch-driven, or blocked-by-runtime
- identify which problems are compiler-family gaps and which are runtime gaps
- define the next recommended scope from validation evidence
Deliverables
- validation triage report
- next-scope recommendation
Acceptance Criteria
- the next scope is justified by executed validation evidence
- repo-local success no longer acts as the sole decision signal
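The Phase 1 classification could be sketched as a small function over executed outcomes. The three triage labels come from this plan; the outcome names (`pass`, `mismatch`, `runtime-blocked`) and the precedence rule (runtime blocks outrank mismatches) are assumptions made for illustration.

```python
from enum import Enum

# Hypothetical per-sample outcome names; the plan does not define them.
KNOWN_OUTCOMES = {"pass", "mismatch", "runtime-blocked"}


class Triage(Enum):
    STABLE = "stable"
    MISMATCH_DRIVEN = "mismatch-driven"
    BLOCKED_BY_RUNTIME = "blocked-by-runtime"


def triage_family(outcomes):
    """Classify one mainline family from its executed real-sample outcomes."""
    if not outcomes:
        # Triage must rest on executed evidence, not on fixture status.
        raise ValueError("triage requires at least one executed outcome")
    unknown = set(outcomes) - KNOWN_OUTCOMES
    if unknown:
        raise ValueError(f"unknown outcomes: {sorted(unknown)}")
    # Assumed precedence: a runtime block outranks a mismatch.
    if "runtime-blocked" in outcomes:
        return Triage.BLOCKED_BY_RUNTIME
    if "mismatch" in outcomes:
        return Triage.MISMATCH_DRIVEN
    return Triage.STABLE
```

Rejecting an empty outcome list keeps repo-local success from standing in for validation evidence, which matches the second acceptance criterion.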
Phase 2: Boundary Runtime Entry Decision
Objective
Decide whether G6/G7/G8 should stay boundary-only or enter a runtime-focused roadmap.
Tasks
- compare boundary-family runtime gaps against executed validation pressure
- decide whether any boundary family should enter the next roadmap
- document non-entry decisions explicitly when scope stays closed
Deliverables
- boundary runtime decision note
- next-roadmap inclusion or exclusion list
Acceptance Criteria
- G6/G7/G8 entry decisions are explicit
- no boundary family enters by drift
Phase 3: Deferred Family Entry Decision
Objective
Decide whether G4/G5 should remain closed or enter a later roadmap.
Tasks
- compare deferred-family criteria against current validation pressure
- confirm whether G4/G5 remain deferred or degraded
- record the decision before any new implementation starts
Deliverables
- deferred family decision note
- updated next-roadmap scope boundary
Acceptance Criteria
- G4/G5 entry decisions are explicit
- deferred families do not enter implementation implicitly
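The "explicit decision before entry" rule shared by Phase 2 and Phase 3 could be enforced with a simple gate. This is a minimal sketch under stated assumptions: the decision values `enter` and `stay-closed` and the dictionary-based decision log are hypothetical, not an existing sgClaw structure.

```python
# Families that need a documented entry decision, per Phases 2 and 3.
GATED_FAMILIES = ("G4", "G5", "G6", "G7", "G8")
# Hypothetical decision values for illustration.
VALID_DECISIONS = ("enter", "stay-closed")


def undecided(decisions):
    """List gated families that still lack an explicit recorded decision."""
    return [f for f in GATED_FAMILIES if decisions.get(f) not in VALID_DECISIONS]


def may_implement(family, decisions):
    """Allow implementation only with an explicit 'enter' decision on record."""
    if family not in GATED_FAMILIES:
        raise ValueError(f"{family} is not a gated family")
    # A missing decision is treated the same as a closed scope.
    return decisions.get(family) == "enter"
```

Treating a missing entry as closed means a family cannot enter by drift; it enters only when someone records the decision.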
Completion Criteria
This plan is complete when:
- all selected mainline anchors have executed real-sample records
- the next implementation scope is selected from validation outcomes
- boundary and deferred family entry decisions are documented