7.5 KiB
Scene Skill 102 Mock Runtime Harness Implementation Plan
Date: 2026-04-20 Status: Draft Upstream Design:
docs/superpowers/specs/2026-04-20-scene-skill-102-mock-runtime-harness-implementation-design.mdInput Matrix:tests/fixtures/generated_scene/scene_skill_102_mock_runtime_validation_matrix_2026-04-20.json
Plan Intent
Implement and execute bounded mock runtime harnesses for representative generated scene skills.
This plan validates generated script control flow under fake dependencies. It does not validate production access, real data correctness, or browser-integrated host behavior.
Fixed Inputs
examples/scene_skill_102_final_materialization_2026-04-19/skillstests/fixtures/generated_scene/scene_skill_102_static_validation_2026-04-20.jsontests/fixtures/generated_scene/scene_skill_102_dispatch_dry_run_validation_2026-04-20.jsontests/fixtures/generated_scene/scene_skill_102_mock_runtime_validation_matrix_2026-04-20.jsontests/fixtures/generated_scene/scene_skill_102_pseudoprod_readiness_2026-04-20.json
Planned Outputs
tests/fixtures/generated_scene/scene_skill_102_mock_runtime_harness_results_2026-04-20.jsondocs/superpowers/reports/2026-04-20-scene-skill-102-mock-runtime-harness-report.md
Allowed Files
- new mock harness files under
tests/ortests/fixtures/generated_scene/ tests/fixtures/generated_scene/scene_skill_102_mock_runtime_harness_results_2026-04-20.jsondocs/superpowers/reports/2026-04-20-scene-skill-102-mock-runtime-harness-report.md
Forbidden Files
src/generated_scene/analyzer.rssrc/generated_scene/generator.rssrc/generated_scene/ir.rsexamples/scene_skill_102_final_materialization_2026-04-19/skills/**tests/fixtures/generated_scene/scene_execution_board_2026-04-18.json
Workstreams
WS1Mock harness foundationWS2Mainline fetch archetype harnessesWS3Small bucket harnessesWS4Boundary/runtime harnessesWS5Integrated result reporting
Phase 0: Freeze Mock Runtime Boundary
Objective
Freeze mock validation as a non-production, non-browser, non-network stage.
Tasks
- Confirm static validation is
102 / 102. - Confirm deterministic dispatch dry-run is
102 / 102. - Confirm this plan does not mutate generated skill packages.
- Confirm this plan does not require production credentials or network access.
Deliverables
- baseline section in final mock runtime harness report
Acceptance Criteria
- no production environment is accessed
- no generated skill is modified
- no official board status is changed
Phase 1: Mock Harness Foundation
Objective
Create the shared fake runtime primitives used by all representative harnesses.
Tasks
- define fake
fetch - define fake browser DOM surface
- define fake artifact writer
- define fake host bridge callback surface
- define fake local-doc service surface
- define common result schema:
script-load-passmock-runtime-passmock-runtime-partialmock-runtime-fail
Deliverables
- shared mock harness implementation
Acceptance Criteria
- harness foundation does not call real network
- harness foundation can run without browser or credentials
- harness foundation can load a generated script from the final materialization root
Phase 2: Route 1 - Paginated Enrichment Harness
Objective
Validate the largest archetype bucket first.
Fixed Representatives
sweep-001-scenesweep-002-scenesweep-003-scene
Tasks
- load each representative script
- provide fake primary page response
- provide fake enrichment response
- verify expected request order where observable
- verify artifact metadata or structured result is produced
Deliverables
- paginated enrichment mock result records
Acceptance Criteria
- each representative receives a
mock-runtime-*status - no real request is sent
- failures include named failure reason
Phase 3: Route 2 - G2 And G1-E Fetch Harnesses
Objective
Validate fetch-based mainline small buckets.
Fixed Representatives
multi_mode_request:
sweep-020-scenesweep-023-scenesweep-030-scene
single_request_enrichment:
sweep-013-scenesweep-016-scenesweep-068-scene
Tasks
- run representative scripts with fake fetch
- verify mode/request paths for multi-mode scenes
- verify enrichment path for single-request enrichment scenes
- record pass/fail reason
Deliverables
- multi-mode request mock result records
- single-request enrichment mock result records
Acceptance Criteria
- each representative receives a
mock-runtime-*status - real-sample or production execution is not started
Phase 4: Route 3 - Inventory And Page-State Harnesses
Objective
Validate the small specialized buckets.
Fixed Representatives
multi_endpoint_inventory:
sweep-084-scenesweep-085-scene
page_state_eval:
sweep-066-scenesweep-094-scene
Tasks
- run multi-endpoint representatives with fake endpoint responses
- run page-state representatives with fake DOM state
- record pass/fail reason
Deliverables
- inventory mock result records
- page-state mock result records
Acceptance Criteria
- each representative receives a
mock-runtime-*status - no host browser is required
Phase 5: Route 4 - Local-Doc And Host-Bridge Harnesses
Objective
Validate boundary runtime families with fake local-doc and fake host-bridge surfaces.
Fixed Representatives
local_doc_pipeline:
sweep-012-scenesweep-017-scenesweep-019-scene
host_bridge_workflow:
sweep-007-scenesweep-009-scenesweep-010-scene
Tasks
- run local-doc representatives with fake local document query and export responses
- run host-bridge representatives with fake action and callback completion responses
- classify boundary failures as mock harness gaps or script contract gaps
Deliverables
- local-doc mock result records
- host-bridge mock result records
Acceptance Criteria
- no real host bridge is invoked
- no local document service is invoked
- failures are explicitly categorized
Phase 6: Integrated Mock Runtime Report
Objective
Publish representative execution results and propagated matrix interpretation.
Tasks
- write
scene_skill_102_mock_runtime_harness_results_2026-04-20.json - summarize representative pass/fail by archetype
- summarize which non-representative scenes are covered only by representative inference
- identify which archetypes still require direct mock expansion
- recommend whether to proceed to pseudo-production batch planning
Deliverables
tests/fixtures/generated_scene/scene_skill_102_mock_runtime_harness_results_2026-04-20.jsondocs/superpowers/reports/2026-04-20-scene-skill-102-mock-runtime-harness-report.md
Acceptance Criteria
- report distinguishes representative execution from propagated coverage
- report does not claim production execution
- report does not update official board
Completion Criteria
This plan is complete when:
- every fixed representative has a mock runtime result record
- integrated mock runtime results JSON is published
- mock runtime report is published
- generated skill packages remain unchanged
- no real browser or production environment was used
Stop Statement
Stop after publishing mock runtime harness results and report.
Do not start pseudo-production or real-environment validation under this plan.