admin/claw

Files

木炎 956f0c2b68 feat: add generated scene skill platform hardening

2026-04-21 23:19:06 +08:00

7.5 KiB

Raw Blame History

Scene Skill 102 Mock Runtime Harness Implementation Plan

Date: 2026-04-20 Status: Draft Upstream Design: docs/superpowers/specs/2026-04-20-scene-skill-102-mock-runtime-harness-implementation-design.md Input Matrix: tests/fixtures/generated_scene/scene_skill_102_mock_runtime_validation_matrix_2026-04-20.json

Plan Intent

Implement and execute bounded mock runtime harnesses for representative generated scene skills.

This plan validates generated script control flow under fake dependencies. It does not validate production access, real data correctness, or browser-integrated host behavior.

Fixed Inputs

examples/scene_skill_102_final_materialization_2026-04-19/skills
tests/fixtures/generated_scene/scene_skill_102_static_validation_2026-04-20.json
tests/fixtures/generated_scene/scene_skill_102_dispatch_dry_run_validation_2026-04-20.json
tests/fixtures/generated_scene/scene_skill_102_mock_runtime_validation_matrix_2026-04-20.json
tests/fixtures/generated_scene/scene_skill_102_pseudoprod_readiness_2026-04-20.json

Planned Outputs

tests/fixtures/generated_scene/scene_skill_102_mock_runtime_harness_results_2026-04-20.json
docs/superpowers/reports/2026-04-20-scene-skill-102-mock-runtime-harness-report.md

Allowed Files

new mock harness files under tests/ or tests/fixtures/generated_scene/
tests/fixtures/generated_scene/scene_skill_102_mock_runtime_harness_results_2026-04-20.json
docs/superpowers/reports/2026-04-20-scene-skill-102-mock-runtime-harness-report.md

Forbidden Files

src/generated_scene/analyzer.rs
src/generated_scene/generator.rs
src/generated_scene/ir.rs
examples/scene_skill_102_final_materialization_2026-04-19/skills/**
tests/fixtures/generated_scene/scene_execution_board_2026-04-18.json

Workstreams

WS1 Mock harness foundation
WS2 Mainline fetch archetype harnesses
WS3 Small bucket harnesses
WS4 Boundary/runtime harnesses
WS5 Integrated result reporting

Phase 0: Freeze Mock Runtime Boundary

Objective

Freeze mock validation as a non-production, non-browser, non-network stage.

Tasks

Confirm static validation is 102 / 102.
Confirm deterministic dispatch dry-run is 102 / 102.
Confirm this plan does not mutate generated skill packages.
Confirm this plan does not require production credentials or network access.

Deliverables

baseline section in final mock runtime harness report

Acceptance Criteria

no production environment is accessed
no generated skill is modified
no official board status is changed

Phase 1: Mock Harness Foundation

Objective

Create the shared fake runtime primitives used by all representative harnesses.

Tasks

define fake fetch
define fake browser DOM surface
define fake artifact writer
define fake host bridge callback surface
define fake local-doc service surface
define common result schema:
- script-load-pass
- mock-runtime-pass
- mock-runtime-partial
- mock-runtime-fail

Deliverables

shared mock harness implementation

Acceptance Criteria

harness foundation does not call real network
harness foundation can run without browser or credentials
harness foundation can load a generated script from the final materialization root

Phase 2: Route 1 - Paginated Enrichment Harness

Objective

Validate the largest archetype bucket first.

Fixed Representatives

sweep-001-scene
sweep-002-scene
sweep-003-scene

Tasks

load each representative script
provide fake primary page response
provide fake enrichment response
verify expected request order where observable
verify artifact metadata or structured result is produced

Deliverables

paginated enrichment mock result records

Acceptance Criteria

each representative receives a mock-runtime-* status
no real request is sent
failures include named failure reason

Phase 3: Route 2 - G2 And G1-E Fetch Harnesses

Objective

Validate fetch-based mainline small buckets.

Fixed Representatives

multi_mode_request:

sweep-020-scene
sweep-023-scene
sweep-030-scene

single_request_enrichment:

sweep-013-scene
sweep-016-scene
sweep-068-scene

Tasks

run representative scripts with fake fetch
verify mode/request paths for multi-mode scenes
verify enrichment path for single-request enrichment scenes
record pass/fail reason

Deliverables

multi-mode request mock result records
single-request enrichment mock result records

Acceptance Criteria

each representative receives a mock-runtime-* status
real-sample or production execution is not started

Phase 4: Route 3 - Inventory And Page-State Harnesses

Objective

Validate the small specialized buckets.

Fixed Representatives

multi_endpoint_inventory:

sweep-084-scene
sweep-085-scene

page_state_eval:

sweep-066-scene
sweep-094-scene

Tasks

run multi-endpoint representatives with fake endpoint responses
run page-state representatives with fake DOM state
record pass/fail reason

Deliverables

inventory mock result records
page-state mock result records

Acceptance Criteria

each representative receives a mock-runtime-* status
no host browser is required

Phase 5: Route 4 - Local-Doc And Host-Bridge Harnesses

Objective

Validate boundary runtime families with fake local-doc and fake host-bridge surfaces.

Fixed Representatives

local_doc_pipeline:

sweep-012-scene
sweep-017-scene
sweep-019-scene

host_bridge_workflow:

sweep-007-scene
sweep-009-scene
sweep-010-scene

Tasks

run local-doc representatives with fake local document query and export responses
run host-bridge representatives with fake action and callback completion responses
classify boundary failures as mock harness gaps or script contract gaps

Deliverables

local-doc mock result records
host-bridge mock result records

Acceptance Criteria

no real host bridge is invoked
no local document service is invoked
failures are explicitly categorized

Phase 6: Integrated Mock Runtime Report

Objective

Publish representative execution results and propagated matrix interpretation.

Tasks

write scene_skill_102_mock_runtime_harness_results_2026-04-20.json
summarize representative pass/fail by archetype
summarize which non-representative scenes are covered only by representative inference
identify which archetypes still require direct mock expansion
recommend whether to proceed to pseudo-production batch planning

Deliverables

tests/fixtures/generated_scene/scene_skill_102_mock_runtime_harness_results_2026-04-20.json
docs/superpowers/reports/2026-04-20-scene-skill-102-mock-runtime-harness-report.md

Acceptance Criteria

report distinguishes representative execution from propagated coverage
report does not claim production execution
report does not update official board

Completion Criteria

This plan is complete when:

every fixed representative has a mock runtime result record
integrated mock runtime results JSON is published
mock runtime report is published
generated skill packages remain unchanged
no real browser or production environment was used

Stop Statement

Stop after publishing mock runtime harness results and report.

Do not start pseudo-production or real-environment validation under this plan.

7.5 KiB Raw Blame History

Scene Skill 102 Mock Runtime Harness Implementation Plan

Plan Intent

Fixed Inputs

Planned Outputs

Allowed Files

Forbidden Files

Workstreams

Phase 0: Freeze Mock Runtime Boundary

Objective

Tasks

Deliverables

Acceptance Criteria

Phase 1: Mock Harness Foundation

Objective

Tasks

Deliverables

Acceptance Criteria

Phase 2: Route 1 - Paginated Enrichment Harness

Objective

Fixed Representatives

Tasks

Deliverables

Acceptance Criteria

Phase 3: Route 2 - G2 And G1-E Fetch Harnesses

Objective

Fixed Representatives

Tasks

Deliverables

Acceptance Criteria

Phase 4: Route 3 - Inventory And Page-State Harnesses

Objective

Fixed Representatives

Tasks

Deliverables

Acceptance Criteria

Phase 5: Route 4 - Local-Doc And Host-Bridge Harnesses

Objective

Fixed Representatives

Tasks

Deliverables

Acceptance Criteria

Phase 6: Integrated Mock Runtime Report

Objective

Tasks

Deliverables

Acceptance Criteria

Completion Criteria

Stop Statement

7.5 KiB

Raw Blame History