5.9 KiB
Generated Scene Source-First Runtime Semantics Hardening Plan
Date: 2026-04-20 Status: Draft Parent design:
docs/superpowers/specs/2026-04-20-generated-scene-source-first-runtime-semantics-hardening-design.md
Plan Intent
Replace the weaker generated-skill-first analysis path with a stronger source-first roadmap:
- scan all 102 original source scenes
- detect scenes that can reproduce the same runtime-semantics defect classes exposed by
sweep-030-scene - convert those findings into rule-level hardening routes
- require full 102-scene rematerialization after rule changes
- refresh the full validation stack after rematerialization
Why This Plan Exists
The project goal is not to describe already-surfaced gaps after they break in inner-network testing.
The goal is to prevent the same class of defect from reappearing across the remaining source scenes.
Therefore this plan is driven by original source-scene evidence, not generated skill artifacts alone.
Fixed Inputs
- Original source root:
D:/desk/智能体资料/全量业务场景/一平台场景
- Current final generated skills:
examples/scene_skill_102_final_materialization_2026-04-19/skills
- Current 102-skill materialization manifest
- Current invocation / parameter readiness assets
sweep-030-sceneinner-network runtime findings
Scope Guardrails
Allowed:
- scan all 102 original source-scene directories
- compare source evidence against current generated skills
- produce risk ledgers, reports, and downstream bounded plans
Forbidden in this parent plan:
- no implementation changes in
src/ - no skill manifest edits
- no rematerialization execution yet
- no validation reruns yet
- no inner-network patching as a substitute for source-first analysis
Workstreams
WS1Source Evidence ScanWS2Runtime-Semantics Risk LedgerWS3Rule Hardening Route DesignWS4Full Rematerialization and Validation Refresh Planning
Phase 0: Freeze Parent Scope
Objective
Make this the new parent roadmap for generated-scene runtime semantics hardening.
Tasks
- freeze the five gap classes
- freeze the source-first principle
- freeze rematerialization as a required downstream step
Acceptance
- future work must start from source-scene evidence
- future fixes must be rule-level before scene-level
Phase 1: Full 102 Source Cross-Scan
Objective
Systematically scan the original 102 source scenes for high-signal evidence related to the five runtime-semantics gap classes.
Required scan targets
- dictionary / enum / tree files
- default parameter logic
- request payload field names
- runtime URL candidates
- operator-facing wording and alias sources
Tasks
- map each scene id to its original source directory
- run a bounded evidence scan over all 102 source directories
- tag source-side evidence flags per scene
Deliverables
- source evidence scan JSON
- source evidence scan report
Acceptance
- all 102 scenes have source evidence flags
sweep-030-sceneis validated as anchor evidence
Phase 2: Build the Source-First Runtime Semantics Ledger
Objective
Merge source-side evidence with generated-skill evidence into a full runtime-semantics risk ledger.
Tasks
- compare source evidence with generated manifests and references
- assign gap classes per scene
- assign risk level per scene
- distinguish:
- generator-level rule gap
- runtime-only residual
Deliverables
generated_scene_source_first_runtime_semantics_ledger_2026-04-20.json- source-first runtime semantics report
Acceptance
- all 102 scenes are represented
- each scene has
gaps,riskLevel, andrecommendedFixRoutes
Phase 3: Convert Ledger into Rule-Hardening Routes
Objective
Turn the source-first ledger into bounded implementation routes that modify reusable generation rules rather than scene-specific patches.
Candidate hardening routes
- alias generation hardening
- embedded dictionary extraction hardening
- parameter default semantics recovery hardening
- resolver-to-request mapping hardening
- runtime URL classification hardening
Tasks
- count scenes affected by each route
- prioritize routes by coverage gain and reuse
- define bounded implementation slices for the top routes
Deliverables
- child-plan sequence for runtime semantics hardening
- bounded route plans for top reusable fixes
Acceptance
- no route is scene-name hardcoded
- route priority is based on 102-scene reuse, not anecdotal debugging order
Phase 4: Require Full 102 Rematerialization
Objective
Ensure that hardened rules are propagated into the final generated skill inventory.
Tasks
- define full 102 rematerialization as mandatory after route implementation
- define materialization outputs that must be refreshed
- define how canonical final skill bundle is replaced
Deliverables
- full rematerialization refresh plan
Acceptance
- no runtime-semantics hardening route may be considered complete without rematerialization
Phase 5: Require Validation Refresh
Objective
Refresh downstream validation after rematerialization so improved rules are measured end-to-end.
Required refresh layers
- deterministic invocation readiness
- natural-language parameter readiness
- static validation
- direct mock execution
- pseudo-production handoff refresh
Deliverables
- validation refresh plan
Acceptance
- the new final 102-skill bundle is revalidated before more inner-network testing
Immediate Next Output
This parent plan should immediately lead to a new bounded child plan:
2026-04-20-generated-scene-source-evidence-cross-scan-plan.md
That child plan should perform the actual source cross-scan over the 102 original scenes.
Stop Statement
Stop after publishing this parent plan and its design.
Do not execute the source cross-scan or implementation inside this plan.