3.6 KiB
3.6 KiB
Generated Scene Runtime Semantics Gap Analysis Plan
Status: Superseded by
docs/superpowers/plans/2026-04-20-generated-scene-source-first-runtime-semantics-hardening-plan.md
Parent
Goal
Analyze the 102 final generated scene skills for runtime-semantics divergence, using sweep-030-scene as the anchor case and systematizing the five gap classes exposed during inner-network validation.
This plan is analysis-only.
Fixed Inputs
examples/scene_skill_102_final_materialization_2026-04-19/skillstests/fixtures/generated_scene/scene_skill_102_deterministic_invocation_readiness_after_keyword_refinement_2026-04-20.jsontests/fixtures/generated_scene/scene_skill_102_natural_language_parameter_readiness_2026-04-20.jsontests/fixtures/generated_scene/scene_skill_102_parameter_dictionary_template_normalization_2026-04-20.json- Anchor source:
D:/desk/智能体资料/全量业务场景/一平台场景/台区线损大数据-月_周累计线损率统计分析
Boundaries
Allowed:
- Read skill manifests, reports, references, and selected source-scene evidence
- Produce JSON inventory and report
Forbidden:
- No edits in
src/ - No edits to generated skills
- No rerun materialization
- No execution board updates
- No pseudo-production execution
- No implementation patch for any scene
Phase 0: Freeze Gap Taxonomy
Tasks:
- Fix the five runtime-semantics gap classes from the anchor case
- Define high / medium / low risk buckets
- Lock analysis outputs and stop rule
Acceptance:
- The five gap classes are explicit and stable
- The plan remains analysis-only
Phase 1: Anchor-Case Evidence Extraction
Tasks:
- Read
sweep-030-scenegenerated assets:scene.tomlreferences/generation-report.jsonreferences/org-dictionary.json- generated script
- Read source-scene evidence from the original
台区线损大数据-月_周累计线损率统计分析 - Record direct evidence for:
- alias gap
- dictionary recovery gap
- parameter default semantics gap
- resolver-to-request mapping gap
- runtime URL semantics gap
Acceptance:
sweep-030-scenehas explicit evidence for each applicable gap class
Phase 2: 102-Scene Inventory Scan
Tasks:
- Scan all 102 final skills
- Extract:
- deterministic keywords
- params presence
- dictionary reference presence
- bootstrap target presence
- generation-report URL evidence
- Tag scenes with likely gap classes using bounded heuristics
Acceptance:
- Every scene gets a runtime-semantics record
- Every scene has
riskLevelandgaps
Phase 3: Family / Archetype Grouping
Tasks:
- Group findings by archetype / family
- Count gap incidence by bucket
- Separate:
- generator-level fix candidates
- runtime-only residuals
Acceptance:
- Summary counts exist per gap type and per archetype
- Report can distinguish generator vs runtime responsibilities
Phase 4: Publish Analysis Assets
Deliverables:
tests/fixtures/generated_scene/generated_scene_runtime_semantics_gap_analysis_2026-04-20.jsondocs/superpowers/reports/2026-04-20-generated-scene-runtime-semantics-gap-analysis-report.md
Acceptance:
- All 102 scenes are represented
sweep-030-sceneis explicitly called out as anchor evidence- The report recommends next implementation routes, but does not execute them
Stop Statement
Stop after publishing the JSON inventory and report.