admin/claw

Files

木炎 956f0c2b68 feat: add generated scene skill platform hardening

2026-04-21 23:19:06 +08:00

3.6 KiB

Raw Permalink Blame History

Generated Scene Runtime Semantics Gap Analysis Plan

Status: Superseded by docs/superpowers/plans/2026-04-20-generated-scene-source-first-runtime-semantics-hardening-plan.md

Parent

Parent design: 2026-04-20-generated-scene-runtime-semantics-gap-analysis-design.md

Goal

Analyze the 102 final generated scene skills for runtime-semantics divergence, using sweep-030-scene as the anchor case and systematizing the five gap classes exposed during inner-network validation.

This plan is analysis-only.

Fixed Inputs

examples/scene_skill_102_final_materialization_2026-04-19/skills
tests/fixtures/generated_scene/scene_skill_102_deterministic_invocation_readiness_after_keyword_refinement_2026-04-20.json
tests/fixtures/generated_scene/scene_skill_102_natural_language_parameter_readiness_2026-04-20.json
tests/fixtures/generated_scene/scene_skill_102_parameter_dictionary_template_normalization_2026-04-20.json
Anchor source:
- D:/desk/智能体资料/全量业务场景/一平台场景/台区线损大数据-月_周累计线损率统计分析

Boundaries

Allowed:

Read skill manifests, reports, references, and selected source-scene evidence
Produce JSON inventory and report

Forbidden:

No edits in src/
No edits to generated skills
No rerun materialization
No execution board updates
No pseudo-production execution
No implementation patch for any scene

Phase 0: Freeze Gap Taxonomy

Tasks:

Fix the five runtime-semantics gap classes from the anchor case
Define high / medium / low risk buckets
Lock analysis outputs and stop rule

Acceptance:

The five gap classes are explicit and stable
The plan remains analysis-only

Phase 1: Anchor-Case Evidence Extraction

Tasks:

Read sweep-030-scene generated assets:
- scene.toml
- references/generation-report.json
- references/org-dictionary.json
- generated script
Read source-scene evidence from the original 台区线损大数据-月_周累计线损率统计分析
Record direct evidence for:
- alias gap
- dictionary recovery gap
- parameter default semantics gap
- resolver-to-request mapping gap
- runtime URL semantics gap

Acceptance:

sweep-030-scene has explicit evidence for each applicable gap class

Phase 2: 102-Scene Inventory Scan

Tasks:

Scan all 102 final skills
Extract:
- deterministic keywords
- params presence
- dictionary reference presence
- bootstrap target presence
- generation-report URL evidence
Tag scenes with likely gap classes using bounded heuristics

Acceptance:

Every scene gets a runtime-semantics record
Every scene has riskLevel and gaps

Phase 3: Family / Archetype Grouping

Tasks:

Group findings by archetype / family
Count gap incidence by bucket
Separate:
- generator-level fix candidates
- runtime-only residuals

Acceptance:

Summary counts exist per gap type and per archetype
Report can distinguish generator vs runtime responsibilities

Phase 4: Publish Analysis Assets

Deliverables:

tests/fixtures/generated_scene/generated_scene_runtime_semantics_gap_analysis_2026-04-20.json
docs/superpowers/reports/2026-04-20-generated-scene-runtime-semantics-gap-analysis-report.md

Acceptance:

All 102 scenes are represented
sweep-030-scene is explicitly called out as anchor evidence
The report recommends next implementation routes, but does not execute them

Stop Statement

Stop after publishing the JSON inventory and report.

3.6 KiB Raw Permalink Blame History

Generated Scene Runtime Semantics Gap Analysis Plan

Parent

Goal

Fixed Inputs

Boundaries

Phase 0: Freeze Gap Taxonomy

Phase 1: Anchor-Case Evidence Extraction

Phase 2: 102-Scene Inventory Scan

Phase 3: Family / Archetype Grouping

Phase 4: Publish Analysis Assets

Stop Statement

3.6 KiB

Raw Permalink Blame History