admin/claw

Files

木炎 956f0c2b68 feat: add generated scene skill platform hardening

2026-04-21 23:19:06 +08:00

5.9 KiB

Raw Blame History

Generated Scene Source-First Runtime Semantics Hardening Plan

Date: 2026-04-20 Status: Draft Parent design: docs/superpowers/specs/2026-04-20-generated-scene-source-first-runtime-semantics-hardening-design.md

Plan Intent

Replace the weaker generated-skill-first analysis path with a stronger source-first roadmap:

scan all 102 original source scenes
detect scenes that can reproduce the same runtime-semantics defect classes exposed by sweep-030-scene
convert those findings into rule-level hardening routes
require full 102-scene rematerialization after rule changes
refresh the full validation stack after rematerialization

Why This Plan Exists

The project goal is not to describe already-surfaced gaps after they break in inner-network testing.

The goal is to prevent the same class of defect from reappearing across the remaining source scenes.

Therefore this plan is driven by original source-scene evidence, not generated skill artifacts alone.

Fixed Inputs

Original source root:
- D:/desk/智能体资料/全量业务场景/一平台场景
Current final generated skills:
- examples/scene_skill_102_final_materialization_2026-04-19/skills
Current 102-skill materialization manifest
Current invocation / parameter readiness assets
sweep-030-scene inner-network runtime findings

Scope Guardrails

Allowed:

scan all 102 original source-scene directories
compare source evidence against current generated skills
produce risk ledgers, reports, and downstream bounded plans

Forbidden in this parent plan:

no implementation changes in src/
no skill manifest edits
no rematerialization execution yet
no validation reruns yet
no inner-network patching as a substitute for source-first analysis

Workstreams

WS1 Source Evidence Scan
WS2 Runtime-Semantics Risk Ledger
WS3 Rule Hardening Route Design
WS4 Full Rematerialization and Validation Refresh Planning

Phase 0: Freeze Parent Scope

Objective

Make this the new parent roadmap for generated-scene runtime semantics hardening.

Tasks

freeze the five gap classes
freeze the source-first principle
freeze rematerialization as a required downstream step

Acceptance

future work must start from source-scene evidence
future fixes must be rule-level before scene-level

Phase 1: Full 102 Source Cross-Scan

Objective

Systematically scan the original 102 source scenes for high-signal evidence related to the five runtime-semantics gap classes.

Required scan targets

dictionary / enum / tree files
default parameter logic
request payload field names
runtime URL candidates
operator-facing wording and alias sources

Tasks

map each scene id to its original source directory
run a bounded evidence scan over all 102 source directories
tag source-side evidence flags per scene

Deliverables

source evidence scan JSON
source evidence scan report

Acceptance

all 102 scenes have source evidence flags
sweep-030-scene is validated as anchor evidence

Phase 2: Build the Source-First Runtime Semantics Ledger

Objective

Merge source-side evidence with generated-skill evidence into a full runtime-semantics risk ledger.

Tasks

compare source evidence with generated manifests and references
assign gap classes per scene
assign risk level per scene
distinguish:
- generator-level rule gap
- runtime-only residual

Deliverables

generated_scene_source_first_runtime_semantics_ledger_2026-04-20.json
source-first runtime semantics report

Acceptance

all 102 scenes are represented
each scene has gaps, riskLevel, and recommendedFixRoutes

Phase 3: Convert Ledger into Rule-Hardening Routes

Objective

Turn the source-first ledger into bounded implementation routes that modify reusable generation rules rather than scene-specific patches.

Candidate hardening routes

alias generation hardening
embedded dictionary extraction hardening
parameter default semantics recovery hardening
resolver-to-request mapping hardening
runtime URL classification hardening

Tasks

count scenes affected by each route
prioritize routes by coverage gain and reuse
define bounded implementation slices for the top routes

Deliverables

child-plan sequence for runtime semantics hardening
bounded route plans for top reusable fixes

Acceptance

no route is scene-name hardcoded
route priority is based on 102-scene reuse, not anecdotal debugging order

Phase 4: Require Full 102 Rematerialization

Objective

Ensure that hardened rules are propagated into the final generated skill inventory.

Tasks

define full 102 rematerialization as mandatory after route implementation
define materialization outputs that must be refreshed
define how canonical final skill bundle is replaced

Deliverables

full rematerialization refresh plan

Acceptance

no runtime-semantics hardening route may be considered complete without rematerialization

Phase 5: Require Validation Refresh

Objective

Refresh downstream validation after rematerialization so improved rules are measured end-to-end.

Required refresh layers

deterministic invocation readiness
natural-language parameter readiness
static validation
direct mock execution
pseudo-production handoff refresh

Deliverables

validation refresh plan

Acceptance

the new final 102-skill bundle is revalidated before more inner-network testing

Immediate Next Output

This parent plan should immediately lead to a new bounded child plan:

2026-04-20-generated-scene-source-evidence-cross-scan-plan.md

That child plan should perform the actual source cross-scan over the 102 original scenes.

Stop Statement

Stop after publishing this parent plan and its design.

Do not execute the source cross-scan or implementation inside this plan.

5.9 KiB Raw Blame History

Generated Scene Source-First Runtime Semantics Hardening Plan

Plan Intent

Why This Plan Exists

Fixed Inputs

Scope Guardrails

Workstreams

Phase 0: Freeze Parent Scope

Objective

Tasks

Acceptance

Phase 1: Full 102 Source Cross-Scan

Objective

Required scan targets

Tasks

Deliverables

Acceptance

Phase 2: Build the Source-First Runtime Semantics Ledger

Objective

Tasks

Deliverables

Acceptance

Phase 3: Convert Ledger into Rule-Hardening Routes

Objective

Candidate hardening routes

Tasks

Deliverables

Acceptance

Phase 4: Require Full 102 Rematerialization

Objective

Tasks

Deliverables

Acceptance

Phase 5: Require Validation Refresh

Objective

Required refresh layers

Deliverables

Acceptance

Immediate Next Output

Stop Statement

5.9 KiB

Raw Blame History