Deterministic Keyword Scoring Refinement Plan
Date: 2026-04-20
Design: 2026-04-20-deterministic-keyword-scoring-refinement-design.md
Plan Intent
Close the 9 deterministic dispatch ambiguity gaps by bounded manifest keyword refinement and dry-run verification.
Fixed Inputs
- tests/fixtures/generated_scene/scene_skill_102_deterministic_invocation_readiness_2026-04-20.json
- examples/scene_skill_102_final_materialization_2026-04-19/scene_skill_102_index.json
- examples/scene_skill_102_final_materialization_2026-04-19/skills/*/scene.toml
Allowed Files
- examples/scene_skill_102_final_materialization_2026-04-19/skills/*/scene.toml
- tests/fixtures/generated_scene/deterministic_keyword_scoring_refinement_2026-04-20.json
- tests/fixtures/generated_scene/scene_skill_102_deterministic_invocation_readiness_after_keyword_refinement_2026-04-20.json
- docs/superpowers/reports/2026-04-20-deterministic-keyword-scoring-refinement-report.md
Forbidden Files
- src/compat/scene_platform/dispatch.rs
- src/compat/scene_platform/resolvers.rs
- src/generated_scene/analyzer.rs
- src/generated_scene/generator.rs
- generated scripts/*
- tests/fixtures/generated_scene/scene_execution_board_2026-04-18.json
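The file boundaries above could be checked mechanically before any change is committed. The following is a minimal sketch, assuming the changed paths are available as a list of repository-relative strings; `out_of_scope` is a hypothetical helper, not part of the repository. Generated scripts are not listed in `FORBIDDEN` because their exact location is not stated here; they are rejected anyway because they match no allowed pattern.

```python
from fnmatch import fnmatch

# Patterns copied from the Allowed Files list in this plan.
ALLOWED = [
    "examples/scene_skill_102_final_materialization_2026-04-19/skills/*/scene.toml",
    "tests/fixtures/generated_scene/deterministic_keyword_scoring_refinement_2026-04-20.json",
    "tests/fixtures/generated_scene/scene_skill_102_deterministic_invocation_readiness_after_keyword_refinement_2026-04-20.json",
    "docs/superpowers/reports/2026-04-20-deterministic-keyword-scoring-refinement-report.md",
]

# Unambiguous entries copied from the Forbidden Files list in this plan.
FORBIDDEN = [
    "src/compat/scene_platform/dispatch.rs",
    "src/compat/scene_platform/resolvers.rs",
    "src/generated_scene/analyzer.rs",
    "src/generated_scene/generator.rs",
    "tests/fixtures/generated_scene/scene_execution_board_2026-04-18.json",
]


def out_of_scope(changed_paths):
    """Return the changed paths that are forbidden or not explicitly allowed."""
    violations = []
    for path in changed_paths:
        forbidden = any(fnmatch(path, pattern) for pattern in FORBIDDEN)
        allowed = any(fnmatch(path, pattern) for pattern in ALLOWED)
        if forbidden or not allowed:
            violations.append(path)
    return violations
```

Note that `fnmatch` lets `*` match across `/`, so the `skills/*/scene.toml` pattern would also accept nested directories; a stricter check may be preferable in practice.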
Phase 0: Freeze Gap Set
Tasks
- Load readiness gaps from the parent readiness asset.
- Confirm the fixed gap set is exactly 9 ambiguous dispatch entries.
Acceptance Criteria
- No additional gap categories are pulled into scope.
- sweep-012-scene remains excluded.
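The Phase 0 freeze can be sketched as a small guard that fails loudly if the gap set drifts. This is an illustration only: the entry keys `scene` and `status` and the status value `"ambiguous"` are assumptions about the readiness asset's schema, which this plan does not define.

```python
EXPECTED_GAP_COUNT = 9
EXCLUDED_SCENES = {"sweep-012-scene"}


def freeze_gap_set(entries):
    """Return the sorted names of ambiguous scenes, or raise if the count is not 9.

    `entries` is a list of dicts with the hypothetical keys 'scene' and 'status'.
    """
    gaps = sorted(
        entry["scene"]
        for entry in entries
        if entry["status"] == "ambiguous" and entry["scene"] not in EXCLUDED_SCENES
    )
    if len(gaps) != EXPECTED_GAP_COUNT:
        raise ValueError(
            f"expected {EXPECTED_GAP_COUNT} ambiguous gaps, found {len(gaps)}"
        )
    return gaps
```

Raising instead of returning a partial list keeps later phases from running against a gap set that differs from the one this plan was written for.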
Phase 1: Refine Manifest Keywords
Tasks
- For each fixed gap, identify its direct collision partner.
- Narrow include keywords to distinctive full phrases.
- Remove broad standalone collision tokens where they create ties.
- Add explicit exclude keywords only when a pair is mutually exclusive.
Acceptance Criteria
- The fixed 9 scenes retain non-empty include keywords.
- No generated script is changed.
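One way to picture the Phase 1 narrowing is a helper that drops include keywords shared verbatim with the collision partner while never leaving the list empty. This is a sketch under assumptions: `refine_include` is a hypothetical name, the keyword values in the usage example are invented for illustration, and real refinement would still need manual review of each pair.

```python
def refine_include(include, partner_include):
    """Drop keywords that also appear in the partner's include list.

    Comparison is case-insensitive. If every keyword collides, the original
    list is returned unchanged so the include set is never empty.
    """
    partner = {keyword.lower() for keyword in partner_include}
    narrowed = [keyword for keyword in include if keyword.lower() not in partner]
    return narrowed if narrowed else list(include)
```

For example, `refine_include(["monthly sales report", "report"], ["weekly sales report", "report"])` keeps only the distinctive full phrase `"monthly sales report"`.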
Phase 2: Dispatch Dry-Run Verification
Tasks
- Re-run dispatch dry-run for all 101 complete packages.
- Verify the fixed 9 gaps uniquely select their expected scene by full-name sample.
- Check that no previously-ready scene regresses into ambiguity or no-match.
Acceptance Criteria
- dispatchReady = 101, or all residual gaps are explicitly justified.
- ambiguous = 0, unless escalated to a separate runtime scoring plan.
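The dry-run classification can be illustrated with a toy scorer. The actual scoring lives in the runtime dispatch code, which this plan does not show or modify; the sketch below assumes a simple rule (one point per include keyword found as a substring, zero if any exclude keyword is present) purely to show how ready, ambiguous, and no-match outcomes are distinguished. The function names and manifest shape are hypothetical.

```python
def score(sample, include, exclude=()):
    """Count include keywords found in the sample; any exclude hit scores 0."""
    text = sample.lower()
    if any(keyword.lower() in text for keyword in exclude):
        return 0
    return sum(1 for keyword in include if keyword.lower() in text)


def classify(sample, manifests):
    """Return (status, winners) for one sample.

    `manifests` maps scene name -> {'include': [...], 'exclude': [...]}.
    Status is 'ready' for a unique top score, 'ambiguous' for a tie,
    and 'no-match' when no scene scores above zero.
    """
    scores = {
        name: score(sample, manifest["include"], manifest.get("exclude", ()))
        for name, manifest in manifests.items()
    }
    best = max(scores.values(), default=0)
    if best == 0:
        return ("no-match", [])
    winners = sorted(name for name, value in scores.items() if value == best)
    return ("ready" if len(winners) == 1 else "ambiguous", winners)
```

Running `classify` over one full-name sample per package and tallying the statuses would yield the `dispatchReady` and `ambiguous` counts named in the acceptance criteria.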
Phase 3: Publish Report
Tasks
- Publish refinement JSON.
- Publish post-refinement readiness JSON.
- Publish report.
Acceptance Criteria
- Report states before/after ready and ambiguous counts.
- Report states whether runtime scoring changes are needed.
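The before/after figures the report must state could be derived as follows. This is a sketch, assuming each readiness asset can be reduced to a mapping from scene name to a status string; the status values, the `summarize` name, and the rule that any remaining ambiguity implies runtime scoring changes are assumptions, not definitions from this plan.

```python
from collections import Counter


def summarize(before, after):
    """Compare two scene -> status mappings and return report figures."""
    before_counts = Counter(before.values())
    after_counts = Counter(after.values())
    # Scenes that were ready before refinement but are not ready afterwards.
    regressed = sorted(
        scene
        for scene, status in before.items()
        if status == "ready" and after.get(scene) != "ready"
    )
    return {
        "ready": {"before": before_counts["ready"], "after": after_counts["ready"]},
        "ambiguous": {
            "before": before_counts["ambiguous"],
            "after": after_counts["ambiguous"],
        },
        "regressed": regressed,
        # Assumed rule: leftover ambiguity means escalation to runtime scoring.
        "runtime_scoring_changes_needed": after_counts["ambiguous"] > 0,
    }
```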
Stop Statement
Stop after refinement assets and report are published. Do not start browser execution, runtime dispatch implementation, or sweep-012-scene recovery under this plan.