Skip to content

feat(skills): add capability-tuning skill + refresh Claude/Codex capability facts#1

Merged
DocksDocks merged 13 commits into
mainfrom
claude/dreamy-dijkstra-xu8opp
Jun 10, 2026
Merged

feat(skills): add capability-tuning skill + refresh Claude/Codex capability facts#1
DocksDocks merged 13 commits into
mainfrom
claude/dreamy-dijkstra-xu8opp

Conversation

@DocksDocks

Copy link
Copy Markdown
Owner

Research-driven pass over the kit's prompt surfaces (3 deep-research agents:
Karpathy's method, Claude Code mid-2026 config, Codex mid-2026 config):

  • new productivity/capability-tuning skill: max-capability settings.json /
    config.toml templates for both runtimes, instruction-file budgets,
    cross-model phrasing rules, Karpathy context-engineering layer (16/16)
  • skills/AGENTS.md: Codex catalog truncation corrected (even truncation,
    2%-of-window-in-tokens budget; 8k chars is only the fallback) + new
    goals-over-step-lists rule for Fable 5 / Opus 4.8
  • codex-agents-builder.md: effort set gains 'none', Claude max->xhigh
    mapping, sonnet maps to mainline gpt-5.4 (codex line absorbed at 5.4);
    codex-facts.sh guard now pins 'none'
  • code-review: reject for missing evidence, never for low confidence —
    recall protection for literal-instruction Opus models
  • capability-tuning listed in plugin/marketplace descriptions + READMEs

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX

claude added 13 commits June 10, 2026 03:11
…bility facts

Research-driven pass over the kit's prompt surfaces (3 deep-research agents:
Karpathy's method, Claude Code mid-2026 config, Codex mid-2026 config):

- new productivity/capability-tuning skill: max-capability settings.json /
  config.toml templates for both runtimes, instruction-file budgets,
  cross-model phrasing rules, Karpathy context-engineering layer (16/16)
- skills/AGENTS.md: Codex catalog truncation corrected (even truncation,
  2%-of-window-in-tokens budget; 8k chars is only the fallback) + new
  goals-over-step-lists rule for Fable 5 / Opus 4.8
- codex-agents-builder.md: effort set gains 'none', Claude max->xhigh
  mapping, sonnet maps to mainline gpt-5.4 (codex line absorbed at 5.4);
  codex-facts.sh guard now pins 'none'
- code-review: reject for missing evidence, never for low confidence —
  recall protection for literal-instruction Opus models
- capability-tuning listed in plugin/marketplace descriptions + READMEs

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX
…discovery

Codex discovers .claude-plugin/plugin.json natively (DISCOVERABLE_PLUGIN_MANIFEST_PATHS,
re-verified 2026-06-10 against openai/codex HEAD) — the mirror's value is the
marketplace catalog, Codex-tailored interface + degradation surfacing, and
version lockstep, not basic discovery.

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX
Evidence-gated fixes from a full engineering-category audit:
- code-review + fix-workflow: retire stale '/security 3 parallel scanners' /
  '/refactor command' architecture refs (kit is sequential single-context);
  approval gates rephrased to the enforceable turn-ending form
- dep-vuln-workflow: drop fabricated tokio 1->2 migration row (no tokio 2.x
  exists), fix axum 0.8 + reqwest 0.12 break details, replace nonexistent
  'uv pip audit' with uv export | pip-audit; tag bare fences; add Not-for
  exclusion to description
- design-tokenization: Tailwind v4 DOES auto-detect sources (gitignored /
  out-of-root paths are what @source covers); kill dead v4-beta docs link
- react-component-patterns: useEffectEvent is stable since React 19.2 (was
  'experimental, do not use'); trim description 502->445 chars
- lint-no-suppressions: add BAD/GOOD fence, bare-suppression-scope gotchas,
  Rust #[expect] (1.81+); ESLint flat-config row (13 -> 16 score)
- solid refs: '# type: ignore' removed from a GOOD example, assert_never is
  Python 3.11+ not PEP 661, dyn-compatibility note for native async fn
  traits, once_cell::Lazy -> std LazyLock
- type-safety: #private is class-only syntax (factory closures are what's
  private), was claimed to work on plain objects
- security: remove constraint-contradicting MAY-run-concurrently clause
- human-docs-workflow: description 552 -> 488 chars

All engineering skills now 16/16 except vendored make-interfaces (10, body
frozen by policy).

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX
…I gate

Productivity skills (evidence-gated, real content only):
- plan-init + skill-agent-pipeline: descriptions trimmed under the 500-char
  CSO tier (15 -> 16 each)
- write-skill: third constraint promoting the updated+content-hash
  bookkeeping rule (14 -> 16)
- zoom-out: 7-module cap promoted to a constraint, situation -> output-form
  table, BAD/GOOD map example, fences tagged (9 -> 16)
- caveman: persistence rule promoted to constraint, drop/keep rules table,
  BAD/GOOD labels (8 -> 12; stays sub-16 by design — it's a brevity skill)
- plan-review agent: missing ## Output Format section (14 -> 15; both
  shipped agents now 15/15)

Validator hardening (no floor loosened):
- shellcheck -S warning is now a CI gate: ci.sh §3b (self-skips when not
  installed locally) + ci.yml guard job + scripts/AGENTS.md validator row
- 6 shellcheck findings fixed: cd||exit in ci.sh + idempotency test, unused
  loop var in release.sh, xargs -> sed path derivation in tree/guard.sh,
  2 documented SC2043 disables
- slop scorer strips fenced blocks + code spans first — quoting a banned
  word (ban lists, BAD examples) is not prose slop
- BSD date fallback for the freshness point (macOS parity)
- UTF-8 locale forced so description tiers count chars, not bytes
- dead extract_yaml_value removed from skills scorer
- agents scorer credits explicit full claude-* model IDs

Category totals: engineering 212 -> 218, productivity 189 -> 204,
agents 29 -> 30. ci.sh fully green including the new gate.

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX
…ersionless examples

The Codex marketplace policy values are documented now (installation:
AVAILABLE/NOT_AVAILABLE/INSTALLED_BY_DEFAULT; authentication:
ON_INSTALL/ON_USE) — drop the 'not documented yet' hedge. Worked examples
use X.Y.Z placeholders instead of a stale hardcoded 0.3.0 (no version
numbers in prose).

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX
…ng form

Loop tick 1/12: deep-read skill-maintenance (clean) + multi-tool-bridge —
its Step-3 split gate used the bypassable 'wait for confirmation' phrasing
the kit itself documents as ignored by literal-instruction models.

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX
…o-author-scripts pattern

Loop tick 2/12: context-tree's shipped body + 2 references named the docks
author script tree/guard.sh in the bare form (no scripts/ prefix) the
no-author-scripts guard couldn't see — broken the moment the skill runs in
a consumer repo. Replaced with the self-contained pair check; the guard
pattern now also catches the distinctive bare names (tree/guard.sh,
content-hash.sh, transform-guard.sh, no-author-scripts.sh, codex-facts.sh,
guard-spec.sh). plan-manager + plan-sidecar re-read: clean.

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX
Loop tick 3/12: deep-read scaffold + plan-init + all 9 skill-agent-pipeline
phase references — all era-accurate; the only fix is scaffold's gate, which
used the bypassable 'wait for confirmation' phrasing. Full shipped surface
(every SKILL.md + reference + agent body) has now been read this session.

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX
Loop tick 4/12: read the final un-read references (write-skill
data-preservation, lint pre-commit-hook, plan-init template + scaffold
spec-schema spot-checks) — all clean except a legacy .eslintrc.* mention
now framed as eslint.config.js-first. Hook payload shapes re-verified.

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX
@DocksDocks DocksDocks merged commit 84601ae into main Jun 10, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants