feat(skills): add capability-tuning skill + refresh Claude/Codex capability facts by DocksDocks · Pull Request #1 · DocksDocks/docks

DocksDocks · 2026-06-10T12:43:31Z

Research-driven pass over the kit's prompt surfaces (3 deep-research agents:
Karpathy's method, Claude Code mid-2026 config, Codex mid-2026 config):

- new productivity/capability-tuning skill: max-capability settings.json /
  config.toml templates for both runtimes, instruction-file budgets,
  cross-model phrasing rules, Karpathy context-engineering layer (16/16)
- skills/AGENTS.md: Codex catalog truncation corrected (even truncation,
  2%-of-window-in-tokens budget; 8k chars is only the fallback) + new
  goals-over-step-lists rule for Fable 5 / Opus 4.8
- codex-agents-builder.md: effort set gains 'none', Claude max->xhigh
  mapping, sonnet maps to mainline gpt-5.4 (codex line absorbed at 5.4);
  codex-facts.sh guard now pins 'none'
- code-review: reject for missing evidence, never for low confidence
  (recall protection for literal-instruction Opus models)
- capability-tuning listed in plugin/marketplace descriptions + READMEs
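
The corrected Codex catalog budget rule can be sketched roughly as follows. This is an illustration only, not the kit's or Codex's code: the function names are hypothetical, and "even truncation" is read here as "every entry gets the same share", which is an assumption about what the PR text means. For simplicity the truncation helper measures its share in characters.

```python
from typing import List, Optional, Tuple

FALLBACK_CHAR_BUDGET = 8_000  # from the PR text: "8k chars is only the fallback"
WINDOW_PERCENT = 2            # from the PR text: "2%-of-window-in-tokens budget"


def catalog_budget(context_window_tokens: Optional[int]) -> Tuple[int, str]:
    """Return (budget, unit). Hypothetical helper, not a Codex API."""
    if context_window_tokens:
        # Integer arithmetic avoids float rounding on the 2% share.
        return context_window_tokens * WINDOW_PERCENT // 100, "tokens"
    # Window size unknown: fall back to the fixed character budget.
    return FALLBACK_CHAR_BUDGET, "chars"


def truncate_evenly(descriptions: List[str], total_chars: int) -> List[str]:
    """Assumed reading of 'even truncation': equal share per entry."""
    if not descriptions:
        return []
    share = total_chars // len(descriptions)
    return [d[:share] for d in descriptions]
```

For a 200,000-token window this gives a 4,000-token budget; the 8,000-character figure applies only when no window size is available.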

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX

…discovery

Codex discovers .claude-plugin/plugin.json natively (DISCOVERABLE_PLUGIN_MANIFEST_PATHS, re-verified 2026-06-10 against openai/codex HEAD): the mirror's value is the marketplace catalog, Codex-tailored interface + degradation surfacing, and version lockstep, not basic discovery.

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX

@source

Evidence-gated fixes from a full engineering-category audit:

- code-review + fix-workflow: retire stale '/security 3 parallel scanners' / '/refactor command' architecture refs (kit is sequential single-context); approval gates rephrased to the enforceable turn-ending form
- dep-vuln-workflow: drop fabricated tokio 1->2 migration row (no tokio 2.x exists), fix axum 0.8 + reqwest 0.12 break details, replace nonexistent 'uv pip audit' with uv export | pip-audit; tag bare fences; add Not-for exclusion to description
- design-tokenization: Tailwind v4 DOES auto-detect sources (gitignored / out-of-root paths are what @source covers); kill dead v4-beta docs link
- react-component-patterns: useEffectEvent is stable since React 19.2 (was 'experimental, do not use'); trim description 502->445 chars
- lint-no-suppressions: add BAD/GOOD fence, bare-suppression-scope gotchas, Rust #[expect] (1.81+); ESLint flat-config row (13 -> 16 score)
- solid refs: '# type: ignore' removed from a GOOD example, assert_never is Python 3.11+ not PEP 661, dyn-compatibility note for native async fn traits, once_cell::Lazy -> std LazyLock
- type-safety: #private is class-only syntax (factory closures are what's private), was claimed to work on plain objects
- security: remove constraint-contradicting MAY-run-concurrently clause
- human-docs-workflow: description 552 -> 488 chars

All engineering skills now 16/16 except vendored make-interfaces (10, body frozen by policy).

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX
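
To illustrate the `assert_never` correction in the solid refs item: `typing.assert_never` is available from Python 3.11, so code that must run on older interpreters needs a fallback. The sketch below is not taken from the kit's reference files; the `Severity` enum and `label` function are hypothetical names used only to show the exhaustiveness-check pattern.

```python
import sys
from enum import Enum

if sys.version_info >= (3, 11):
    from typing import assert_never
else:
    from typing import NoReturn

    def assert_never(value: object) -> NoReturn:
        """Fallback for interpreters older than 3.11 (illustrative only)."""
        raise AssertionError(f"Expected code to be unreachable, but got: {value!r}")


class Severity(Enum):  # hypothetical enum, not from the kit
    LOW = "low"
    HIGH = "high"


def label(sev: Severity) -> str:
    if sev is Severity.LOW:
        return "can wait"
    elif sev is Severity.HIGH:
        return "fix now"
    else:
        # A type checker reports an error here if a new member is unhandled.
        assert_never(sev)
```

If a third member is added to `Severity` without a matching branch, a static type checker flags the `assert_never` call, so no `# type: ignore` is needed to keep the function total.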

…I gate

Productivity skills (evidence-gated, real content only):

- plan-init + skill-agent-pipeline: descriptions trimmed under the 500-char CSO tier (15 -> 16 each)
- write-skill: third constraint promoting the updated+content-hash bookkeeping rule (14 -> 16)
- zoom-out: 7-module cap promoted to a constraint, situation -> output-form table, BAD/GOOD map example, fences tagged (9 -> 16)
- caveman: persistence rule promoted to constraint, drop/keep rules table, BAD/GOOD labels (8 -> 12; stays sub-16 by design, since it's a brevity skill)
- plan-review agent: missing ## Output Format section (14 -> 15; both shipped agents now 15/15)

Validator hardening (no floor loosened):

- shellcheck -S warning is now a CI gate: ci.sh §3b (self-skips when not installed locally) + ci.yml guard job + scripts/AGENTS.md validator row
- 6 shellcheck findings fixed: cd||exit in ci.sh + idempotency test, unused loop var in release.sh, xargs -> sed path derivation in tree/guard.sh, 2 documented SC2043 disables
- slop scorer strips fenced blocks + code spans first: quoting a banned word (ban lists, BAD examples) is not prose slop
- BSD date fallback for the freshness point (macOS parity)
- UTF-8 locale forced so description tiers count chars, not bytes
- dead extract_yaml_value removed from skills scorer
- agents scorer credits explicit full claude-* model IDs

Category totals: engineering 212 -> 218, productivity 189 -> 204, agents 29 -> 30. ci.sh fully green including the new gate.

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX
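
The slop-scorer change can be pictured with a small sketch: strip fenced blocks and inline code spans before counting banned words, so quoted examples are not scored as prose. The kit's real scorer is not shown in this PR, so everything below is an assumption: the function names, the regexes, and the sample banned word are all hypothetical.

```python
import re
from typing import List

FENCE = "`" * 3  # built at runtime so this example contains no literal fence

# A fenced block: an opening fence at line start through the next closing fence line.
FENCE_RE = re.compile(rf"^{FENCE}.*?^{FENCE}[ \t]*$", re.DOTALL | re.MULTILINE)
# An inline code span on a single line.
CODE_SPAN_RE = re.compile(r"`[^`\n]*`")


def strip_code(markdown: str) -> str:
    """Remove fenced blocks first, then inline code spans."""
    without_fences = FENCE_RE.sub("", markdown)
    return CODE_SPAN_RE.sub("", without_fences)


def count_banned(markdown: str, banned: List[str]) -> int:
    """Count banned words in prose only (hypothetical scoring rule)."""
    prose = strip_code(markdown)
    return sum(
        len(re.findall(rf"\b{re.escape(word)}\b", prose, re.IGNORECASE))
        for word in banned
    )


sample = "\n".join([
    "Never write `delve` in prose.",
    FENCE + "text",
    "BAD: let us delve into the details",
    FENCE,
    "We delve into the config next.",
])
```

On `sample`, only the last line counts: the code span and the fenced BAD example are removed before matching, which is the behaviour the commit message describes.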

…ersionless examples

The Codex marketplace policy values are documented now (installation: AVAILABLE/NOT_AVAILABLE/INSTALLED_BY_DEFAULT; authentication: ON_INSTALL/ON_USE), so drop the 'not documented yet' hedge. Worked examples use X.Y.Z placeholders instead of a stale hardcoded 0.3.0 (no version numbers in prose).

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX
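
As a rough illustration of how the documented policy values could be checked, here is a minimal validator sketch. Only the value names come from the commit message; the flat `policy` dictionary shape, the assumption that both keys are required, and the function name are hypothetical and not taken from Codex or the kit.

```python
from typing import Dict, List

# Value sets as listed in the commit message above.
INSTALLATION_VALUES = {"AVAILABLE", "NOT_AVAILABLE", "INSTALLED_BY_DEFAULT"}
AUTHENTICATION_VALUES = {"ON_INSTALL", "ON_USE"}


def policy_errors(policy: Dict[str, str]) -> List[str]:
    """Return one message per invalid or missing policy field (assumed shape)."""
    errors = []
    checks = (
        ("installation", INSTALLATION_VALUES),
        ("authentication", AUTHENTICATION_VALUES),
    )
    for key, allowed in checks:
        value = policy.get(key)
        if value not in allowed:
            errors.append(f"{key}: {value!r} is not one of {sorted(allowed)}")
    return errors
```

A policy such as `{"installation": "AVAILABLE", "authentication": "ON_USE"}` yields no errors, while an unknown or missing value produces one message per offending field.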

…ng form

Loop tick 1/12: deep-read skill-maintenance (clean) + multi-tool-bridge, whose Step-3 split gate used the bypassable 'wait for confirmation' phrasing that the kit itself documents as ignored by literal-instruction models.

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX

…o-author-scripts pattern

Loop tick 2/12: context-tree's shipped body + 2 references named the docks author script tree/guard.sh in the bare form (no scripts/ prefix), which the no-author-scripts guard couldn't see. Such a reference breaks the moment the skill runs in a consumer repo. Replaced with the self-contained pair check; the guard pattern now also catches the distinctive bare names (tree/guard.sh, content-hash.sh, transform-guard.sh, no-author-scripts.sh, codex-facts.sh, guard-spec.sh). plan-manager + plan-sidecar re-read: clean.

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX
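
The widened guard can be pictured as a scan for the listed script names wherever they appear, with or without a scripts/ prefix. The real guard is a shell script whose pattern is not shown in this PR; the Python sketch below is only an illustration, and `find_author_script_refs` is a hypothetical name.

```python
import re
from typing import List

# Bare names taken from the commit message above.
BARE_AUTHOR_SCRIPTS = [
    "tree/guard.sh",
    "content-hash.sh",
    "transform-guard.sh",
    "no-author-scripts.sh",
    "codex-facts.sh",
    "guard-spec.sh",
]

# Matching the bare name also matches the prefixed form (scripts/<name>).
PATTERN = re.compile("|".join(re.escape(name) for name in BARE_AUTHOR_SCRIPTS))


def find_author_script_refs(text: str) -> List[str]:
    """Return the distinct author-script names mentioned in a skill body."""
    return sorted({match.group(0) for match in PATTERN.finditer(text)})
```

A shipped skill body that returns a non-empty list here would depend on files that exist only in the author's repo, which is the failure mode this commit closes.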

Loop tick 3/12: deep-read scaffold + plan-init + all 9 skill-agent-pipeline phase references — all era-accurate; the only fix is scaffold's gate, which used the bypassable 'wait for confirmation' phrasing. Full shipped surface (every SKILL.md + reference + agent body) has now been read this session. https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX

Loop tick 4/12: read the final un-read references (write-skill data-preservation, lint pre-commit-hook, plan-init template + scaffold spec-schema spot-checks) — all clean except a legacy .eslintrc.* mention now framed as eslint.config.js-first. Hook payload shapes re-verified. https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX

claude added 13 commits June 10, 2026 03:11

docs(plans): ship + review the capability-tuning research rollout

dd74d60

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX

docs(plans): scaffold codex-mirror-native-manifest-note + refresh das…

a2fc424

…hboard https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX

docs(plans): ship + review codex-mirror-native-manifest-note; refresh…

368a09b

… dashboard https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX

docs(plans): ship + review the full-kit prompt optimization

dbf031f

https://claude.ai/code/session_01HQ2Qevpwxq4ECfutPuSkyX

DocksDocks merged commit 84601ae into main Jun 10, 2026
1 check passed

DocksDocks merged 13 commits into main from claude/dreamy-dijkstra-xu8opp

2 participants
