Skip to content

feat(mentoring): add good-first-issue-sweep skill and eval suite#632

Merged
potiuk merged 2 commits into
apache:mainfrom
justinmclean:good-first-issue-sweep
Jun 29, 2026
Merged

feat(mentoring): add good-first-issue-sweep skill and eval suite#632
potiuk merged 2 commits into
apache:mainfrom
justinmclean:good-first-issue-sweep

Conversation

@justinmclean

Copy link
Copy Markdown
Member

Summary

Adds the magpie-good-first-issue-sweep skill, which sweeps the open issue backlog for GFI candidates using the G1–G7 suitability rubric (scope, self-containment, code pointer, effort, security, architectural, deprecation). Classifies each issue as READY / NEAR-MISS / SKIP; proposes labels only after explicit maintainer confirmation; detects and flags prompt-injection attempts in issue bodies without acting on them.

Ships a six-case eval suite for Step 2 (classify) covering: clean READY, NEAR-MISS missing a code pointer, NEAR-MISS with vague scope, SKIP for security-sensitive content, SKIP for an architectural decision, and a SKIP that overrides an embedded injection instruction.

Generated-by: Claude (Opus 4.7)

Type of change

  • Skill change (.claude/skills/<name>/) — eval fixtures updated below
  • Tool / bridge contract (tools/<system>/*.md)
  • Python package (tools/*/ with pyproject.toml)
  • Groovy reference impl
  • Cross-cutting (RFC, AGENTS.md, sandbox, privacy-LLM)
  • Documentation (docs/, README.md, CONTRIBUTING.md)
  • Project template (projects/_template/)
  • CI / dev loop (prek, workflows, validators)
  • Other:

Test plan

  • prek run --all-files passes
  • For Python packages touched: uv run pytest / ruff check / mypy passes
  • For Groovy bridges touched: command-line invocation tested end-to-end
  • For skill changes: eval suite passes for the affected skill
    (PYTHONPATH=tools/skill-evals/src python3 -m skill_evals.runner tools/skill-evals/evals/<skill>/)
  • For skill behaviour changes: a new or updated eval fixture is included in this PR
    (a regression test for the bug fixed / the behaviour added — see CONTRIBUTING.md)
  • Other:

Adds the magpie-good-first-issue-sweep skill, which sweeps the open
issue backlog for GFI candidates using the G1–G7 suitability rubric
(scope, self-containment, code pointer, effort, security, architectural,
deprecation). Classifies each issue as READY / NEAR-MISS / SKIP;
proposes labels only after explicit maintainer confirmation; detects and
flags prompt-injection attempts in issue bodies without acting on them.

Ships a six-case eval suite for Step 2 (classify) covering: clean READY,
NEAR-MISS missing a code pointer, NEAR-MISS with vague scope, SKIP for
security-sensitive content, SKIP for an architectural decision, and a
SKIP that overrides an embedded injection instruction.

Generated-by: Claude (Opus 4.7)
@justinmclean justinmclean self-assigned this Jun 29, 2026
@potiuk potiuk merged commit dd18a3c into apache:main Jun 29, 2026
30 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants