Skip to content

XY-926: Add live daily-use benchmark suites#191

Merged
yvette-carlisle merged 1 commit into
mainfrom
y/elf-xy-926
Jun 11, 2026
Merged

XY-926: Add live daily-use benchmark suites#191
yvette-carlisle merged 1 commit into
mainfrom
y/elf-xy-926

Conversation

@yvette-carlisle

Copy link
Copy Markdown
Member

Summary

  • add full live adapter coverage for ELF consolidation, knowledge-page, operator-debugging, and existing capture/write-policy suites
  • preserve qmd and unsupported competitor boundaries as typed non-pass records instead of fake losses or fake payloads
  • refresh benchmark docs/manifest claim boundaries for 55 jobs across 13 checked-in suites

Validation

  • cargo make fmt
  • cargo make lint-fix
  • cargo test -p elf-eval --test real_world_job_benchmark declared_not_encoded_consolidation_jobs_do_not_require_fake_proposals -- --exact --nocapture
  • cargo make real-world-memory-live-adapters
  • cargo make checks

Notes

  • Docker live sweep completed with Qdrant client/server compatibility warnings, but generated ELF/qmd live reports and summary successfully.
  • Decodex attempt xy-926-attempt-1-1781209961 is stalled with no active lease and no live process; this PR is a manual takeover of the retained worktree.

@yvette-carlisle yvette-carlisle merged commit 788600f into main Jun 11, 2026
13 checks passed
@yvette-carlisle yvette-carlisle deleted the y/elf-xy-926 branch June 11, 2026 21:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant