Skip to content

fix: surface full azd eval failure details and persist raw logs#291

Merged
placerda merged 1 commit into
developfrom
feature/azd-failure-diagnostics
Jun 10, 2026
Merged

fix: surface full azd eval failure details and persist raw logs#291
placerda merged 1 commit into
developfrom
feature/azd-failure-diagnostics

Conversation

@placerda

Copy link
Copy Markdown
Contributor

When execution: azd fails, agentops eval run previously bubbled up only the active azd spinner line (e.g. Resolving eval context...), giving users no actionable diagnostic.

Changes:

  • _format_command_failure now includes the full command, exit code, and both stderr and stdout (defensively truncated).
  • run_azd_eval accepts debug_dir and persists raw streams to .agentops/results/<ts>/azd_eval_run_{stdout,stderr}.log (plus azd_eval_show_* when applicable) on any failure.
  • Orchestrator passes options.output_dir as debug_dir so logs land next to results.json on real runs.

921 unit tests pass.

When `execution: azd` fails, `agentops eval run` previously bubbled up only the active azd spinner line (e.g. `Resolving eval context...`), giving users no actionable diagnostic. The runner now includes the full command, exit code, and both stderr and stdout in the error, and persists raw streams to `.agentops/results/<ts>/azd_eval_run_{stdout,stderr}.log` (plus matching `azd_eval_show_*` files when the show step fails).

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
@placerda placerda merged commit 27a7187 into develop Jun 10, 2026
12 checks passed
@placerda placerda deleted the feature/azd-failure-diagnostics branch June 10, 2026 11:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant