fix(experiments): pass base_experiment_id to summarize #512

Open
Barrett Pyke (barrettpyke) wants to merge 1 commit into main from barrettpyke/base-experiment-id

@barrettpyke

Contributor

Description

Eval stores base_experiment_id correctly on the experiment, but the final summary does not pass it as the explicit comparison ID. As a result, the summary comparison can fall back to project/default baseline resolution and show diffs against the wrong baseline.

Fix

Pass evaluator.base_experiment_id into experiment.summarize(comparison_experiment_id=...), so score and metric diffs are computed against the explicit experiment baseline.

Also resolve the explicit comparison experiment's name so the returned summary displays the correct “compared to” name. Previously, comparison_experiment_id was None, so summarize() called POST /api/base_experiment/get_id. That resolver can apply UI/default-baseline behavior, including letting a project default baseline override the experiment’s explicit base_exp_id.
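
To illustrate the behavior change, here is a minimal, self-contained sketch. It does not use the real SDK: `FakeExperiment`, `Summary`, `EXPERIMENT_NAMES`, and the experiment IDs are hypothetical stand-ins, and the resolver is modeled only as far as the description above states (a project default baseline may win over the explicit base_exp_id). Only the `comparison_experiment_id` keyword is taken from this PR.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical ID -> name lookup, standing in for a server-side name resolution.
EXPERIMENT_NAMES = {
    "exp-explicit": "explicit-baseline",
    "exp-project-default": "project-default-baseline",
}


@dataclass
class Summary:
    comparison_experiment_id: Optional[str]
    comparison_experiment_name: Optional[str]


class FakeExperiment:
    """Hypothetical stand-in for an experiment; not the real SDK class."""

    def __init__(self, base_exp_id: str, project_default_baseline: Optional[str]):
        self.base_exp_id = base_exp_id
        self.project_default_baseline = project_default_baseline

    def _resolve_base_experiment(self) -> str:
        # Assumed model of the fallback resolver: a project default
        # baseline, when set, overrides the explicit base_exp_id.
        return self.project_default_baseline or self.base_exp_id

    def summarize(self, comparison_experiment_id: Optional[str] = None) -> Summary:
        if comparison_experiment_id is None:
            # Old path: no explicit ID, so the resolver picks the baseline.
            comparison_experiment_id = self._resolve_base_experiment()
        # Resolve the display name from whichever ID is actually used.
        name = EXPERIMENT_NAMES.get(comparison_experiment_id)
        return Summary(comparison_experiment_id, name)


experiment = FakeExperiment(
    base_exp_id="exp-explicit",
    project_default_baseline="exp-project-default",
)

# Before the fix: nothing is passed, so the project default can win.
before = experiment.summarize()

# After the fix: the explicit baseline is passed through to summarize().
after = experiment.summarize(comparison_experiment_id=experiment.base_exp_id)
```

In this sketch, `before` compares against the project default while `after` compares against the explicit baseline, and the displayed name follows the ID that was actually used.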

Member

base_experiment_name has the same bug here. We need to fix that too.

Development

Successfully merging this pull request may close these issues.

Eval summary can compare against wrong baseline

2 participants