Parallelized Eva Evaluation by achi2023 · Pull Request #44 · MedARC-AI/eva-probe

achi2023 · 2026-04-30T22:06:24Z

Purpose

This PR introduces a Slurm script that parallelizes the evaluation for OpenMidnight on the Eva evaluation suite. The default parameters are set for ViT Base.

Usage

Run the packed eval script as follows:
sbatch OpenMidnight/slurm_run_evals_packed.sbatch
/path/to/sweep_or_eval
/admin/home/achihoub/openmidnight_eval_results

Optional parameters include:

CKPT_PARALLELISM: number of ckpts to run in parallel
TASK_PARALLELISM: number of tasks to run in parallel
MAX_CHECKPOINTS: max number of ckpts for this run
TASK_DATASETS: specific datasets to run from the Eva suite
BACKBONE_MODEL, MODEL_NAME, IN_FEATURES: Dino model type, model size, number of input features
N_DATA_WORKERS, PREDICT_BATCH_SIZE

CLAassistant · 2026-04-30T22:06:31Z

All committers have signed the CLA.

PaulScotti · 2026-05-06T20:38:14Z

Thanks Anis, but can you clean this up so I can push to main?

Why do you have both a slurm_run_evals.sbatch and slurm_run_evals_packed.sbatch? I dont understand the difference of how these are meant to be used

Also generally looks like lots of AI-generated code that is not necessary e.g., there are lines of code like "This is the failure mode behind the CUDA OOMs in job 33308" which has no meaning to other people. Also lots of new lines of code that I'm not sure are necessary and which I don't know how to interpret

stuff like

if [[ ! -f "${REPO_ROOT}/.venv/bin/activate" ]]; then
    echo "Virtualenv not found: ${REPO_ROOT}/.venv/bin/activate"
    exit 1
  fi

are not necessary; a PR should strictly implement the intended feature and leave everything unchanged that can be left unchanged

achi2023 · 2026-05-06T20:42:19Z

Hi Paul,

To answer your question, slurm_run_evals.sbatch was added by accident. The packed script is what will parallelize the eva evals. I will take down slurm_run_evals.sbatch so that there is no confusion. I will also clean up the packed script to get rid of the unnecessary code asap.

Add files via upload

55c606b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parallelized Eva Evaluation#44

Parallelized Eva Evaluation#44
achi2023 wants to merge 1 commit into
MedARC-AI:mainfrom
achi2023:main

achi2023 commented Apr 30, 2026

Uh oh!

CLAassistant commented Apr 30, 2026 •

edited

Loading

Uh oh!

PaulScotti commented May 6, 2026

Uh oh!

achi2023 commented May 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

achi2023 commented Apr 30, 2026

Purpose

Usage

Uh oh!

CLAassistant commented Apr 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

PaulScotti commented May 6, 2026

Uh oh!

achi2023 commented May 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

CLAassistant commented Apr 30, 2026 •

edited

Loading