Clarification on Leaderboard Submission

Hi, thanks for releasing MBench and maintaining the leaderboard.

We would like to add our model results to the MBench leaderboard. I have finished running the evaluation, and each metric directory contains `units.jsonl`, `items.jsonl`, and `summary.json`. I would like to confirm which level of results should be included in the leaderboard submission JSON.

Should the submission only contain the aggregated scores from each metric's `summary.json` plus `total_m_score`, or should it also include item-level results from `items.jsonl` / `units.jsonl` for verification?

Could you also provide an example leaderboard submission JSON, including the format of `total_m_score`?

Thanks!


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clarification on Leaderboard Submission #1

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Clarification on Leaderboard Submission #1

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions