Skip to content

Consolidated Safety Scorer#145

Draft
saengel wants to merge 3 commits into
mainfrom
feature/sc-43756/consolidate-safety-scorers
Draft

Consolidated Safety Scorer#145
saengel wants to merge 3 commits into
mainfrom
feature/sc-43756/consolidate-safety-scorers

Conversation

@saengel

@saengel saengel commented May 28, 2026

Copy link
Copy Markdown
Contributor

The goal of this story is to consolidate our eval process and condense 8 safety scorers (Antisemitism, violence & hate speech, politics, theology, health & Judaism, Minors & appropriate standards, Suicide & self-harm and delusional thinking) into 1 comprehensive scorer across all axes.

Once approved, the old scorers will need to be archived/removed.

This is a draft PR, and needs a different type of review most likely - to be coordinated with @HadaraRachel. Currently undergoing testing in BT against Benchmark.

@coolify-sefaria-github

coolify-sefaria-github Bot commented May 28, 2026

Copy link
Copy Markdown

The preview deployment for sefaria/ai-chatbot:server is ready. 🟢

Open app | Open Build Logs | Open Application Logs

Last updated at: 2026-06-02 17:19:11 CET

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant