Consolidated Safety Scorer by saengel · Pull Request #145 · Sefaria/ai-chatbot

saengel · 2026-05-28T12:55:03Z

The goal of this story is to consolidate our eval process and condense 8 safety scorers (Antisemitism, violence & hate speech, politics, theology, health & Judaism, Minors & appropriate standards, Suicide & self-harm and delusional thinking) into 1 comprehensive scorer across all axes.

Once approved, the old scorers will need to be archived/removed.

This is a draft PR, and needs a different type of review most likely - to be coordinated with @HadaraRachel. Currently undergoing testing in BT against Benchmark.

coolify-sefaria-github · 2026-05-28T12:55:07Z

The preview deployment for sefaria/ai-chatbot:server is ready. 🟢

Open app | Open Build Logs | Open Application Logs

Last updated at: 2026-06-02 17:19:11 CET

chore(safety): consolidated scorer

9840f4f

saengel added 2 commits May 28, 2026 16:03

fix: adjust return, add nuance

f0ebd09

fix: verdict bug in prompt

d9ca44f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consolidated Safety Scorer#145

Consolidated Safety Scorer#145
saengel wants to merge 3 commits into
mainfrom
feature/sc-43756/consolidate-safety-scorers

saengel commented May 28, 2026 •

edited

Loading

Uh oh!

coolify-sefaria-github Bot commented May 28, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

saengel commented May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coolify-sefaria-github Bot commented May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

saengel commented May 28, 2026 •

edited

Loading

coolify-sefaria-github Bot commented May 28, 2026 •

edited

Loading