Add ivrit.ai Hebrew model (Whisper Large v3 Turbo) by Nitzan94 · Pull Request #415 · altic-dev/FluidVoice

Nitzan94 · 2026-06-24T00:17:43Z

What

Adds whisperIvritV3Turbo, a Hebrew-specialized Whisper model from ivrit-ai/whisper-large-v3-turbo-ggml (1.62 GB), as a first-class speech model.

Hebrew currently only routes to Nemotron (experimental), generic Whisper (auto-detect), and Apple Speech. This adds a Hebrew-tuned model that is far stronger on Hebrew and on mixed Hebrew-English speech (it is fine-tuned on real Israeli speech, which is full of English loanwords).

How

No new engine — it is a ggml model, so it loads through the existing WhisperProvider (SwiftWhisper / whisper.cpp). Model→provider routing already falls through to Whisper, so no ASRService changes were needed.
Recommended for Hebrew — VoiceEngineLanguageCatalog lists it first for he.
Forced Hebrew decode — sets whisper.params.language = .hebrew after load so whisper.cpp does not auto-detect on short/accented audio. (Side note: the provider previously never set a language at all, leaving SwiftWhisper's .auto default — this PR only forces it for this language-specialized model.)
Per-model download URL — adds SpeechModel.whisperDownloadURL so this model loads from the ivrit.ai HF repo, with the remote filename (ggml-model.bin) decoupled from the local cache filename (ggml-ivrit-v3-turbo.bin).

Files

SettingsStore.swift — new enum case + metadata + whisperDownloadURL / forcedWhisperLanguageCode
WhisperProvider.swift — per-model download URL, size-integrity floor, language forcing
VoiceEngineLanguageCatalog.swift — Hebrew → ivrit route (first)
WelcomeView.swift — onboarding subtitle
Tests/…/IvritHebrewModelTests.swift — routing + metadata tests

Testing

Added unit tests for catalog routing and model metadata.
Verified the download URL resolves to the real file (HTTP 200, content-length 1,624,555,275).
Note: I could not run a full Xcode build in my environment — please confirm CI compiles. All new enum cases were cross-checked against every exhaustive switch over SpeechModel.

🤖 Generated with Claude Code

Adds whisperIvritV3Turbo, a Hebrew-specialized Whisper model from ivrit-ai/whisper-large-v3-turbo-ggml, as a first-class speech model. - Routes through the existing SwiftWhisper path (ggml format, no new engine) - Surfaces as the recommended engine when Hebrew is selected - Forces Hebrew decode language so whisper.cpp does not auto-detect on short or accented audio - Per-model download URL so the model loads from the ivrit.ai HF repo, decoupled from the local cache filename Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 7d647c116c

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-06-24T00:21:22Z

+
+@testable import FluidVoice_Debug
+
+final class IvritHebrewModelTests: XCTestCase {


Add this test file to the Xcode test target

This new test class is never compiled or run by the configured test scheme: I checked Fluid.xcodeproj/project.pbxproj, and the FluidDictationIntegrationTests PBXSourcesBuildPhase still lists only AudioFixtureLoader.swift and DictationE2ETests.swift, with no file reference/build file for IvritHebrewModelTests.swift. Because this repo's test target is manually enumerated rather than file-system-synchronized, the ivrit routing/metadata checks added here provide no CI coverage until the file is added to the target.

Useful? React with 👍 / 👎.

Good catch — fixed in 39232b7. Added IvritHebrewModelTests.swift to the FluidDictationIntegrationTests target (file reference + build file + group membership + Sources build phase), mirroring DictationE2ETests.swift, so the routing/metadata checks are now compiled and run by the test scheme.

The new test file existed on disk but was not referenced in project.pbxproj, so the manually-enumerated FluidDictationIntegrationTests target never compiled it (no CI coverage). Add the file reference, build file, group membership, and sources build-phase entry, mirroring DictationE2ETests.swift. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

chatgpt-codex-connector Bot reviewed Jun 24, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add ivrit.ai Hebrew model (Whisper Large v3 Turbo)#415

Add ivrit.ai Hebrew model (Whisper Large v3 Turbo)#415
Nitzan94 wants to merge 2 commits into
altic-dev:mainfrom
Nitzan94:nitzan/hebrew-ivrit-ai-model

Nitzan94 commented Jun 24, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Jun 24, 2026

Uh oh!

Nitzan94 Jun 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		@testable import FluidVoice_Debug

		final class IvritHebrewModelTests: XCTestCase {

Uh oh!

Conversation

Nitzan94 commented Jun 24, 2026

What

How

Files

Testing

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

Nitzan94 Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants