infrence
Here are 8 public repositories matching this topic...
A pure Elixir Datalog engine with semi-naive fixpoint evaluation, stratified negation, provenance tracking, and telemetry.
-
Updated
May 15, 2026 - Elixir
Совместный проект курса в МФТИ по методам эффективной реализации моделей искусственного интеллекта
-
Updated
Jun 3, 2025 - C++
Nectar-X-Studio is a powerful, Local AI-Inferencing application that allows the user download, create, run agents and run large language models on their own machine. With no internet connection required, Nectar ensures privacy-first, high-performance inference using cutting-edge open-source models from Hugging Face, Ollama, and beyond.
-
Updated
Mar 20, 2026 - Python
Rust SDK for writing custom backends for NVIDIA Triton Inference Server
-
Updated
Apr 11, 2026 - Rust
Local-first LLM toolkit: token/KV-cache/VRAM analyzer, Ollama paraphrase pipeline, and a weighted load-balancer with health checks. 100% local — no API keys, no data leaves your machine. 51 passing tests.
-
Updated
Jun 1, 2026 - Python
Duplex is an advanced, strictly client-side application designed to interface with multiple Large Language Models simultaneously. Run local instances through Ollama and multiple cloud APIs in a unified, privacy-first interface.
-
Updated
Jun 11, 2026 - TypeScript
Improve this page
Add a description, image, and links to the infrence topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the infrence topic, visit your repo's landing page and select "manage topics."