local-embeddings

Star

Here are 17 public repositories matching this topic...

ravila4 / obsidian-semantic-search

Star

Semantic search for Obsidian vaults using LanceDB and cloud or local embedding models

embeddings obsidian semantic-search knowledge-management local-embeddings

Updated Jun 15, 2026
Python

Violet-sword / Deep-Knowledge-Chatbot

Star

A Python project that deploys a Local RAG chatbot using Ollama API. Refines answers with Deep Research from external websites, and uses both Embedding and LLM models.

python chatbot embeddings rag llm ollama deep-research local-embeddings

Updated Apr 14, 2025
Python

BaiGanio / aperio

Star

One brain. Every AI agent. Nothing forgotten. — Self-hosted memory layer via MCP + Postgres + pgvector

docker postgres mcp self-hosted pgvector lancedb local-ai ollama local-embeddings

Updated Jun 19, 2026
JavaScript

agarwalvishal / mcp-rag-server

Star

MCP RAG server — local embeddings, your docs never leave your machine. Private knowledge base + web search for Claude, Cursor, and Ollama. Drop your docs, connect your AI client, done.

python privacy mcp self-hosted cursor knowledge-base semantic-search claude rag vector-database qdrant llm local-llm retrieval-augmented-generation ollama mcp-server local-embeddings

Updated Apr 9, 2026
Python

jashutch / zeddal

Star

Turn your voice into intelligent, linked notes inside Obsidian

multilingual productivity transcription context-aware semantic-search whisper voice-to-text rag obsidian-plugin gpt4 local-embeddings ai-note-taking note-linking

Updated Nov 15, 2025
JavaScript

Violet-sword / Local-RAG-Cookbot

Star

A Python project that deploys a Local RAG chatbot using Ollama API. Refines answers with internal RAG knowledge base, and uses both Embedding and LLM models.

python chatbot embeddings rag llm local-llm ollama local-embeddings

Updated Apr 14, 2025
Python

Violet-sword / Local-RAG-Chatbot-Rerank

Star

A Python project that deploys a Local RAG chatbot using Ollama API and vLLM API. Refines answers with internal RAG knowledge base, using both Embedding and Rerank models to improve accuracy of context provided to LLM models.

python chatbot embeddings rag llm vllm local-llm ollama rerank local-embeddings local-rerank

Updated Apr 21, 2025
Python

Shun0212 / OwlSpotLight

Sponsor

Star

Semantic code search for VS Code, powered by NightOwl-CodeEmbedding — my own ModernBERT Bi-Encoder trained from scratch. Codex/MCP ready!!

vscode code-search vscode-extension codex faiss local-ai local-embeddings

Updated Jun 17, 2026
Python

AuraFriday / llm_mcp

Star

MCP server that runs local LLMs (with full access to MCP tools included). Callable by Python to chain MCP tools with local intelligence.

ai gpu mcp npu llm local-inference local-llm local-ai mcp-servers mcp-server local-embeddings

Updated Jan 19, 2026
Python

AlwaysSany / huggingface-local-embedding

Star

A Fast API server that provides local text and multi-modal embedding using LlamaIndex Hugging Face Embedding

python docker embeddings google-colab fastapi llama-index huggingface-embeddings local-embeddings

Updated Jul 3, 2025
Jupyter Notebook

wuxxin / agents-shared

Star

Sandboxed local AI (openclaw compatible) assistants and inference orchestrator

systemd orchestration sandboxing signal-cli ai-agents bubblewrap local-inference whisper-cpp llama-cpp local-llm local-embeddings local-rerank openclaw zeroclaw moltis ironclaw librefang local-speech-to-text local-tex-to-speech

Updated Jun 19, 2026
Shell

harshithreddyv9 / secure-RAG-agent-with-Groq

Star

A lightweight Retrieval-Augmented Generation (RAG) agent powered by Groq AI and local embeddings, built to process and understand text data efficiently. It retrieves relevant context from your own files and generates accurate, natural-language responses -all while keeping your data private and running locally.

Updated Nov 6, 2025
Python

joshimohanlalit1303-ctrl / ContextOS

Star

Memory-as-a-Service for AI Agents & LLMs. Add persistent memory, pgvector-based semantic search, and automatic semantic deduplication with 3 simple REST API endpoints. Comes with an LRU embedding cache and a developer analytics dashboard.

nodejs postgres typescript developer-tools python-sdk semantic-search onnx saas-boilerplate docker-ready pgvector llm-memory ai-memory semantic-deduplication agent-memory local-embeddings

Updated Jun 10, 2026
TypeScript

hermes-labs-ai / claude-router

Star

claude-router is a local prompt router that picks the right Claude model tier and prepends the right scaffold using local embeddings before you call the API. A deterministic routing layer for eval, research, content, and review prompts that helps teams stop overspending on Sonnet and Opus when Haiku plus structure is enough.