Skip to content
#

local-embeddings

Here are 17 public repositories matching this topic...

A lightweight Retrieval-Augmented Generation (RAG) agent powered by Groq AI and local embeddings, built to process and understand text data efficiently. It retrieves relevant context from your own files and generates accurate, natural-language responses -all while keeping your data private and running locally.

  • Updated Nov 6, 2025
  • Python

Memory-as-a-Service for AI Agents & LLMs. Add persistent memory, pgvector-based semantic search, and automatic semantic deduplication with 3 simple REST API endpoints. Comes with an LRU embedding cache and a developer analytics dashboard.

  • Updated Jun 10, 2026
  • TypeScript

claude-router is a local prompt router that picks the right Claude model tier and prepends the right scaffold using local embeddings before you call the API. A deterministic routing layer for eval, research, content, and review prompts that helps teams stop overspending on Sonnet and Opus when Haiku plus structure is enough.

  • Updated May 31, 2026
  • Python

Improve this page

Add a description, image, and links to the local-embeddings topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the local-embeddings topic, visit your repo's landing page and select "manage topics."

Learn more