RAG App

A pure RAG pipeline with TUI, designed to work well with smaller local LLMs (via Ollama).

rag-app-readme.webm

Overview

This project is a streamlined RAG application.
By sticking to a pure RAG pipeline, this app works well with smaller models, such as llama3.2:3b.
Smaller models don't handle agentic workflows and tool calling very well; they often get stuck in a loop or make things up.
With this app you get answers referencing only the uploaded documents. When the information is not there, the app will tell you instead of making things up.

Requirements

Python: 3.14 or newer.
Package Manager: uv
LLM Engine: Ollama installed and running locally.
- Make sure Ollama is running (ollama serve). Ideally, configure it to autostart on boot.
- You need to download an LLM (e.g., ollama pull llama3.2:3b) and an embedding model (e.g., ollama pull nomic-embed-text).
- Run ollama ls to verify that the models are downloaded.

Installation

Install Globally with UV

uv tool install git+https://github.com/jotalac/rag-app.git

Manual clone (alternative)

Clone the repository:

git clone git@github.com:jotalac/rag-app.git
cd rag-app

Install app:
```
uv tool install .
```

Updating the App

If you installed globally via Git: Run the upgrade command:

uv tool upgrade rag-app

If you installed via local clone (Alternative): Pull the latest code first, then upgrade:

git pull
uv tool install . --force

Usage

Start the TUI by running:

rag-app

Note on Cold Starts: The first generation query is always slow because Ollama needs to load the model into memory. This delay also occurs anytime you change the model in the configuration.

In the app, run /help to see all available options.
ctrl+p opens the default menu, where you can change theme or do other actions

Adding resources (in TUI)

Create a folder anywhere on your device where all your resources will be stored.
In the TUI config dialog (/config), set the resources directory to your created folder.
Run /add-resources file1 file2 ... or /add-resources-dir dir_name to embed the files into the vector database.
After the files are embedded, you can safely delete them from the resources directory.

Asking about resources

Type your prompt in the input, and the app will automatically look at the uploaded resources.
Smaller models might struggle if the resources are not in English.
If no relevant data is retrieved from the vector database, generation won't start, and you will see a info message.

Current Limitations

Single Workspace: You cannot separate your resources; all resources are available for all prompts.
Language Support: For smaller models, querying in languages other than English often yields poor or hallucinated results.
Thinking Models: Thinking output is not currently visible.

To-do

Add support for importing resources directly from web URLs.
Add support for embedding and querying images, audio, and other media resources.
Add support for cloud LLM providers.
Adding resources from any folder (not only from one resources directory)

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
src/rag_app		src/rag_app
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
pyproject.toml		pyproject.toml
todo.md		todo.md
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG App

Overview

Requirements

Installation

Install Globally with UV

Manual clone (alternative)

Updating the App

Usage

Adding resources (in TUI)

Asking about resources

Current Limitations

To-do

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RAG App

Overview

Requirements

Installation

Install Globally with UV

Manual clone (alternative)

Updating the App

Usage

Adding resources (in TUI)

Asking about resources

Current Limitations

To-do

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages