Project Kiwi

Project Kiwi is a fork of Just Rayen's Project Riko.

I saw their YouTube video and wanted to look through the code.

I found it... interesting.

Here is my attempt at making it more modular and easier to understand, while keeping the core functionality generally intact.

I should mention, however, that I did literally tear the entire original project apart at the seams and reassemble it in a way that made more sense to me, so there are significant changes to the code structure and organisation.

Tested with Python 3.12 on Linux Mint 22.2 (Zara)

Design Philosophy

Run everything locally. I want this to be able to run fully offline with zero external dependencies.
Keep it modular. Within reason, each component should be able to run independently for testing and development purposes.
Simple & Working > Complex & Broken. I want to have a functional version of the project as soon as possible, and then build on it from there.
Leading from that, clarity > cool tricks. I want my code to be easy to understand and follow.

Features

LLM-based Dialogue using Ollama. (configurable system prompt)
JSON-based Conversation Memory to keep context during interactions.
YAML-based Config for personality configuration.
Voice Activity Detection using Silero VAD.
Speech Recognition using Faster-Whisper.
Voice Generation using Resemble AI's Chatterbox.

Pipeline

Currently:

Takes in a user input from the console
Passes it to LLM model (with history)
Generates a response
Prints the output back to the console

Technical capability: (The code I've written works, I just don't have enough VRAM on my RTX 3060Ti to run the full pipeline)

Listens to your voice via microphone (Voice Activity Detection with Silero VAD)
Transcribes it with Faster-Whisper
Passes it to GPT (with history)
Generates a response
Synthesises a voice reply using Resemble AI's Chatterbox
Plays the output back to you

Pre-fork (original project):

~~Riko listens to your voice via microphone (push to talk)~~
~~Transcribes it with Faster-Whisper~~
~~Passes it to GPT (with history)~~
~~Generates a response~~
~~Synthesises Riko's voice using GPT-SoVITS~~
~~Plays the output back to you~~

Voice Generation and Speech Recognition are currently unused due to personally not having the hardware to run them, but the code is there and should work if you have the necessary resources.

JSON-based Conversation Memory is a placeholder, and will be replaced with a more robust solution like a vector database.

Configuration

All prompts and parameters are stored in character_config.yaml.

You can define personalities by modifying the config file.

Setup

Install Dependencies

pip install -r requirements.txt

Usage

1. Run the main script:

python main.py

Each module is technically designed to be capable of independent operation, mostly for testing purposes, but the main script will run the full application, and is the recommended way to experience the full functionality of the project.

TODO / Future Improvements

My own to-do list:

Remove all the external API calls and make everything run locally
Reorganise code structure, with better modularity and separation of concerns
Re-implement voice generation and speech recognition modules.
Replace JSON-based conversation memory with a vector database (probably ChromaDB)
Implement a dynamic emotional modelling system to hook into the LLM and voice synthesis to allow for more expressive and emotionally varied responses.

From the original project:

GUI or web interface
Live microphone input support
~~Emotion or tone control in speech synthesis~~ (edited and yoinked over to my own to-do list)
VRM model frontend

Credits

Just Rayen's Project Riko (https://github.com/rayenfeng/riko_project)
Ollama for the local LLM hosting solution
Chatterbox-TTS by Resemble AI (https://github.com/resemble-ai/chatterbox)
Faster-Whisper by SYSTRAN (https://github.com/SYSTRAN/faster-whisper)
Silero VAD by Silero AI (https://github.com/snakers4/silero-vad)

License

MIT — feel free to clone, modify, and build your own voice companion.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
character_files		character_files
modules		modules
.gitignore		.gitignore
README.md		README.md
character_config.yaml		character_config.yaml
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project Kiwi

Design Philosophy

Features

Pipeline

Configuration

Setup

Install Dependencies

Usage

1. Run the main script:

TODO / Future Improvements

Credits

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Project Kiwi

Design Philosophy

Features

Pipeline

Configuration

Setup

Install Dependencies

Usage

1. Run the main script:

TODO / Future Improvements

Credits

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages