gpu-management

Here are 15 public repositories matching this topic...

Project-HAMi / HAMi

Heterogeneous GPU Sharing on Kubernetes

kubernetes cncf nvidia gpu-acceleration kubernetes-gpu-cluster metax device-plugin cambricon gpu-management ascend vgpu gpu-shareable gpu-virtualization hygon iluvatar mthreads vgpu-hypervisor

Updated May 27, 2026
Go

togettoyou / kpilot

Star

KPilot: Unified control plane for multi-cluster Kubernetes management, GPU compute scheduling, and model serving.

kubernetes job-scheduler cloud-native model-serving batch-systems kubernetes-gpu-cluster volcano gpu-management vgpu victoriametrics gpu-shareable gpu-virtualization llm victorialogs hami-core kpilot

Updated May 26, 2026
Go

NexusGPU / tensor-fusion-site

Star

TensorFusion landing page and product docs

ai tensorflow gpu cuda pytorch nvidia gpu-acceleration nvidia-cuda gpu-monitoring gpu-management gpu-usage gpu-virtualization rcuda gpu-sharing gpu-pooling

Updated Dec 31, 2025
Vue

xschahl / vAquila

Star

The Ollama developer experience with the vLLM production power. Deploy local LLMs via Docker with smart and automatic GPU VRAM management.

python docker cli production inference self-hosted nvidia self-hosting mlops gpu-management huggingface llm vllm ollama-alternative

Updated Mar 12, 2026
Python

dynamicheart / lockbot

Star

A smart GPU/node resource locking bot for teams — manage exclusive & shared access to cluster nodes and devices via chat commands or web UI.

python devops chatbot cluster-management vue3 gpu-management fastapi resource-scheduler feishu-bot team-tool lock-bot

Updated May 15, 2026
Python

HaNguyen-prog / whisperlivekit-enhanced

Star

🎤 Enhance speech-to-text with ultra-low-latency processing and smart GPU management for efficient, self-hosted solutions.

docker real-time ai pytorch lazy-loading multi-language speech-to-text whisper gpu-management fastapi

Updated May 27, 2026
Python

neosun100 / llasa-tts-8b-webui

Star

🎙️ High-quality Text-to-Speech system based on Llasa-8B with intelligent GPU memory management. Features: 96% memory savings, Web UI + REST API + MCP, auto GPU selection, Docker deployment.

docker flask text-to-speech ai deep-learning mcp transformers pytorch tts speech-synthesis gradio voice-cloning gpu-management llasa

Updated Dec 5, 2025
Python

An intelligent NVIDIA GPU task scheduler with real-time monitoring and automatic job execution. Features persistent task queues, daemon scheduling, and multi-notification support for efficient GPU resource management.

tui task-scheduler gpu-monitoring gpu-management

Updated Nov 24, 2025
Python

saravanabalagi / mask-gpu

Sponsor

Star

A simple tool to expose only specified number of GPUs with desired memory to Tensorflow

tensorflow-gpu gpu-cluster gpu-management mask-gpu expose-gpu

Updated Dec 20, 2019
Python

Xza85hrf / ML-Framework_Checker

Star

ML Framework and CUDA Checker is a Python-based GUI application for checking PyTorch, TensorFlow, and CUDA installations. It provides detailed system specs, compatibility checks, advanced GPU management, and offers options to view instructions, export logs, and update machine learning frameworks.

python machine-learning tensorflow cuda pytorch gui-application compatibility gpu-management system-checker system-specs

Updated Apr 24, 2026
Python

neosun100 / whisperlivekit-enhanced

Star

Ultra-low-latency speech-to-text with intelligent GPU management - Enhanced version with lazy loading, auto resource release, and modern multi-language UI

docker real-time ai pytorch lazy-loading multi-language speech-to-text whisper gpu-management fastapi

Updated Dec 3, 2025
Python

jaden3289 / llasa-tts-8b-webui

Star

🎙️ Generate high-quality speech from text with Llasa-TTS-8B, featuring intelligent GPU management and multi-language support for seamless integration.