Heterogeneous GPU Sharing on Kubernetes
-
Updated
May 27, 2026 - Go
Heterogeneous GPU Sharing on Kubernetes
KPilot: Unified control plane for multi-cluster Kubernetes management, GPU compute scheduling, and model serving.
TensorFusion landing page and product docs
The Ollama developer experience with the vLLM production power. Deploy local LLMs via Docker with smart and automatic GPU VRAM management.
A smart GPU/node resource locking bot for teams — manage exclusive & shared access to cluster nodes and devices via chat commands or web UI.
🎤 Enhance speech-to-text with ultra-low-latency processing and smart GPU management for efficient, self-hosted solutions.
🎙️ High-quality Text-to-Speech system based on Llasa-8B with intelligent GPU memory management. Features: 96% memory savings, Web UI + REST API + MCP, auto GPU selection, Docker deployment.
An intelligent NVIDIA GPU task scheduler with real-time monitoring and automatic job execution. Features persistent task queues, daemon scheduling, and multi-notification support for efficient GPU resource management.
A simple tool to expose only specified number of GPUs with desired memory to Tensorflow
ML Framework and CUDA Checker is a Python-based GUI application for checking PyTorch, TensorFlow, and CUDA installations. It provides detailed system specs, compatibility checks, advanced GPU management, and offers options to view instructions, export logs, and update machine learning frameworks.
Ultra-low-latency speech-to-text with intelligent GPU management - Enhanced version with lazy loading, auto resource release, and modern multi-language UI
🎙️ Generate high-quality speech from text with Llasa-TTS-8B, featuring intelligent GPU management and multi-language support for seamless integration.
GPU management on multi-GPU platforms
Multi-server GPU slot management for LM Studio and other OpenAI-compatible inference backends
a Vulkan-based Multi-Process Service (MPS)
Add a description, image, and links to the gpu-management topic page so that developers can more easily learn about it.
To associate your repository with the gpu-management topic, visit your repo's landing page and select "manage topics."