Franck Delmas Turbo31150

🏆 Trophées GitHub

⚡ JARVIS OS — Ce que je construis

Je remplace les services cloud IA à 500€/mois par un cluster GPU local. Même puissance. Zéro abonnement. Données 100% chez vous. RGPD natif.

┌─ SESSION CERTIFIÉE · 2026-06-08 ────────────────────────────────────┐
│  233 échanges  ·  319 tokens input  ·  96.9% cache hit              │
│  12 LLM locaux actifs  ·  0 appel cloud  ·  latence < 3s            │
│  Source : Anthropic Console + Wireshark packet capture               │
│  → Proof : github.com/Turbo31150/jarvis-core/blob/master/docs/PROOF │
└──────────────────────────────────────────────────────────────────────┘

🖥️ Architecture Cluster JARVIS OS

╔══════════════════════════════════════════════════════════════════════╗
║              RÉSEAU LOCAL 192.168.1.x — LAN 1 Gbps                  ║
╠══════════════════════╦═══════════════════════════════════════════════╣
║  M1 · La Créatrice   ║  M2 · Reasoning                               ║
║  192.168.1.85        ║  192.168.1.26                                  ║
║  ──────────────────  ║  ─────────────────────────────────────────    ║
║  RTX 3080   10 GB    ║  Quadro RTX 4000 ×3    8 GB × 3 = 24 GB      ║
║  RTX 2060   12 GB    ║                                                ║
║  GTX 1660S ×2  12 GB ║  LMS deepseek-r1-0528  :1234                  ║
║  ─────────────────── ║  sql-bridge             :8200                  ║
║  🐳 Swarm Manager    ║  content-scorer         :8300                  ║
║  15 services Docker  ║  pinecone-indexer       async                  ║
║  postgres  :5432     ║  redis-m2               :6380                  ║
║  redis     :6379     ║  LB fallback            :8080                  ║
║  n8n       :5678     ╠═══════════════════════════════════════════════╣
║  openclaw  :18789    ║  M5 · Bibliothèque                             ║
║  LMS qwen3 :1234     ║  192.168.1.113                                  ║
║  loadbal.  :80/443   ║  ─────────────────────────────────────────    ║
╠══════════════════════╣  GTX 1660S  6 GB + GTX 1050Ti 4 GB            ║
║  OL1 · Local         ║  Ollama :11434 · Actions :8400                 ║
║  127.0.0.1           ║  gemma3:4b · deepseek-r1 · qwen3:1.7b         ║
║  gemma3 · deepseek   ╠═══════════════════════════════════════════════╣
║  qwen3 · kimi-k2.5   ║  M4 · PAMERYS · 192.168.1.62                  ║
╚══════════════════════╩═══════════════════════════════════════════════╝

  Routing LLM automatique — cascade, 0 intervention :
  Ultra-rapide ──► OL1 : qwen3:1.7b     │ Rapide ──► M5 : gemma3:4b
  Qualité ──────► M1  : qwen3.5-9b      │ Reasoning ► M2 : deepseek-r1-0528

🐧 Linux Vanilla → JARVIS Linux

Ubuntu recconfiguré pour être un OS IA de production. Chaque paramètre a une raison.

Couche	Linux Vanilla	JARVIS Linux	Gain mesuré
Swap	HDD/SSD (lent)	ZRAM 16 GB lz4 — 0 I/O disque	+35% RAM effective
CPU	powersave 1.9 GHz	HWP boost → 3.5 GHz auto	+84% fréquence
swappiness	60 (défaut)	5 — RAM prioritaire	moins d'échange disque
Kernel	générique	6.8.0-117 + CUDA 12.x	GPU Docker natif
GPU Docker	❌ non exposé	nvidia-container-runtime	CUDA dans containers
Réseau	défaut distro	DNS + iptables DOCKER-USER	isolation réseau
Services	~80 systemd	80 + 38 user-services JARVIS	38 agents IA live
Orchestration	aucune	Swarm 3 nœuds + overlay net	HA automatique
Stockage LLM	HDD/SSD SATA	NVMe migration (×2.21 perf)	+121% I/O
Boot order	aléatoire	postgres→redis→openclaw→LB	démarrage garanti

Services JARVIS actifs au démarrage (M2 — extrait) :

jarvis-domino.service      → Domino auto-trigger engine (chaînes correctives)
lmstudio.service           → LM Studio headless via Xvfb :1234
jarvis-pipeline.service    → Pipeline traitement agents
content-scorer.service     → Scoring contenu /api/score :8300
sql-bridge.service         → Proxy HTTP → Postgres M1 :8200
... + 33 autres services

🆚 JARVIS OS vs Freelance — Tâche réelle chronométrée

Mission : intégrer un chatbot IA RAG (base de connaissances PDF) sur site client Mission la plus commandée en 2025 — catégorie "Développement IA" · Malt Baromètre 2025

👤 Freelance humain — marché 2025

Étape	Temps	Coût
Analyse besoin + setup environnement	3 h	195 €
RAG : chunking, embeddings Pinecone	4 h	260 €
API chatbot FastAPI + endpoints CRUD	6 h	390 €
Intégration frontend + widget JS	4 h	260 €
Tests, corrections, mise en prod	3 h	195 €
TOTAL	20 h	1 300 €

Sources : Malt Baromètre 2025 (Dev IA senior : 55–90 €/h) · Codeur.fr 2025 (chatbot IA : 800–2 000 €) · Upwork AI Report 2024 (AI specialist : $75/h médiane)

🤖 JARVIS OS — même mission

# 1 commande, dispatché en parallèle sur 4 agents spécialisés
python3 ~/jarvis/agent-dispatch/universal-dispatch.py query \
  "Crée chatbot RAG Pinecone+FastAPI, chunking PDF, widget JS, docker-compose prod"

# Exécution automatique :
# [agent data-pipeline]  → chunking + embeddings Pinecone          ≈ 3 min
# [agent code-ops]       → FastAPI + CRUD + endpoints              ≈ 8 min
# [agent browser-admin]  → tests automatisés Playwright            ≈ 2 min
# [agent cowork-system]  → docker-compose + déploiement            ≈ 3 min
# ─────────────────────────────────────────────────────────────────
#  Durée totale : 16 à 25 minutes   │   Coût tokens : 0 € (LLM local)

Métrique	👤 Freelance	🤖 JARVIS OS	Écart
Temps	20 heures	25 minutes	-98%
Coût	1 300 €	0 €	-100%
Révisions	Refacturées	Illimitées	∞
Disponibilité	Heures ouvrées	24 h/24 · 7 j/7	Toujours
Parallélisme	1 tâche	249 agents simultanés	×249
Confidentialité	Données partagées	100% local · RGPD natif	✅

📄 Preuve reproductible — 96.9% cache hit certifié →

📦 Projets — 16 repos

🔴 JARVIS OS — Infrastructure IA (6 repos)

Repo	Ce que ça fait en production	Stack	⭐
jarvis-core	Moteur central : routing LLM cascade, 15 services Swarm, 53 actions zéro-token, auto-healing	Python · Swarm · MCP · Redis · Postgres	1
jarvis-cowork	Workspace 249 agents spécialisés, 86 skills hot-loadées, auto-QA, OpenClaw 42 agents sandbox	Python · Claude Agent SDK · MCP	2
jarvis-mcp-toolkit	88+ handlers MCP plug & play : SQL, GPU, Docker, Web, Telegram, Pinecone	Python · Node.js · MCP	0
jarvis-browser-mcp	Browser automation CDP natif, alternative Playwright : injection DOM, multi-onglets, capture réseau	Python · CDP · MCP	1
jarvis-whisper-flow	Pipeline voix on-premise <300ms : wake-word → Whisper GPU → LLM → Piper TTS	Python · PyTorch · CUDA · Whisper	1
jarvis-profile	Dashboard monitoring GPU/services temps réel, WebSocket live, interface cyberpunk	Python · WebSocket · nvidia-smi	5

🟠 Trading & Finance (3 repos)

Repo	Ce que ça fait	Stack	⭐
TradeOracle	Consensus multi-LLM (Claude+GPT+Gemini), signaux futures MEXC, Monte Carlo backtesting, risk management	Python · FastAPI · MEXC API · Elastic	2
TradeOracle-Nexus-Elastic	Backend analytique : Elasticsearch, 10 000 itérations Monte Carlo, analytics temps réel	Python · Elasticsearch · FastAPI	1
gemini-live-trading-agent	Trading à la voix : parole → analyse → exécution ordre, hands-free — Gemini API Competition	Python · Gemini Live API	0

🟡 Sécurité & Hackathons (1 repo)

Repo	Ce que ça fait	Stack	⭐
find-evil-jarvis	Détection menaces multi-agents : analyse logs SIEM, corrélation anomalies, rapports SOC auto — FIND EVIL 2026 ($22K)	Python · Claude SDK · SIEM · MCP	0

🟢 SaaS & Produits (3 repos)

Repo	Ce que ça fait	Stack
passcerfa-site	Pré-remplissage CERFA automatique via FranceConnect+ — zéro ressaisie	JavaScript · FranceConnect+ API
babysmart-platform	Marketplace babysitting : matching IA, scheduling auto, Telegram, workflows N8N	HTML · JS · N8N · REST
transcription-multi-langue	Whisper GPU optimisé CUDA, 50+ langues, transcription temps réel	Python · TypeScript · CUDA

📚 Open Source & Ressources (3 repos)

Repo	Ce que ça fait	⭐
bibliotheque-prompts-multi-ia	200+ prompts testés en production sur Claude / GPT / Gemini / Mistral avec benchmarks comparatifs	1
awesome-local-ai	Curated list : tout l'écosystème IA local — LLMs, embeddings, voice, tools 100% self-hosted	1
franckdelmas.dev	Portfolio professionnel — AI Systems Architect · JARVIS OS Creator	1

🛠️ Stack technique

💼 Services freelance

Service	Livrable	Délai	Prix
🔍 Audit IA locale	Rapport diagnostic + plan d'action	1 jour	440 €
🚀 Déploiement cluster LLM	Cluster GPU opérationnel on-premise	3–5 j	Devis
🤖 Intégration Claude Code / MCP	Agents + handlers actifs	2–3 j	Devis
☁️ Migration cloud → local	Infrastructure souveraine RGPD	1–2 sem	Devis
🎓 Formation équipe	Session pratique + support	½ j	220 €

→ Portfolio complet & Contact

📊 GitHub Stats

JARVIS OS tourne 24/7 en production sur matériel personnel — Toulouse, France

Provide feedback

Saved searches

Use saved searches to filter your results more quickly