GitHub - flagos-ai/KernelGenBench

Overview

KernelGenBench is a component of FlagOS — a unified, open-source AI system software stack that fosters an open technology ecosystem by seamlessly integrating various models, systems, and chips. Following the principle of "develop once, migrate across various chips", FlagOS aims to unlock the full computational potential of hardware, break down barriers between different chip software stacks, and effectively reduce migration costs.

KernelGenBench is a benchmark framework for evaluating LLM and agent-based Triton kernel generation across multiple hardware platforms.

Features

210 operators across three sources: ATen (110), vLLM (50), cuBLAS (50)
Multi-chip support: NVIDIA, Ascend NPU, MUSA, Hygon DCU, Iluvatar, MetaX
Two evaluation tracks: LLM Track (Pass@K) and Agent Track (iterative generation)
Multiple agent methods: Claude Code, OpenCode, AutoKernel, AKO4ALL, cuda-optimized-skill
Automatic verification: accuracy testing with tolerance-based comparison

Quick Start

# NVIDIA platform
pip install -r requirements/requirements_nvidia.txt
pip install -e .

# Test single operator
python scripts/generate_kernel_and_verify.py \
    --op-name aten::add \
    --single-test \
    --server-type openai

👉 For detailed setup, see Getting Started.

Documentation

📚 Full documentation: docs/source/

Section	Description
Overview	What is KernelGenBench and why use it
Getting Started	Installation for all platforms
LLM Track	Pass@K evaluation guide
Agent Track	Agent-based evaluation guide
Reference	Datasets, operators, hardware
Development	Contributing and extending
FAQ	Common questions

Related Projects

Project	Description
awesome-LLM-driven-kernel-generation	Survey of AI-driven kernel generation
KernelGen	High-performance platform for automated Triton kernel generation

Citation

@software{kernelgenbench2026,
  title={KernelGenBench: A Benchmark for LLM and Agent-Based Triton Kernel Generation},
  author={KernelGen Team},
  url={https://github.com/flagos-ai/KernelGenBench},
  year={2026}
}

License

Apache 2.0 License

Name		Name	Last commit message	Last commit date
Latest commit History 477 Commits
.github		.github
agent_bench		agent_bench
docs		docs
requirements		requirements
scripts		scripts
sota_agents		sota_agents
src		src
.coderabbit.yaml		.coderabbit.yaml
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
README.zh-CN.md		README.zh-CN.md
THIRD_PARTY_NOTICES.md		THIRD_PARTY_NOTICES.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Features

Quick Start

Documentation

Related Projects

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Overview

Features

Quick Start

Documentation

Related Projects

Citation

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages