iAmBoosted iamboosted

📊

Benchmarking

Benchmarks have been pulled until an SOP is in place and the models have been re-tested under much stricter conditions. I apologize for any inconvenience.

1 follower · 1 following

Popular repositories

Qwen3.5-9B-OSS-Distill Public

Distilling GPT-OSS reasoning traces into Qwen 3.5 9B to fix reasoning spirals — no-answer rate 36.2% → 0.5% on a hard holdout.

falcon-h1-slerp-merge Public

First SLERP merge of Mamba-2 hybrid LLMs (Falcon-H1-7B-Instruct × H1R-7B). Includes merge script, benchmarks, and architecture documentation.

Python

Zamba2-SLERP-Merge Public

SLERP merge of Zamba2-7B hybrid models. The merge succeeds, but the weight-sharing architecture prevents evaluation. Second in a series on non-transformer SLERP merging.

Python

falcon-h1-deep-reasoning Public

QLoRA math reasoning adapter for Falcon-H1-1.5B-Deep, the deepest Mamba-2 hybrid (66 layers, 1.5B params). 50% → 65% on math benchmarks with 2000 training examples.

Python

Qwen3.5-9B-Dense-To-Moe Public

Attempted dense-to-MoE conversion of Qwen 3.5 9B (DeltaNet hybrid) using CMoE and D2DMoE. Documents why post-hoc MoEfication fails on SwiGLU models without extensive sparsification. Negative result.
