huggingface / candle Public

Fork 1.6k
Star 20.6k

Pull requests: huggingface/candle

Labels 11 Milestones 0

273 Open 2,291 Closed

Pull requests list

doc: yolo26-rs external

#3677 opened Jun 28, 2026 by wzh19960613

docs: yolo26-wasm demo

#3676 opened Jun 28, 2026 by wzh19960613

Support ONNX DynamicQuantizeLinear

#3675 opened Jun 27, 2026 by jesco-absolut

Support ONNX Mod

#3674 opened Jun 27, 2026 by jesco-absolut

Support ONNX MaxPool padding

#3673 opened Jun 27, 2026 by jesco-absolut

Support BoolStorage in pickle loader

#3672 opened Jun 27, 2026 by jesco-absolut

Support linear Resize in candle-onnx

#3671 opened Jun 27, 2026 by jesco-absolut

Support llama GGUF tokenizers without explicit merges

#3670 opened Jun 27, 2026 by jesco-absolut

Add CUDA Graph capture for decode + FlashInfer-style decode-attention backend (CUDA/Metal/CPU) (#3651)

#3669 opened Jun 27, 2026 by astorise

cpu-optimization: wire f16kv changes to Qwen3 (and remove feature toggles for PR 3664-3667 27% gain in prefill, 14% gain decode)

#3668 opened Jun 27, 2026 by DrJesseGlass Contributor

cpu-optimization: aarch64 CPU prefill lane=row Q4_K kernel + a packed Q6_K dtype (20% prefill gain at 6 threads and 512 tokens)

#3667 opened Jun 27, 2026 by DrJesseGlass Contributor

Fused CPU neox RoPE for decode

#3666 opened Jun 27, 2026 by DrJesseGlass Contributor

cpu-optimize: add f16 KV cache to the CPU flash-attention path (14% decode gain) vec approx expf (2% decode gain)

#3665 opened Jun 27, 2026 by DrJesseGlass Contributor

cpu-optimize: Parallelize contiguous f32 elementwise ops over the barrier pool (prefill 8% gain)

#3664 opened Jun 27, 2026 by DrJesseGlass Contributor

cuda: reject launch configs above u32::MAX

#3663 opened Jun 27, 2026 by ogarciarevett

Add block-wise FP8 quantized linear layer support (#3650)

#3662 opened Jun 27, 2026 by astorise

5 tasks done

Add AWQ quantized linear layer support and unify with GPTQ (#3650)

#3661 opened Jun 27, 2026 by astorise

6 tasks done

Add GPTQ quantized linear layer support for Qwen2 (#3650)

#3660 opened Jun 27, 2026 by astorise

6 tasks done

Add device-agnostic (CPU/Metal) PagedAttention to complement the CUDA kernels (#3655)

#3657 opened Jun 25, 2026 by astorise

candle-onnx: add pipeline-parallel evaluation via simple_eval_with_placement

#3648 opened Jun 25, 2026 by astorise

Upstream onnx device candle-onnx: propagate device through simple_eval instead of hard-coding CPU

#3647 opened Jun 25, 2026 by astorise

Fix Qwen3 causal mask for batch_size > 1

#3646 opened Jun 24, 2026 by Olcmyk

Fix Metal device creation panic with bounds check (fixes #3566)

#3645 opened Jun 24, 2026 by Olcmyk

Load added tokens from GGUF metadata (fixes missing <think>/</think> on Qwen)

#3641 opened Jun 23, 2026 by jaweed3

Metal apple silicon fixes

#3640 opened Jun 22, 2026 by tyt2y3
