I am on Surface Pro 11, ARM Windows - and yet only get CPU/GPU models.
Using CLI at 0.10.0+174be11ea7aeacd8d0d67b0ba1daebec615284b1 which I believe is the latest?
PS C:\Users\filip> foundry --version
0.10.0+174be11ea7aeacd8d0d67b0ba1daebec615284b1
PS C:\Users\filip> foundry model ls --device npu
╭────────────┬──────┬──────┬────────┬───────┬────────╮
│ Model Name │ Type │ Size │ Device │ Tools │ Cached │
╰────────────┴──────┴──────┴────────┴───────┴────────╯
PS C:\Users\filip> foundry model list
╭─────────────────────────────────────┬────────────┬─────────┬────────┬───────┬────────╮
│ Model Name │ Type │ Size │ Device │ Tools │ Cached │
├─────────────────────────────────────┼────────────┼─────────┼────────┼───────┼────────┤
│ deepseek-r1-distill-qwen-7b-qnn-npu │ Other │ — │ — │ · │ ● │
│ qwen3.5-9b │ Multimodal │ 9.4 GB │ GPU │ ● │ ● │
│ deepseek-r1-14b │ Chat │ 10.3 GB │ GPU │ ○ │ ○ │
│ deepseek-r1-7b │ Chat │ 5.6 GB │ GPU │ ○ │ ○ │
│ gpt-oss-20b │ Chat │ 11.8 GB │ GPU │ ○ │ ○ │
│ ministral-3-3b-instruct-2512 │ Multimodal │ 3.6 GB │ GPU │ ● │ ○ │
│ mistral-7b-v0.2 │ Chat │ 4.1 GB │ GPU │ ○ │ ○ │
│ mistral-nemo-12b-instruct │ Chat │ 7.3 GB │ GPU │ ● │ ○ │
│ nemotron-3.5-asr-streaming-0.6b │ Speech │ 756 MB │ CPU │ ○ │ ○ │
│ nemotron-speech-streaming-en-0.6b │ Speech │ 696 MB │ CPU │ ○ │ ○ │
│ nemotron-speech-streaming-es-0.6b │ Speech │ 729 MB │ CPU │ ○ │ ○ │
│ olmo-3-7b-instruct │ Chat │ 5.5 GB │ GPU │ ● │ ○ │
│ phi-3-mini-128k │ Chat │ 2.1 GB │ GPU │ ○ │ ○ │
│ phi-3-mini-4k │ Chat │ 2.1 GB │ GPU │ ○ │ ○ │
│ phi-3.5-mini │ Chat │ 2.2 GB │ GPU │ ○ │ ○ │
│ phi-4 │ Chat │ 8.4 GB │ GPU │ ○ │ ○ │
│ phi-4-mini │ Chat │ 3.7 GB │ GPU │ ● │ ○ │
│ phi-4-mini-reasoning │ Chat │ 3.1 GB │ GPU │ ○ │ ○ │
│ phi-4-reasoning │ Chat │ 8.4 GB │ GPU │ ○ │ ○ │
│ qwen2.5-0.5b │ Chat │ 700 MB │ GPU │ ● │ ○ │
│ qwen2.5-1.5b │ Chat │ 1.5 GB │ GPU │ ● │ ○ │
│ qwen2.5-14b │ Chat │ 9.3 GB │ GPU │ ● │ ○ │
│ qwen2.5-7b │ Chat │ 5.2 GB │ GPU │ ● │ ○ │
│ qwen2.5-coder-0.5b │ Chat │ 528 MB │ GPU │ ● │ ○ │
│ qwen2.5-coder-1.5b │ Chat │ 1.3 GB │ GPU │ ● │ ○ │
│ qwen2.5-coder-14b │ Chat │ 8.8 GB │ GPU │ ● │ ○ │
│ qwen2.5-coder-7b │ Chat │ 4.7 GB │ GPU │ ● │ ○ │
│ qwen3-0.6b │ Chat │ 529 MB │ GPU │ ● │ ○ │
│ qwen3-1.7b │ Chat │ 1.4 GB │ GPU │ ● │ ○ │
│ qwen3-14b │ Chat │ 9.9 GB │ GPU │ ● │ ○ │
│ qwen3-4b │ Chat │ 2.9 GB │ GPU │ ● │ ○ │
│ qwen3-8b │ Chat │ 6.0 GB │ GPU │ ● │ ○ │
│ qwen3-embedding-0.6b │ Embedding │ 515 MB │ GPU │ ○ │ ○ │
│ qwen3-embedding-8b │ Embedding │ 6.0 GB │ GPU │ ○ │ ○ │
│ qwen3-vl-2b-instruct │ Multimodal │ 1.3 GB │ CPU │ ● │ ○ │
│ qwen3-vl-4b-instruct │ Multimodal │ 2.7 GB │ CPU │ ● │ ○ │
│ qwen3-vl-8b-instruct │ Multimodal │ 5.0 GB │ CPU │ ● │ ○ │
│ qwen3.5-0.8b │ Multimodal │ 1.3 GB │ GPU │ ● │ ○ │
│ qwen3.5-2b │ Multimodal │ 3.1 GB │ GPU │ ● │ ○ │
│ qwen3.5-2b-text │ Chat │ 1.3 GB │ GPU │ ● │ ○ │
│ qwen3.5-4b │ Multimodal │ 5.4 GB │ GPU │ ● │ ○ │
│ smollm3-3b │ Chat │ 2.2 GB │ GPU │ ● │ ○ │
│ whisper-base │ Speech │ 205 MB │ CPU │ ○ │ ○ │
│ whisper-large-v3-turbo │ Speech │ 1.3 GB │ CPU │ ○ │ ○ │
│ whisper-medium │ Speech │ 937 MB │ CPU │ ○ │ ○ │
│ whisper-small │ Speech │ 438 MB │ CPU │ ○ │ ○ │
│ whisper-tiny │ Speech │ 131 MB │ CPU │ ○ │ ○ │
╰─────────────────────────────────────┴────────────┴─────────┴────────┴───────┴────────╯
System Information:
Processor Snapdragon(R) X 12-core X1E80100 @ 3.40 GHz (3.42 GHz)
Installed RAM 32.0 GB (31.6 GB usable)
System type 64-bit operating system, ARM-based processor
Pen and touch Pen and touch support with 10 touch points
Edition Windows 11 Home
Version 26H1
Installed on 10.01.2026
Evaluation expires on 11.08.2026 20:09
OS build 28020.1546
Experience Windows Feature Experience Pack 1000.26100.317.0
I am on Surface Pro 11, ARM Windows - and yet only get CPU/GPU models.
Using CLI at
0.10.0+174be11ea7aeacd8d0d67b0ba1daebec615284b1which I believe is the latest?System Information: