[cli] no NPU models shown on Surface Pro 11 #797

@filipw

Description

I am on a Surface Pro 11 running Windows on ARM, yet the CLI only lists CPU and GPU models. Filtering by NPU returns an empty table.

I am using CLI version 0.10.0+174be11ea7aeacd8d0d67b0ba1daebec615284b1, which I believe is the latest.

PS C:\Users\filip> foundry --version
0.10.0+174be11ea7aeacd8d0d67b0ba1daebec615284b1
PS C:\Users\filip> foundry model ls --device npu
╭────────────┬──────┬──────┬────────┬───────┬────────╮
│ Model Name │ Type │ Size │ Device │ Tools │ Cached │
╰────────────┴──────┴──────┴────────┴───────┴────────╯
PS C:\Users\filip> foundry model list
╭─────────────────────────────────────┬────────────┬─────────┬────────┬───────┬────────╮
│ Model Name                          │ Type       │ Size    │ Device │ Tools │ Cached │
├─────────────────────────────────────┼────────────┼─────────┼────────┼───────┼────────┤
│ deepseek-r1-distill-qwen-7b-qnn-npu │ Other      │ —       │ —      │ ·     │ ●      │
│ qwen3.5-9b                          │ Multimodal │ 9.4 GB  │ GPU    │ ●     │ ●      │
│ deepseek-r1-14b                     │ Chat       │ 10.3 GB │ GPU    │ ○     │ ○      │
│ deepseek-r1-7b                      │ Chat       │ 5.6 GB  │ GPU    │ ○     │ ○      │
│ gpt-oss-20b                         │ Chat       │ 11.8 GB │ GPU    │ ○     │ ○      │
│ ministral-3-3b-instruct-2512        │ Multimodal │ 3.6 GB  │ GPU    │ ●     │ ○      │
│ mistral-7b-v0.2                     │ Chat       │ 4.1 GB  │ GPU    │ ○     │ ○      │
│ mistral-nemo-12b-instruct           │ Chat       │ 7.3 GB  │ GPU    │ ●     │ ○      │
│ nemotron-3.5-asr-streaming-0.6b     │ Speech     │ 756 MB  │ CPU    │ ○     │ ○      │
│ nemotron-speech-streaming-en-0.6b   │ Speech     │ 696 MB  │ CPU    │ ○     │ ○      │
│ nemotron-speech-streaming-es-0.6b   │ Speech     │ 729 MB  │ CPU    │ ○     │ ○      │
│ olmo-3-7b-instruct                  │ Chat       │ 5.5 GB  │ GPU    │ ●     │ ○      │
│ phi-3-mini-128k                     │ Chat       │ 2.1 GB  │ GPU    │ ○     │ ○      │
│ phi-3-mini-4k                       │ Chat       │ 2.1 GB  │ GPU    │ ○     │ ○      │
│ phi-3.5-mini                        │ Chat       │ 2.2 GB  │ GPU    │ ○     │ ○      │
│ phi-4                               │ Chat       │ 8.4 GB  │ GPU    │ ○     │ ○      │
│ phi-4-mini                          │ Chat       │ 3.7 GB  │ GPU    │ ●     │ ○      │
│ phi-4-mini-reasoning                │ Chat       │ 3.1 GB  │ GPU    │ ○     │ ○      │
│ phi-4-reasoning                     │ Chat       │ 8.4 GB  │ GPU    │ ○     │ ○      │
│ qwen2.5-0.5b                        │ Chat       │ 700 MB  │ GPU    │ ●     │ ○      │
│ qwen2.5-1.5b                        │ Chat       │ 1.5 GB  │ GPU    │ ●     │ ○      │
│ qwen2.5-14b                         │ Chat       │ 9.3 GB  │ GPU    │ ●     │ ○      │
│ qwen2.5-7b                          │ Chat       │ 5.2 GB  │ GPU    │ ●     │ ○      │
│ qwen2.5-coder-0.5b                  │ Chat       │ 528 MB  │ GPU    │ ●     │ ○      │
│ qwen2.5-coder-1.5b                  │ Chat       │ 1.3 GB  │ GPU    │ ●     │ ○      │
│ qwen2.5-coder-14b                   │ Chat       │ 8.8 GB  │ GPU    │ ●     │ ○      │
│ qwen2.5-coder-7b                    │ Chat       │ 4.7 GB  │ GPU    │ ●     │ ○      │
│ qwen3-0.6b                          │ Chat       │ 529 MB  │ GPU    │ ●     │ ○      │
│ qwen3-1.7b                          │ Chat       │ 1.4 GB  │ GPU    │ ●     │ ○      │
│ qwen3-14b                           │ Chat       │ 9.9 GB  │ GPU    │ ●     │ ○      │
│ qwen3-4b                            │ Chat       │ 2.9 GB  │ GPU    │ ●     │ ○      │
│ qwen3-8b                            │ Chat       │ 6.0 GB  │ GPU    │ ●     │ ○      │
│ qwen3-embedding-0.6b                │ Embedding  │ 515 MB  │ GPU    │ ○     │ ○      │
│ qwen3-embedding-8b                  │ Embedding  │ 6.0 GB  │ GPU    │ ○     │ ○      │
│ qwen3-vl-2b-instruct                │ Multimodal │ 1.3 GB  │ CPU    │ ●     │ ○      │
│ qwen3-vl-4b-instruct                │ Multimodal │ 2.7 GB  │ CPU    │ ●     │ ○      │
│ qwen3-vl-8b-instruct                │ Multimodal │ 5.0 GB  │ CPU    │ ●     │ ○      │
│ qwen3.5-0.8b                        │ Multimodal │ 1.3 GB  │ GPU    │ ●     │ ○      │
│ qwen3.5-2b                          │ Multimodal │ 3.1 GB  │ GPU    │ ●     │ ○      │
│ qwen3.5-2b-text                     │ Chat       │ 1.3 GB  │ GPU    │ ●     │ ○      │
│ qwen3.5-4b                          │ Multimodal │ 5.4 GB  │ GPU    │ ●     │ ○      │
│ smollm3-3b                          │ Chat       │ 2.2 GB  │ GPU    │ ●     │ ○      │
│ whisper-base                        │ Speech     │ 205 MB  │ CPU    │ ○     │ ○      │
│ whisper-large-v3-turbo              │ Speech     │ 1.3 GB  │ CPU    │ ○     │ ○      │
│ whisper-medium                      │ Speech     │ 937 MB  │ CPU    │ ○     │ ○      │
│ whisper-small                       │ Speech     │ 438 MB  │ CPU    │ ○     │ ○      │
│ whisper-tiny                        │ Speech     │ 131 MB  │ CPU    │ ○     │ ○      │
╰─────────────────────────────────────┴────────────┴─────────┴────────┴───────┴────────╯
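For anyone triaging this, here is a minimal Python sketch that tallies the Device column from pasted CLI output like the listing above, and flags models whose name mentions "npu" but whose Device is not NPU. It is not part of the Foundry CLI. The function names (`parse_table`, `device_counts`, `npu_named_without_npu_device`) are hypothetical, and the sample rows are copied from the listing in this report.

```python
from collections import Counter

# Three rows copied verbatim from the `foundry model list` output above.
SAMPLE = """\
│ Model Name                          │ Type       │ Size    │ Device │ Tools │ Cached │
├─────────────────────────────────────┼────────────┼─────────┼────────┼───────┼────────┤
│ deepseek-r1-distill-qwen-7b-qnn-npu │ Other      │ —       │ —      │ ·     │ ●      │
│ qwen3.5-9b                          │ Multimodal │ 9.4 GB  │ GPU    │ ●     │ ●      │
│ whisper-tiny                        │ Speech     │ 131 MB  │ CPU    │ ○     │ ○      │
"""


def parse_table(text):
    """Turn a box-drawing table into a list of dicts keyed by the header row."""
    header = None
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("│"):
            continue  # skip borders, separators, and shell prompts
        cells = [cell.strip() for cell in line.strip("│").split("│")]
        if header is None:
            header = cells  # first data-shaped line is the header
        else:
            rows.append(dict(zip(header, cells)))
    return rows


def device_counts(rows):
    """Count how many models are listed per value of the Device column."""
    return Counter(row["Device"] for row in rows)


def npu_named_without_npu_device(rows):
    """Names containing 'npu' whose Device column is not reported as NPU."""
    return [
        row["Model Name"]
        for row in rows
        if "npu" in row["Model Name"].lower() and row["Device"].upper() != "NPU"
    ]


if __name__ == "__main__":
    parsed = parse_table(SAMPLE)
    print(dict(device_counts(parsed)))
    print(npu_named_without_npu_device(parsed))
```

Run against the full listing, this kind of tally shows at a glance that no row reports NPU as its device, and that the only NPU-named entry is the cached one with an empty Device column.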

System Information:

Processor	Snapdragon(R) X 12-core X1E80100 @ 3.40 GHz (3.42 GHz)
Installed RAM	32.0 GB (31.6 GB usable)
System type	64-bit operating system, ARM-based processor
Pen and touch	Pen and touch support with 10 touch points
Edition	Windows 11 Home
Version	26H1
Installed on	10.01.2026
Evaluation expires on	11.08.2026 20:09
OS build	28020.1546
Experience	Windows Feature Experience Pack 1000.26100.317.0
