FP8 fails to launch on H20

by darvec - opened Jul 31, 2025

darvec

Jul 31, 2025

Can four H20 GPUs launch Qwen3-Coder-480B-A35B-Instruct-FP8? The total VRAM is smaller than the model, but doesn't MoE avoid needing all of the weights in VRAM?

10 days ago


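MoE only skips part of the weights during inference, since just a subset of the experts is activated per token. At deployment time, loading the model still requires all of the weights.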
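A rough back-of-the-envelope check, as a sketch only. It assumes 96 GB of VRAM per H20 (an assumption; check your actual card), takes the nominal 480B parameter count from the model name, and counts 1 byte per parameter for FP8. It ignores the KV cache, activations, and framework overhead, all of which make the real requirement larger. The function name `weights_fit` is made up for illustration.

```python
def weights_fit(total_params_b, bytes_per_param, num_gpus, gpu_mem_gb):
    """Compare raw weight size against total VRAM (decimal GB, weights only)."""
    # Billions of parameters times bytes per parameter gives size in GB.
    weights_gb = total_params_b * bytes_per_param
    total_gb = num_gpus * gpu_mem_gb
    return weights_gb, total_gb, weights_gb <= total_gb


# Assumed values: 480B total parameters, FP8 = 1 byte each, 4 x 96 GB H20.
weights_gb, total_gb, fits = weights_fit(480, 1, 4, 96)
print(f"weights ~{weights_gb} GB, total VRAM {total_gb} GB, fits: {fits}")

# Only ~35B parameters are active per token (the "A35B" in the name),
# but every expert must still be resident when the model is loaded.
active_gb = 35 * 1
print(f"active per token ~{active_gb} GB, but load needs ~{weights_gb} GB")
```

Under these assumptions the weights alone come to about 480 GB against 384 GB of VRAM, so the load would fail before inference even starts.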
