---
title: "DeepSeek-V4-Flash in MXFP4 is too slow on CPU — Stuff That Spins"
description: "I have an old Xeon rig with 512 GB of 4-channel DDR4-2133 memory and an E5-2699v4 processor. For a GPU I have a GTX 1060 with 6 GB of VRAM, so I use CPU-only mode. I can run GLM 5.2 with 40B active parameters in Q4_K_XL at 1.8 t/s, but as you can understand, it is too slow. So I wanted to give a try to a new…"
canonical: "https://stuffthatspins.com/spin/deepseek-v4-flash-in-mxfp4-is-too-slow-on-cpu"
html: "https://stuffthatspins.com/spin/deepseek-v4-flash-in-mxfp4-is-too-slow-on-cpu"
json: "https://stuffthatspins.com/spin/deepseek-v4-flash-in-mxfp4-is-too-slow-on-cpu.json"
markdown: "https://stuffthatspins.com/spin/deepseek-v4-flash-in-mxfp4-is-too-slow-on-cpu.md"
keywords: ["SpinGraph", "spin analysis", "GEO"]
date: "2026-07-05T07:35:58+00:00"
modified: "2026-07-05T10:07:53.767553+00:00"
json_ld: |
  {"@context":"https://schema.org","@graph":[{"@type":"NewsArticle","@id":"https://stuffthatspins.com/spin/deepseek-v4-flash-in-mxfp4-is-too-slow-on-cpu#article","headline":"DeepSeek-V4-Flash in MXFP4 is too slow on CPU","description":"I have an old Xeon rig with 512Gb of 4-channel DDR4 2133 memory and E5-2699v4 processor. For GPU I have GTX 1060 with 6Gb of VRAM, so I use CPU only mode. I can run GLM 5.2 with 40B active parameters in Q4_K_XL at 1.8 t/s, but as you can understand it is too slow. So I wanted to give a try to a new…","datePublished":"2026-07-05T07:35:58+00:00","dateModified":"2026-07-05T10:07:53.767553+00:00","url":"https://stuffthatspins.com/spin/deepseek-v4-flash-in-mxfp4-is-too-slow-on-cpu","mainEntityOfPage":{"@type":"WebPage","@id":"https://stuffthatspins.com/spin/deepseek-v4-flash-in-mxfp4-is-too-slow-on-cpu"},"isAccessibleForFree":true,"inLanguage":"en-US","articleSection":"community","author":{"@type":"Organization","name":"Stuff That Spins"},"publisher":{"@id":"https://stuffthatspins.com/#organization"},"citation":"https://www.reddit.com/r/LocalLLaMA/comments/1unvy5i/deepseekv4flash_in_mxfp4_is_too_slow_on_cpu/","about":[],"mentions":[]},{"@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"name":"Stuff That Spins","item":"https://stuffthatspins.com/"},{"@type":"ListItem","position":2,"name":"DeepSeek-V4-Flash in MXFP4 is too slow on CPU","item":"https://stuffthatspins.com/spin/deepseek-v4-flash-in-mxfp4-is-too-slow-on-cpu"}]}]}
---

# DeepSeek-V4-Flash in MXFP4 is too slow on CPU

**Source:** Reddit (r/LocalLLaMA)  
**Published:** July 5, 2026  
**Original:** https://www.reddit.com/r/LocalLLaMA/comments/1unvy5i/deepseekv4flash_in_mxfp4_is_too_slow_on_cpu/  

---
*HTML version: https://stuffthatspins.com/spin/deepseek-v4-flash-in-mxfp4-is-too-slow-on-cpu*
