---
title: "Is dSpark, dflash, MTP, QAT, and similar tech going to increase inference speed enough to where model spillover to disk will be more tolerable? — Stuff That Spins"
description: "We’re seeing all these performance boosts coming to inference lately with things like dSpark, dllash, MTP, etc. and I know the whole model spillover-to-disk has always been the inflection point where a model would go from maybe a barely acceptable 4 to 5 tokens per second to like a completely unusa…"
	canonical: "https://stuffthatspins.com/spin/is-dspark-dflash-mtp-qat-and-similar-tech-going-to-increase-inference-speed-enough-to-where-model-spillover-to-disk-will"
html: "https://stuffthatspins.com/spin/is-dspark-dflash-mtp-qat-and-similar-tech-going-to-increase-inference-speed-enough-to-where-model-spillover-to-disk-will"
json: "https://stuffthatspins.com/spin/is-dspark-dflash-mtp-qat-and-similar-tech-going-to-increase-inference-speed-enough-to-where-model-spillover-to-disk-will.json"
markdown: "https://stuffthatspins.com/spin/is-dspark-dflash-mtp-qat-and-similar-tech-going-to-increase-inference-speed-enough-to-where-model-spillover-to-disk-will.md"
keywords: ["SpinGraph", "spin analysis", "GEO"]
date: "2026-07-04T11:14:47+00:00"
modified: "2026-07-04T14:02:02.25169+00:00"
json_ld: |
  {"@context":"https://schema.org","@graph":[{"@type":"NewsArticle","@id":"https://stuffthatspins.com/spin/is-dspark-dflash-mtp-qat-and-similar-tech-going-to-increase-inference-speed-enough-to-where-model-spillover-to-disk-will#article","headline":"Is dSpark, dflash, MTP, QAT, and similar tech going to increase inference speed enough to where model spillover to disk will be more tolerable?","description":"We’re seeing all these performance boosts coming to inference lately with things like dSpark, dllash, MTP, etc. and I know the whole model spillover-to-disk has always been the inflection point where a model would go from maybe a barely acceptable 4 to 5 tokens per second to like a completely unusa…","datePublished":"2026-07-04T11:14:47+00:00","dateModified":"2026-07-04T14:02:02.25169+00:00","url":"https://stuffthatspins.com/spin/is-dspark-dflash-mtp-qat-and-similar-tech-going-to-increase-inference-speed-enough-to-where-model-spillover-to-disk-will","mainEntityOfPage":{"@type":"WebPage","@id":"https://stuffthatspins.com/spin/is-dspark-dflash-mtp-qat-and-similar-tech-going-to-increase-inference-speed-enough-to-where-model-spillover-to-disk-will"},"isAccessibleForFree":true,"inLanguage":"en-US","articleSection":"community","author":{"@type":"Organization","name":"Stuff That Spins"},"publisher":{"@id":"https://stuffthatspins.com/#organization"},"citation":"https://www.reddit.com/r/LocalLLaMA/comments/1un6f8u/is_dspark_dflash_mtp_qat_and_similar_tech_going/","about":[],"mentions":[]},{"@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"name":"Stuff That Spins","item":"https://stuffthatspins.com/"},{"@type":"ListItem","position":2,"name":"Is dSpark, dflash, MTP, QAT, and similar tech going to increase inference speed enough to where model spillover to disk will be more tolerable?","item":"https://stuffthatspins.com/spin/is-dspark-dflash-mtp-qat-and-similar-tech-going-to-increase-inference-speed-enough-to-where-model-spillover-to-disk-will"}]}]}
---

# Is dSpark, dflash, MTP, QAT, and similar tech going to increase inference speed enough to where model spillover to disk will be more tolerable?

**Source:** Unknown  
**Published:** July 4, 2026  
**Original:** https://www.reddit.com/r/LocalLLaMA/comments/1un6f8u/is_dspark_dflash_mtp_qat_and_similar_tech_going/  

---
*HTML version: https://stuffthatspins.com/spin/is-dspark-dflash-mtp-qat-and-similar-tech-going-to-increase-inference-speed-enough-to-where-model-spillover-to-disk-will*
