---
title: "Qwen3.6 27B on a 5090, 6.4k sample tok/s distribution after tuning MTP/cache settings — Stuff That Spins"
description: "Spent a while tuning llama.cpp for Qwen3.6 27B on a 9800X3D / 64GB / 5090 box and wanted to share the real distribution instead of just a headline number, since averages hide a lot. Ran with q8 KV cache, 192k context, MTP draft=10, spec-draft-p-min=0.5, batch/ubatch 512. Logged 6,454 samples across…"
canonical: "https://stuffthatspins.com/spin/qwen36-27b-on-a-5090-64k-sample-toks-distribution-after-tuning-mtpcache-settings"
html: "https://stuffthatspins.com/spin/qwen36-27b-on-a-5090-64k-sample-toks-distribution-after-tuning-mtpcache-settings"
json: "https://stuffthatspins.com/spin/qwen36-27b-on-a-5090-64k-sample-toks-distribution-after-tuning-mtpcache-settings.json"
markdown: "https://stuffthatspins.com/spin/qwen36-27b-on-a-5090-64k-sample-toks-distribution-after-tuning-mtpcache-settings.md"
keywords: ["SpinGraph", "spin analysis", "GEO"]
date: "2026-07-04T15:11:17+00:00"
modified: "2026-07-04T19:02:16.140887+00:00"
json_ld: |
  {"@context":"https://schema.org","@graph":[{"@type":"NewsArticle","@id":"https://stuffthatspins.com/spin/qwen36-27b-on-a-5090-64k-sample-toks-distribution-after-tuning-mtpcache-settings#article","headline":"Qwen3.6 27B on a 5090, 6.4k sample tok/s distribution after tuning MTP/cache settings","description":"Spent a while tuning llama.cpp for Qwen3.6 27B on a 9800X3D / 64GB / 5090 box and wanted to share the real distribution instead of just a headline number, since averages hide a lot. Ran with q8 KV cache, 192k context, MTP draft=10, spec-draft-p-min=0.5, batch/ubatch 512. Logged 6,454 samples across…","datePublished":"2026-07-04T15:11:17+00:00","dateModified":"2026-07-04T19:02:16.140887+00:00","url":"https://stuffthatspins.com/spin/qwen36-27b-on-a-5090-64k-sample-toks-distribution-after-tuning-mtpcache-settings","mainEntityOfPage":{"@type":"WebPage","@id":"https://stuffthatspins.com/spin/qwen36-27b-on-a-5090-64k-sample-toks-distribution-after-tuning-mtpcache-settings"},"isAccessibleForFree":true,"inLanguage":"en-US","articleSection":"community","author":{"@type":"Organization","name":"Stuff That Spins"},"publisher":{"@id":"https://stuffthatspins.com/#organization"},"citation":"https://www.reddit.com/r/LocalLLaMA/comments/1unbi4a/qwen36_27b_on_a_5090_64k_sample_toks_distribution/","about":[],"mentions":[]},{"@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"name":"Stuff That Spins","item":"https://stuffthatspins.com/"},{"@type":"ListItem","position":2,"name":"Qwen3.6 27B on a 5090, 6.4k sample tok/s distribution after tuning MTP/cache settings","item":"https://stuffthatspins.com/spin/qwen36-27b-on-a-5090-64k-sample-toks-distribution-after-tuning-mtpcache-settings"}]}]}
---

# Qwen3.6 27B on a 5090, 6.4k sample tok/s distribution after tuning MTP/cache settings

**Source:** r/LocalLLaMA (Reddit)  
**Published:** July 4, 2026  
**Original:** https://www.reddit.com/r/LocalLLaMA/comments/1unbi4a/qwen36_27b_on_a_5090_64k_sample_toks_distribution/  

---
*HTML version: https://stuffthatspins.com/spin/qwen36-27b-on-a-5090-64k-sample-toks-distribution-after-tuning-mtpcache-settings*
