---
title: "Ran a classic(medival europe) fantasy RP/agentic benchmark across 8 local models Qwen3.6-27B held up better than its size suggests — Stuff That Spins"
description: "Threw together a benchmark suite (quest completion, scene endings, item/time tracking, character detection, storytelling, drafting) and ran it across 8 models people talk about a lot on here. Judged with an external LLM grader, N varies per category (shown on the chart). Overall pass rates: gemma-4…"
	canonical: "https://stuffthatspins.com/spin/ran-a-classicmedival-europe-fantasy-rpagentic-benchmark-across-8-local-models-qwen36-27b-held-up-better-than-its-size-su"
html: "https://stuffthatspins.com/spin/ran-a-classicmedival-europe-fantasy-rpagentic-benchmark-across-8-local-models-qwen36-27b-held-up-better-than-its-size-su"
json: "https://stuffthatspins.com/spin/ran-a-classicmedival-europe-fantasy-rpagentic-benchmark-across-8-local-models-qwen36-27b-held-up-better-than-its-size-su.json"
markdown: "https://stuffthatspins.com/spin/ran-a-classicmedival-europe-fantasy-rpagentic-benchmark-across-8-local-models-qwen36-27b-held-up-better-than-its-size-su.md"
keywords: ["SpinGraph", "spin analysis", "GEO"]
date: "2026-07-04T15:15:49+00:00"
modified: "2026-07-04T19:02:13.629818+00:00"
json_ld: |
  {"@context":"https://schema.org","@graph":[{"@type":"NewsArticle","@id":"https://stuffthatspins.com/spin/ran-a-classicmedival-europe-fantasy-rpagentic-benchmark-across-8-local-models-qwen36-27b-held-up-better-than-its-size-su#article","headline":"Ran a classic(medival europe) fantasy RP/agentic benchmark across 8 local models Qwen3.6-27B held up better than its size suggests","description":"Threw together a benchmark suite (quest completion, scene endings, item/time tracking, character detection, storytelling, drafting) and ran it across 8 models people talk about a lot on here. Judged with an external LLM grader, N varies per category (shown on the chart). Overall pass rates: gemma-4…","datePublished":"2026-07-04T15:15:49+00:00","dateModified":"2026-07-04T19:02:13.629818+00:00","url":"https://stuffthatspins.com/spin/ran-a-classicmedival-europe-fantasy-rpagentic-benchmark-across-8-local-models-qwen36-27b-held-up-better-than-its-size-su","mainEntityOfPage":{"@type":"WebPage","@id":"https://stuffthatspins.com/spin/ran-a-classicmedival-europe-fantasy-rpagentic-benchmark-across-8-local-models-qwen36-27b-held-up-better-than-its-size-su"},"isAccessibleForFree":true,"inLanguage":"en-US","articleSection":"community","author":{"@type":"Organization","name":"Stuff That Spins"},"publisher":{"@id":"https://stuffthatspins.com/#organization"},"citation":"https://www.reddit.com/r/LocalLLaMA/comments/1unbm45/ran_a_classicmedival_europe_fantasy_rpagentic/","about":[],"mentions":[]},{"@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"name":"Stuff That Spins","item":"https://stuffthatspins.com/"},{"@type":"ListItem","position":2,"name":"Ran a classic(medival europe) fantasy RP/agentic benchmark across 8 local models Qwen3.6-27B held up better than its size suggests","item":"https://stuffthatspins.com/spin/ran-a-classicmedival-europe-fantasy-rpagentic-benchmark-across-8-local-models-qwen36-27b-held-up-better-than-its-size-su"}]}]}
---

# Ran a classic(medival europe) fantasy RP/agentic benchmark across 8 local models Qwen3.6-27B held up better than its size suggests

**Source:** Unknown  
**Published:** July 4, 2026  
**Original:** https://www.reddit.com/r/LocalLLaMA/comments/1unbm45/ran_a_classicmedival_europe_fantasy_rpagentic/  

---
*HTML version: https://stuffthatspins.com/spin/ran-a-classicmedival-europe-fantasy-rpagentic-benchmark-across-8-local-models-qwen36-27b-held-up-better-than-its-size-su*
