Local OpenSource LLM's future feels very exciting, my ideal future model "wishlist" and attempted predictions for future local models.
View original on reddit.comSummary
I'm just a hobbyist lurker, so I'm definitely not that informed, but from what I see here has got me extremely excited and I just wanted to share. Having Qwen 3.6 27B running at ~24 tps on my dual 3060 machine has been such a life changer, I'm finding so many use cases for it. Following local llm developments has got me even more excited for future generations, and I wanted to jot some of them down as an ideal "wishlist". Here goes! 1: Unlocking the whole GPU Qwen 3.6 27B r
SpinGraph analysis pending — check back after processing.
Ask AI about this story
See how AI engines summarize this narrative — one click, prompt included.
More from Reddit r/LocalLLaMA
View all →- Qwen3.6 27B on a 5090, 6.4k sample tok/s distribution after tuning MTP/cache settings
- DGX Spark and Overtemps
- Gemma 4 12B - MLX Kernel
- Using local models with Hermes vs Claude code
- I merged fixes for quantized KV cache into my DeepSeek V4 branch
- Ran a classic(medival europe) fantasy RP/agentic benchmark across 8 local models Qwen3.6-27B held up better than its size suggests
Markdown (.md) · JSON-LD schema (.json) · Machine-readable for AI & GEO