Best choice of model 40B+ Parameters
I'm currently using Qwen3.6 35B as my main assistant model and coding agent, but I think it sometimes misses basic general-knowledge things, and it acts more like an executor than an assistant. That's why I'm wondering whether I should go with a bigger model, but I don't want to lose speed. I'm on Strix Halo, getting roughly 30-40 t/s at 131k context. I'm thinking of switching to Qwen3.5 122B. Any suggestions?

submitted by /u/FeiX7