Best choice of model 40B+ Parameters

Summary

currently using Qwen3.6 35B as my main assistant model + coding agent but I think sometimes it misses basical general knowledge things, and it is more like executioner that assistant. That's why I though should I go with bigger models, But I don't want to lose speed I am on Strix Halo Having 30-40 t/s roughly on 131k context Thinking to switch on Qwen3.5 122B Any Suggestions? submitted by /u/FeiX7 [link] [comments]

SpinGraph analysis pending — check back after processing.

Ask AI about this story

Opens with the SpinGraph .md URL and structured context — one click, prompt included.

ChatGPT Claude Perplexity Gemini Grok

More from Reddit r/LocalLLaMA