SPIN Unprocessed July 4, 2026 ai_technology community
We'll benchmark an open-weights LLM on any GPU you choose: drop your model + hardware and we'll run it. [D]
Summary
We run HexGrid Cloud, a platform for deploying open-source models on GPUs, and we're heads-down optimizing our serving/deployment layer. To pressure-test it, we're benchmarking real models under real concurrency, and instead of guessing, we'd rather run what you actually want to see.

---

Models available for benchmarking:

- Nemotron-3 Super 120B-A12B (only NVFP4)
- Nemotron-3 Nano 30B A3B
- Qwen-3.6 27B
- Llama 3.3 70B Instruct
- Gemma-4 31B
- Devstral-Small-2-24B-Instruct-2512
- ?? ( you suggest
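
For anyone wondering what "benchmarking under real concurrency" roughly involves, here is a minimal sketch in Python. It is not HexGrid Cloud's actual harness: `fake_generate` is a hypothetical stand-in that just sleeps and returns a made-up token count, and `run_benchmark` and its parameters are names invented for illustration. The idea is to cap the number of in-flight requests with a semaphore, time each request, and report median and tail latency plus aggregate token throughput.

```python
import asyncio
import random
import statistics
import time


async def fake_generate(prompt: str, rng: random.Random) -> int:
    """Hypothetical stand-in for a model call: sleeps, then returns a token count."""
    await asyncio.sleep(rng.uniform(0.01, 0.03))
    return rng.randint(50, 100)


async def run_benchmark(concurrency: int, total_requests: int, seed: int = 0) -> dict:
    """Send total_requests requests with at most `concurrency` in flight at once."""
    rng = random.Random(seed)
    sem = asyncio.Semaphore(concurrency)
    latencies = []
    token_counts = []

    async def one_request(i: int) -> None:
        async with sem:  # limits the number of simultaneous requests
            start = time.perf_counter()
            n_tokens = await fake_generate(f"prompt {i}", rng)
            latencies.append(time.perf_counter() - start)
            token_counts.append(n_tokens)

    wall_start = time.perf_counter()
    await asyncio.gather(*(one_request(i) for i in range(total_requests)))
    wall = time.perf_counter() - wall_start

    latencies.sort()
    # Nearest-rank style p95: index into the sorted latencies, clamped to the end.
    p95_index = min(len(latencies) - 1, int(0.95 * len(latencies)))
    return {
        "requests": len(latencies),
        "total_tokens": sum(token_counts),
        "p50_s": statistics.median(latencies),
        "p95_s": latencies[p95_index],
        "tokens_per_s": sum(token_counts) / wall,
    }


if __name__ == "__main__":
    print(asyncio.run(run_benchmark(concurrency=4, total_requests=20)))
```

To point this at a real deployment you would swap `fake_generate` for a call to your serving endpoint and sweep `concurrency` over a few values to see where latency starts to climb.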