SPIN Unprocessed July 5, 2026 ai_technology community
Competence Gate: gating tool-use on a small model's internal confidence signal instead of its verbalised one — Qwen3.5-4B, open weights [P]
View original on reddit.comSummary
I made a 10MB LoRA adapter for Qwen3.5-4B plus a small orchestration layer. It decides, per query, whether to answer directly, search the web, or retrieve from your own local documents and it refuses to make things up when it can't verify an answer. It runs locally (Apple Silicon / MLX, with a GGUF build for llama.cpp/Ollama). Basically small instruct models are poor at telling users how confident they really are. They can't verbalise it and tend to say they are confident for everyhting.
SpinGraph analysis pending — check back after processing.
Ask AI about this story
Opens with the SpinGraph .md URL and structured context — one click, prompt included.
More from Reddit r/MachineLearning
View all →- Question regarding Xournal++ and software 4 taking university notes during class [D]
- ECCV travel support program [D]
- I built a open source neural network shape validator [P]
- If DeepMind or Anthropic is doing your exact research topic, do you still continue? [D]
- If your GPU can run inference, it should be able to fine-tune too. [P]
- We'll benchmark an Open weights LLM on any GPU you choose — drop your model + hardware and we'll run it. [D]
Markdown (.md) · JSON-LD schema (.json) · Machine-readable for AI & GEO