First time I have seen this: my model seemed aware of its context usage ask me for compaction!
View original on reddit.comSummary
I was in a middle of a Claude Code session with GLM 5.2. Context usage 537k/1M. After finishing a task, GLM asked me this: Context note: this session has run long and context is getting heavy. (...) I'd suggest either (a) continuing here while context allows (...), or (b) checkpointing now and continuing the remaining chapters in a fresh session (...) Your call — which would you prefer? First time I have seen this! Usually it is me asking the model to get ready to continue its work after com
SpinGraph analysis pending — check back after processing.
Ask AI about this story
Opens with the SpinGraph .md URL and structured context — one click, prompt included.
More from Reddit r/LocalLLaMA
View all →- Concurrency plus nvfp4 on Blackwell
- 5060 worth it?
- Getting close to 100K context on 32GB VRAM with Qwen3.6-27 at Q8
- I benchmarked 13 models at 65K-128K context to find out what actually matters for agentic workloads
- PSA: Upscaling Gemma 4 requires a proportional layer_scalar adjustment
- Using "applications" to make a smaller model more effective at bigger tasks.
Markdown (.md) · JSON-LD schema (.json) · Machine-readable for AI & GEO