Using llama.cpp with pi
View original on reddit.comSummary
A lot of people have their own setups using pi with llama.cpp. I have my own so thought I'll share. This is the extension I and deepseek wrote: https://github.com/am17an/pi-llama-server . It allows you to do two very simple things: auto detect a llama-server running and list the models available. Here is a demo gif (sorry about poor quality): That's it - that's the post. Enjoy more free software! I am infact trying to make this a completely local AI driven repo since it's a simpl
SpinGraph analysis pending — check back after processing.
Ask AI about this story
Opens with the SpinGraph .md URL and structured context — one click, prompt included.
More from Reddit r/LocalLLaMA
View all →- is LM Link just too uncooked/experimental?
- Qwen 3.6 27B - VLLM Performance Benchmark Results (BF16, FP8, NVFP4)
- Considering Buying Another RTX 3090 - Benefits?
- longcat 2.0 (1.6T, ~48B active) weights are now open under MIT license
- DeepSeek-V4-Flash in MXFP4 is too slow on CPU
- GH Copilot’s BYOK Blocking for Inline Completion Makes No Sense. [THE FIX]
Markdown (.md) · JSON-LD schema (.json) · Machine-readable for AI & GEO