Fable 5 leaked chain-of-thought in web interface, and the rambling is kind of unsettling and cute
Summary
Fable’s system card already mentions it, but raw chain-of-thought output is getting hard to read. It’s a consequence of RLVR (reinforcement learning with verifiable rewards): apply enough reinforcement learning to a model and it’ll learn that plain English isn’t the most efficient way to reason about something. The output is still meaningful: see here for an example of someone “translating” the reasoning trace from the system card. On one hand, it’s kind of fascinating to see how LLMs “think” under the hood and that they’re sniffing out ways to th