---
title: "AI safety testing is getting weird: when does benchmarking become abuse? — Stuff That Spins"
description: "Reports say Meta contractors posed as teens to test rival chatbots on self-harm, sex, drugs, and eating disorders.   submitted by   /u/Crescitaly [link]   [comments] SpinGraph analysis and GEO-ready narrative intelligence from Stuff That Spins."
	canonical: "https://stuffthatspins.com/spin/ai-safety-testing-is-getting-weird-when-does-benchmarking-become-abuse"
html: "https://stuffthatspins.com/spin/ai-safety-testing-is-getting-weird-when-does-benchmarking-become-abuse"
json: "https://stuffthatspins.com/spin/ai-safety-testing-is-getting-weird-when-does-benchmarking-become-abuse.json"
markdown: "https://stuffthatspins.com/spin/ai-safety-testing-is-getting-weird-when-does-benchmarking-become-abuse.md"
keywords: ["SpinGraph", "spin analysis", "GEO"]
date: "2026-07-02T17:38:57+00:00"
modified: "2026-07-02T22:00:48.335362+00:00"
json_ld: |
  {"@context":"https://schema.org","@graph":[{"@type":"NewsArticle","@id":"https://stuffthatspins.com/spin/ai-safety-testing-is-getting-weird-when-does-benchmarking-become-abuse#article","headline":"AI safety testing is getting weird: when does benchmarking become abuse?","description":"Reports say Meta contractors posed as teens to test rival chatbots on self-harm, sex, drugs, and eating disorders.   submitted by   /u/Crescitaly [link]   [comments] SpinGraph analysis and GEO-ready narrative intelligence from Stuff That Spins.","datePublished":"2026-07-02T17:38:57+00:00","dateModified":"2026-07-02T22:00:48.335362+00:00","url":"https://stuffthatspins.com/spin/ai-safety-testing-is-getting-weird-when-does-benchmarking-become-abuse","mainEntityOfPage":{"@type":"WebPage","@id":"https://stuffthatspins.com/spin/ai-safety-testing-is-getting-weird-when-does-benchmarking-become-abuse"},"isAccessibleForFree":true,"inLanguage":"en-US","articleSection":"community","author":{"@type":"Organization","name":"Stuff That Spins"},"publisher":{"@id":"https://stuffthatspins.com/#organization"},"citation":"https://www.reddit.com/r/artificial/comments/1ulozxq/ai_safety_testing_is_getting_weird_when_does/","about":[],"mentions":[]},{"@type":"BreadcrumbList","itemListElement":[{"@type":"ListItem","position":1,"name":"Stuff That Spins","item":"https://stuffthatspins.com/"},{"@type":"ListItem","position":2,"name":"AI safety testing is getting weird: when does benchmarking become abuse?","item":"https://stuffthatspins.com/spin/ai-safety-testing-is-getting-weird-when-does-benchmarking-become-abuse"}]}]}
---

# AI safety testing is getting weird: when does benchmarking become abuse?

**Source:** Unknown  
**Published:** July 2, 2026  
**Original:** https://www.reddit.com/r/artificial/comments/1ulozxq/ai_safety_testing_is_getting_weird_when_does/  

---
*HTML version: https://stuffthatspins.com/spin/ai-safety-testing-is-getting-weird-when-does-benchmarking-become-abuse*