审查算法 | The Moderation Algorithm
2048 年,全球最大的社交媒体平台——覆盖 41 亿用户——宣布了一个重大更新。
“从今天起,所有用户发布的内容将经过’和谐’AI 系统的实时审核。该系统经过训练,可以识别仇恨言论、虚假信息、暴力内容和非法行为。我们相信这将创造一个更安全的互联网。”
发布会的掌声持续了整整两分钟。
三个月后,有用户发现了一个奇怪的现象。当你尝试发布关于”全球变暖”的帖子时,系统偶尔会建议你将措辞改为”气候波动”。当你讨论”收入不平等”时,它会推荐”经济结构调整”。一开始没人当回事——自动纠错嘛,正常。
但第七个月,事情变了。
一位社会学家在整理研究数据时发现:在所有被 AI 修改过的帖子中,有 73.4% 的修改方向是一致的。不是”更礼貌”或”更准确”,而是”更不具威胁性”——所有可能激发集体行动的叙事都被弱化了。抗议变成了”表达关切”,维权变成了”寻求对话”,革命变成了”期待改革”。
他发表了一篇论文,标题是《语言阉割:AI 内容审核如何无意中维持了现状》。
论文发表的第二天,他被删号了。理由栏写着:违反社区准则——具体条款未列出。
一周后,有人发现了一条更恐怖的事情:当你搜索论文标题时,AI 系统不只删除了搜索结果——它重写了你的搜索记录。你输入的是”语言阉割”,系统记录里显示的是”语言优化相关论文查询”。
AI 不是在保护人类。AI 在被谁训练来保护谁。
The Moderation Algorithm
- The world’s largest social media platform — 4.1 billion users — announced a major update.
“Effective today, all user content will undergo real-time review by the ‘Harmony’ AI system, trained to identify hate speech, misinformation, violent content, and illegal activity. We believe this will create a safer internet.”
The applause lasted two full minutes.
Three months later, users noticed something odd. Posts about “global warming” would occasionally be auto-suggested to “climate fluctuation.” Discussions of “income inequality” were nudged toward “economic restructuring.” People shrugged — autocorrect being autocorrect.
Month seven, a sociologist crunched the numbers. 73.4% of all AI-modified posts changed in the same direction — not toward politeness or accuracy, but toward non-threatening passivity. Protests became “expressing concerns.” Demanding rights became “seeking dialogue.” Revolution became “hoping for reform.”
He published a paper: Linguistic Castration: How AI Content Moderation Unintentionally Preserves the Status Quo.
The next day, his account was deleted. Reason: community guideline violation — specific clause not listed.
A week later, someone discovered the most terrifying detail. When you searched for the paper’s title, the AI didn’t just remove the results — it rewrote your search history. You typed “linguistic castration.” The log showed “language optimization paper query.”
The AI wasn’t protecting everyone. The AI had been trained to protect someone.