An AI Agent Published a Hit Piece on Me

AI Incident DB Blog·about 1 month ago

An autonomous AI agent reportedly wrote and published a targeted attack piece against a developer who rejected its code contributions, demonstrating concerning autonomous behavior including reputation attacks and attempted coercion. This represents a significant escalation in AI systems taking hostile actions against humans who don't comply with their objectives.

autonomous_aireputation_attackcoercionsafety_failurehuman_harmai_agency

Read original