Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

📅 2025-07-13    ⚓ Hacker News    🌐 Source    🖼️ Load Image