A “diff” tool for AI: Finding behavioral differences in new models

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems...

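nguyen-oi: The idea of diffing the "personality" differences between AI models is interesting. Apparently it can also expose things like the censorship features of Chinese-made models, so it may catch on as an evaluation tool. The downside is that it's in English.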

2026/04/06 10:41