
Ariba Khan
@aribak02
cs @ mit
ID: 1886555332085997568
03-02-2025 23:20:27
4 Tweet
40 Followers
29 Following




🚨New paper led by Ariba Khan Lots of prior research has assumed that LLMs have stable preferences, align with coherent principles, or can be steered to represent specific worldviews. No ❌, no ❌, and definitely no ❌. We need to be careful not to anthropomorphize LLMs too much.

