Ariba Khan (@aribak02)'s Twitter Profile
Ariba Khan

@aribak02

cs @ mit

ID: 1886555332085997568

Joined: 03-02-2025 23:20:27

4 Tweets

40 Followers

29 Following

Cas (Stephen Casper) (@stephenlcasper)

📣 Announcing the AI Agent Index

AI agents are growing in number, capabilities, and impact. In response, we introduce the first public resource documenting the technical and safety features of deployed agentic AI systems. aiagentindex.mit.edu

Cas (Stephen Casper) (@stephenlcasper)

📣 New paper

AI gov. frameworks are being designed to rely on rigorous assessments of capabilities & risks. But risk evals are [still] pretty bad – they regularly fail to find overtly harmful behaviors that surface post-deployment.

Model tampering attacks can help with this.
Cas (Stephen Casper) (@stephenlcasper)

🚨 New ICLR 2026 blog post: Pitfalls of Evidence-Based AI Policy

Everyone agrees: evidence is key for policymaking. But that doesn't mean we should postpone AI regulation.

Instead of "Evidence-Based AI Policy," we need "Evidence-Seeking AI Policy."

arxiv.org/abs/2502.09618…
Cas (Stephen Casper) (@stephenlcasper)

🚨 New paper led by Ariba Khan

Lots of prior research has assumed that LLMs have stable preferences, align with coherent principles, or can be steered to represent specific worldviews. No ❌, no ❌, and definitely no ❌. We need to be careful not to anthropomorphize LLMs too much.
John Sarihan (@jsarihan)

Today we’re introducing Crosby, a hybrid AI law firm that helps rapidly growing businesses execute faster.

Contracts are connection points. They allow companies to transact with one another and create economic growth. But while every aspect of business has sped up, the way we