Justin Bullock (@justinbullock14) Twitter Tweets • TwiCopy

Justin Bullock

@justinbullock14

+ Follow

VP of Policy for @americans4ri; Senior Fellow with Convergence Analysis; Advocate of Love, Intelligence, & Freedom

ID: 2933754365

linkhttp://governingwithAI.com calendar_today20-12-2014 15:27:34

3,3K Tweet

1,1K Followers

1,1K Following

Ethan Mollick

@emollick

5 months ago

Current agents only do 30% of complex real company tasks in this paper. Though note benchmarks are a floor, not a ceiling, if: 1) More recent models show improvement in the benchmark, suggesting future models may do it 2) Better prompting/tools would make the AI perform better.

thumb_up_off_alt194

chat_bubble_outline13

repeat18

shareShare

Justin Bullock

@justinbullock14

5 months ago

So, um, is it really the case that the headline AI agents COMPLETED 30%-48% of real-real world professional office tasks? That’s, um, *checks notes* a lot, right?

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Justin Bullock

@justinbullock14

5 months ago

My sample is biased (obviously), but ChatGPT use (and, rapidly others as well) is beginning to feel ambient. The oracles are running amok amongst us, well maybe not quite amok, but quickly thriving towards ends unknown for sure.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Justin Bullock

@justinbullock14

5 months ago

👀👀

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Justin Bullock

@justinbullock14

5 months ago

Any attempts at counting or estimating anything in this direction anyone knows about?

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Justin Bullock

@justinbullock14

5 months ago

👀👀

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Justin Bullock

@justinbullock14

5 months ago

👀👀

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Diego

@diego__pasini

5 months ago

Ethan Mollick rohit We pushed the system prompt earlier today. Feel free to take a look! github.com/xai-org/grok-p…

thumb_up_off_alt172

chat_bubble_outline13

repeat10

shareShare

Ethan Mollick

@emollick

5 months ago

This is a step in a good direction for xAI, transparency in their system prompt.

thumb_up_off_alt139

chat_bubble_outline6

repeat7

shareShare

Justin Bullock

@justinbullock14

5 months ago

👀👀

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Ketan Ramakrishnan

@ketanr

5 months ago

So what to do? We say: focus on the handful of large AI developers truly at at the frontier. That's where the most distinctive potential risks of frontier AI development are most likely to arise, and where the need for transparency, evidence, and understanding are most pressing.

thumb_up_off_alt11

chat_bubble_outline1

repeat0

shareShare

Loquacious Bibliophilia ⏸️

@locbibliophilia

5 months ago

Ketan Ramakrishnan Dean W. Ball Carnegie Endowment I did not expect to agree with Dean W. Ball but yes, this is the path forward. We do need governance on frontier companies. "One of the chief tasks of a frontier AI regulatory regime—arguably the chief task, at least for now—is to put society in a position to reduce such

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Justin Bullock

@justinbullock14

5 months ago

Keep an eye for these projects to be published! Deric and I were very impressed with the quality of these fellows and their work. There’s not nearly enough work being done to understand what components are needed for an AGI Social Contract. With this incredible team of fellows

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Justin Bullock

@justinbullock14

5 months ago

Kudos to Anthropic for their work on transparency! This a big step in the right direction. Happy to see it! Let’s have the public conversation about this incredibly important piece of AI policy! “Frontier AI development needs greater transparency to ensure public safety and

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Justin Bullock

@justinbullock14

5 months ago

Lots to like here. Kudos to Ketan Ramakrishnan and Dean W. Ball for their work here! Moving the discussion forward, in a sensible direction. Love to see it.

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Justin Bullock

@justinbullock14

5 months ago

👀👀

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Justin Bullock

@justinbullock14

5 months ago

How could you not be compelled to watch.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Joe Carlsmith

@jkcarlsmith

5 months ago

I'm giving a public talk Tuesday July 8th, 7:30 pm at Mox in SF. Title: "Can goodness compete?". It's about long-term equilibrium outcomes post-AGI. More info at link in thread.

thumb_up_off_alt49

chat_bubble_outline3

repeat6

shareShare

Justin Bullock

@justinbullock14

5 months ago

I know I’m new to DC, because when I look around, I see opportunity everywhere. And, even more than opportunity, there’s maybe even a hint of a touch of momentum.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Justin Bullock

@justinbullock14

5 months ago

Always nice to see rohit making an obvious observation that I completely missed. Well put.

thumb_up_off_alt1

chat_bubble_outline1

repeat0

shareShare