akira (@realmcore_) 's Twitter Profile
akira

@realmcore_

oss code gen | here be dragons

ID: 1465427172726755330

Joined: 29-11-2021 21:10:14

1.1K Tweets

4.4K Followers

479 Following

akira (@realmcore_) 's Twitter Profile Photo

A long time ago, a very smart researcher told me to look for the things unsaid to describe what the labs are doing. In this case, it seems their product team is truly building an agent, and by extension, we will finally see whether or not the app layer optionality thesis holds.

Edward Z. Yang (@ezyang) 's Twitter Profile Photo

I regret to inform you that once I have actually paged a codebase into my head, it is faster for me to make changes than it is to ask Claude to do it

akira (@realmcore_) 's Twitter Profile Photo

interestingly, it may be volume of data rather than diversity due to multi-modality that makes human intelligence more compressive and therefore more "intelligent" when compared to deep neural nets.

akira (@realmcore_) 's Twitter Profile Photo

kind of funny that everyone is making these images and freaking out when it's pretty clear it was just trained on more svg data. Same goes for task variety. There's a frontier of diverse tasks to eval on so of course every version should support increasing diversity. still cool

akira (@realmcore_) 's Twitter Profile Photo

Imagine if this was bait for anthropic to drop a swe-agent, then oai releases the full version of theirs to crush whatever ace anthropic has up their sleeve. Also possible that 4.5 is bait for new opus to open up space for 4.5 + reasoning as pro mode 'o4'

akira (@realmcore_) 's Twitter Profile Photo

Claude 3.7 for code is kind of terrible ~50% of the time. The code it writes feels like ancient legacy java and I'm not even using it for java.

akira (@realmcore_) 's Twitter Profile Photo

I sort of agree with this but also don't. Pretraining will likely have to shift to a bunch of curated, useful data. We'll need grounded data. The current paradigm is pretty much filtered common crawl which I agree will not continue. Data efficiency go "brrrr" as they say

akira (@realmcore_) 's Twitter Profile Photo

The main ideas behind practically ALL agent companies are:
1_ how you spend compute >>> how much compute you spend
2_ distribution >>> tech as n -> infinity
if you don't have your own models, you're just running an arbitrage play over these two ideas

Dcai (@_dcai) 's Twitter Profile Photo

A few days ago, we signed a lease for an absolutely beautiful office space in the heart of San Francisco. It's got high ceilings, hugeee windows, and a rooftop with a gorgeous view of the SF skyline. But this space isn't for us. It's not an office. The plan was never to just

Y Combinator (@ycombinator) 's Twitter Profile Photo

Simplex (Simplex) builds developer-first web agents that companies use to integrate with legacy portals. They're already in production, dispatching freight shipments, downloading customers’ invoices, and fetching websites’ internal APIs. ycombinator.com/launches/NbM-s… Congrats on

Morph (@morph_labs) 's Twitter Profile Photo

We are excited to announce Trinity, an autoformalization system for verified superintelligence that we have developed at Morph. We have used it to automatically formalize in Lean a classical result of de Bruijn that the abc conjecture is true almost always.
