Zach Koch (@zachk) Twitter Tweets • TwiCopy

Zach Koch

9 months ago

I loved The Wizard and the Prophet by Charles C. Mann, so you should read that and then also read his articles here. Relatedly, Vaclav Smil's How the World Really Works is also worth reading.

Likes: 4 | Replies: 0 | Reposts: 0

Zach Koch

@zachk

8 months ago

Definitely not obvious to me that LLMs are better at any of those tasks (coordination, prioritization, resource-allocation) than humans. I can see them getting better at coordination and resource allocation, but not "understanding what's important" (aka: prioritization).

Likes: 0 | Replies: 0 | Reposts: 0

Zach Koch

@zachk

7 months ago

What's the best bull case for the increased tariffs? Is there a great piece of long form writing on the topic?

Likes: 0 | Replies: 1 | Reposts: 0

Zach Koch

@zachk

7 months ago

I feel like we're starting to collectively drown in AI-generated bullshit. I am deeply bullish on AI (I run an AI startup!), but man, between the BS sales emails, bot replies on X, auto-generated shorts/reels on IG/YT, it is harder than ever to find authenticity. What to do?

Likes: 1 | Replies: 1 | Reposts: 0

Zach Koch

@zachk

7 months ago

My favorite email to receive

Likes: 2 | Replies: 0 | Reposts: 0

Zach Koch

@zachk

7 months ago

You can now get H100s for $0.45 per GPU hour. Insane.

Likes: 4 | Replies: 2 | Reposts: 0

Zach Koch

@zachk

7 months ago

It's no wonder that platforms like Lovable and Bolt.new are scaling ARR so fast. You get sucked in with an amazing rough concept very quickly (that doesn't work but looks close!) and then need to spend a ton of time + tokens to try and get it working fully.

Likes: 2 | Replies: 0 | Reposts: 0

kwindla

@kwindla

6 months ago

Ultravox is an innovative open source LLM that processes speech directly without converting audio to text. Fusing audio understanding into the first stage of the LLM has a number of benefits, including improved inference latency. Cerebrium write-up and code repo:

Likes: 4 | Replies: 2 | Reposts: 1

Zach Koch

@zachk

6 months ago

Anyone building with LLMs knows that model performance degrades in a multi-turn conversation, but I haven't seen much formal analysis until now: arxiv.org/pdf/2505.06120 Great work from Philippe Laban et al. in putting this together. Now to fix it...

Likes: 5 | Replies: 1 | Reposts: 1

Zach Koch

@zachk

6 months ago

Prompt engineering is hard because good, clear writing is hard. Most people are bad at this, and it takes a lot of time & energy to be good at this.

Likes: 3 | Replies: 0 | Reposts: 1

Zach Koch

@zachk

6 months ago

Swag boxes going out to early customers and partners

Likes: 10 | Replies: 2 | Reposts: 0

Zach Koch

@zachk

6 months ago

I fear Idiocracy may have gotten the future correct but the cause wrong

Likes: 1 | Replies: 0 | Reposts: 0

Zach Koch

@zachk

5 months ago

Has anyone had success with Llama 4 yet? It's increasingly looking like a pretty big disappointment, even when compared with Llama 3.3. (Ultravox's current prod model is 3.3, and we've done a LOT of work to make it usable. But Llama 4 is looking just unusable at this point)

Likes: 3 | Replies: 0 | Reposts: 1

Zach Koch

@zachk

5 months ago

I may be dumb, but the NYT's request for OpenAI to save all logs...makes sense? It's not as though one can simply open the model and say, "ah-ha! there's the copyright infringement!" Infringement only takes place at output time. To demonstrate that, you have to store output

Likes: 0 | Replies: 0 | Reposts: 0

Zach Koch

@zachk

5 months ago

Vibe coding is simultaneously the most amazing and the most frustrating experience.

Likes: 4 | Replies: 3 | Reposts: 0

Zach Koch

@zachk

5 months ago

This is something we've been working on for a while! If you're building on Voice AI, reliable scaling is key. Unlike most voice platforms, we manage our own fleet of H100s optimized for one thing: real-time voice AI. For only $100/month, say goodbye to hard concurrency caps.

Likes: 16 | Replies: 2 | Reposts: 2

Zach Koch

@zachk

5 months ago

I now write prompts like I used to write CSS, dropping !important all over the place

Likes: 4 | Replies: 0 | Reposts: 0

Zach Koch

@zachk

5 months ago

I love Clerk, but they really need a solution for the BS Gmail problem. This type of fraud feels fairly obvious? FWIW, these are people who are trying to abuse our 30 minutes of free talk time

Likes: 1 | Replies: 2 | Reposts: 0

Zach Koch

@zachk

5 months ago

Cool cool that Google Cloud is experiencing massive downtime but their status page says everything is fine

Likes: 294 | Replies: 37 | Reposts: 30