David Zhao (@davidzh) 's Twitter Profile
David Zhao

@davidzh

Co-Founder @livekit. Entrepreneur and engineer. I like computers and believe in hard money. #Bitcoin

ID: 22118674

linkhttps://livekit.io calendar_today27-02-2009 07:35:20

1,1K Tweet

1,1K Followers

708 Following

Cartesia (@cartesia_ai) 's Twitter Profile Photo

Building voice agents? Meet Ink-Whisper: the fastest, most affordable streaming speech-to-text model. 🌎 Optimized for accuracy in real-world conditions 👯 Pair with our Sonic text-to-speech → fastest duo in voice AI 🔌 Plugs into Vapi,Pipecat AI, LiveKit Read more:

Building voice agents? Meet Ink-Whisper: the fastest, most affordable streaming speech-to-text model. 

 🌎 Optimized for accuracy in real-world conditions
 👯 Pair with our Sonic text-to-speech → fastest duo in voice AI
 🔌 Plugs into <a href="/Vapi_AI/">Vapi</a>,<a href="/pipecat_ai/">Pipecat AI</a>, <a href="/livekit/">LiveKit</a> 

Read more:
Shayne (@shayneparlo) 's Twitter Profile Photo

Ink-Whisper is fast! Cartesia released a new STT model yesterday, and it's as fast as you'd expect. Streamed transcription finishes in <100ms—before you can say the next sentence. I used it to build a live teleprompter that follows along with what you're saying. Code in 🧵

David Zhao (@davidzh) 's Twitter Profile Photo

Inworld's new TTS is rich and expressive! I've been having a lot of fun playing with them. With a competitive cost & low latency TTFB of ~300ms, this is one to watch for in the voice AI space. Give it a try: pip install livekit-agents[inworld]

Jake (@justjake) 's Twitter Profile Photo

I think the “shell-qui-hires” (Scale, Windsurf, etc) is one of the worst things to ever happen to startups The point of a startup is there’s assurance of binded fate “shell-qui-hires” undo that bindings As a founder, it’s your job to get everyone home. Full stop.

LiveKit (@livekit) 's Twitter Profile Photo

Next, our end-of-turn model has been significantly improved. In the past, things like punctuation or STT provider formatting could affect the accuracy of the model. The new model is MUCH more resilient, resulting in more accurate turn detection across 14 different languages.

Kyle Campbell (@imkylecampbell) 's Twitter Profile Photo

A lot of people have reached out to me for my LiveKit UI integrated with voice agents and avatars, so I made this video to demo it and give the code for free. Free meeting app for voice ai developers to demo their voice agents to clients

David Zhao (@davidzh) 's Twitter Profile Photo

Hedra has achieved quite the engineering feat here. They are performing full frame generation - every pixel is synthesized, not just the lips area. Being able to do it at $0.05/min is just 🤯🤯🤯

LiveKit (@livekit) 's Twitter Profile Photo

Since Hedra allows you to create avatars with ANY image, you can create new avatars in real time by generating new images with an LLM. In this example, we're using GPT-4.1-mini to generate a random avatar. We pass that image to the avatar session, and Hedra will animate it.

Since Hedra allows you to create avatars with ANY image, you can create new avatars in real time by generating new images with an LLM. In this example, we're using GPT-4.1-mini to generate a random avatar.

We pass that image to the avatar session, and Hedra will animate it.
Jake (@justjake) 's Twitter Profile Photo

I don't think people understand that raising money is like, one of the easiest parts of starting a company You're pitching to sell 100% of the next years of your life If your pitch makes sense, they *will* sign you up for that work Be very deliberate about what you do

CloudWebRTC (@cloudwebrtc) 's Twitter Profile Photo

since flutter-webrtc is relied on by many products, its main functions have been implemented and become stable, has more than 50k+ downloads per month recently, I think it is time to update its version number to 1.0. github.com/flutter-webrtc…

David Huie (@davidhuie) 's Twitter Profile Photo

The moment things changed for us? Switching to LiveKit LiveKit handles the entire real-time media pipeline: •TTS •STT •LLM integration •Telephony •And most importantly… turn detection 🧠

The moment things changed for us?

Switching to <a href="/livekit/">LiveKit</a>

LiveKit handles the entire real-time media pipeline:
•TTS
•STT
•LLM integration
•Telephony
•And most importantly… turn detection 🧠
Santiago (@svpino) 's Twitter Profile Photo

I built a simple voice assistant in 70 lines of Python code. It uses: • LiveKit - The voice agent • AssemblyAI - To turn your voice into text • OpenAI - The brain of the agent, and to turn text into audio There's something really cool about this: