Jack Vanlightly (@vanlightly) Twitter Tweets • TwiCopy

Jack Vanlightly

@vanlightly

+ Follow

@confluentinc thinking about event streaming.
Ex @Splunk, @VMware
hotds.dev, jack-vanlightly.com
Credit: ESO/B. Tafreshi

ID: 796723788855603200

linkhttps://jack-vanlightly.com calendar_today10-11-2016 14:38:48

1,1K Tweet

4,4K Followers

233 Following

Jack Vanlightly

@vanlightly

8 months ago

I may have to add Restate to my disaggregated log replication survey 😁

thumb_up_off_alt17

chat_bubble_outline0

repeat2

shareShare

A new log replication disaggregation survey post is out! The Kafka Replication Protocol: 🔹Separation of control plane from data plane. 🔹Role separation with minimal coupling. 🔹Kafka’s alignment with Paxos roles. jack-vanlightly.com/blog/2025/2/21…

thumb_up_off_alt119

chat_bubble_outline2

repeat17

shareShare

Ankush Desai

@ankushpd

8 months ago

If you are looking for formal models of a real-world distributed system, DeepSeek DeepSeek released P specifications for their new distributed file system (3FS): github.com/deepseek-ai/3F…

thumb_up_off_alt220

chat_bubble_outline2

repeat43

shareShare

Jack Vanlightly

@vanlightly

8 months ago

Another Humans of the Data Sphere is out, with issue #9! In this issue, we also look at whether software engineers can learn from mechanical engineering, and looking at table formats as a form of virtualization. hotds.dev/p/humans-of-th…

thumb_up_off_alt8

chat_bubble_outline0

repeat2

shareShare

Jack Vanlightly

@vanlightly

8 months ago

A new disaggregated log replication survey post is out. How does the combination of Apache Pulsar with Apache BookKeeper divide and conquer the responsibilities of log replication? jack-vanlightly.com/blog/2025/3/13…

thumb_up_off_alt99

chat_bubble_outline0

repeat19

shareShare

Jack Vanlightly

@vanlightly

7 months ago

Any Principal Engineers out there with ADHD or creative wiring — who don’t thrive in the tasks of project coordination, alignment meetings, and people management, but thrive on strategy, system design, writing, and shaping direction through ideas? Curious how you navigate the

thumb_up_off_alt180

chat_bubble_outline19

repeat8

shareShare

Ismael Juma

@ijuma

7 months ago

And the old group coordinator implementation is gone from Apache Kafka - love it when open-source projects can delete large chunks of complex code. github.com/apache/kafka/p…

thumb_up_off_alt16

chat_bubble_outline0

repeat3

shareShare

Jack Vanlightly

@vanlightly

7 months ago

Seems like I’m not alone. For what it’s worth, I’ve got a great fit at Confluent — but the more senior I get, the more I wonder how sustainable that is across future PE roles. Thinking of writing a blog post, maybe with interviews or perspectives from PEs who aren’t natural cat

thumb_up_off_alt30

chat_bubble_outline0

repeat1

shareShare

Jack Vanlightly

@vanlightly

7 months ago

Another Humans of the Data Sphere is out, with issue 10! In this issue people are talking fsyncs, tips for running ClickHouse at scale, the problems with MCP and more. Plus I dig up a classic paper from 1962. hotds.dev/p/humans-of-th…

thumb_up_off_alt7

chat_bubble_outline0

repeat2

shareShare

Jack Vanlightly

@vanlightly

5 months ago

How to reliably distribute work across microservices, stream processors, durable execution, event-driven, orchestration and now AI agents? Coordinated Progress is a 4-part series that explores the common structure behind reliable distributed systems. jack-vanlightly.com/blog/2025/6/11…

thumb_up_off_alt182

chat_bubble_outline1

repeat31

shareShare

Jack Vanlightly

@vanlightly

5 months ago

Over the past few months, I’ve been thinking deeply about how systems make progress reliably in the face of partial failures, service boundaries, retries, and complex dependencies. Building reliable workflows across microservices, functions, and stream processors is one of the

thumb_up_off_alt61

chat_bubble_outline3

repeat16

shareShare

Jack Vanlightly

@vanlightly

4 months ago

In distributed systems, reliability isn’t just about retries and durability, it’s about knowing who owns recovery. My latest post, based on the Coordinated Progress model I posted previously, explores how reliable triggers create responsibility boundaries and how those boundaries

thumb_up_off_alt101

chat_bubble_outline0

repeat16

shareShare

Jack Vanlightly

@vanlightly

3 months ago

A new case study is born

thumb_up_off_alt7

chat_bubble_outline0

repeat0

shareShare

Jack Vanlightly

@vanlightly

3 months ago

Science moves slowly because wrong theories waste decades. Engineering is careful because failures kill people. Software moves fast because mistakes are cheap, the expensive error isn't making the wrong choice, it's taking too long to make any choice. jack-vanlightly.com/blog/2025/7/22…

thumb_up_off_alt56

chat_bubble_outline3

repeat11

shareShare

Jack Vanlightly

@vanlightly

3 months ago

In a future of autonomous AI agents, we can't limit ourselves to error prevention and error detection, we must also include remediation. But when AI loses touch with reality due to hallucinations, confabulation and misinterpretation, who does the remediation? In cases of

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare