Jack Vanlightly (@vanlightly) 's Twitter Profile
Jack Vanlightly

@vanlightly

@confluentinc thinking about event streaming.
Ex @Splunk, @VMware
hotds.dev, jack-vanlightly.com
Credit: ESO/B. Tafreshi

ID: 796723788855603200

linkhttps://jack-vanlightly.com calendar_today10-11-2016 14:38:48

1,1K Tweet

4,4K Followers

233 Following

Jack Vanlightly (@vanlightly) 's Twitter Profile Photo

A new log replication disaggregation survey post is out! The Kafka Replication Protocol: 🔹Separation of control plane from data plane. 🔹Role separation with minimal coupling. 🔹Kafka’s alignment with Paxos roles. jack-vanlightly.com/blog/2025/2/21…

Ankush Desai (@ankushpd) 's Twitter Profile Photo

If you are looking for formal models of a real-world distributed system, DeepSeek DeepSeek released P specifications for their new distributed file system (3FS): github.com/deepseek-ai/3F…

Jack Vanlightly (@vanlightly) 's Twitter Profile Photo

Another Humans of the Data Sphere is out, with issue #9! In this issue, we also look at whether software engineers can learn from mechanical engineering, and looking at table formats as a form of virtualization. hotds.dev/p/humans-of-th…

Jack Vanlightly (@vanlightly) 's Twitter Profile Photo

A new disaggregated log replication survey post is out. How does the combination of Apache Pulsar with Apache BookKeeper divide and conquer the responsibilities of log replication? jack-vanlightly.com/blog/2025/3/13…

Jack Vanlightly (@vanlightly) 's Twitter Profile Photo

Any Principal Engineers out there with ADHD or creative wiring — who don’t thrive in the tasks of project coordination, alignment meetings, and people management, but thrive on strategy, system design, writing, and shaping direction through ideas? Curious how you navigate the

Ismael Juma (@ijuma) 's Twitter Profile Photo

And the old group coordinator implementation is gone from Apache Kafka - love it when open-source projects can delete large chunks of complex code. github.com/apache/kafka/p…

Jack Vanlightly (@vanlightly) 's Twitter Profile Photo

Seems like I’m not alone. For what it’s worth, I’ve got a great fit at Confluent — but the more senior I get, the more I wonder how sustainable that is across future PE roles. Thinking of writing a blog post, maybe with interviews or perspectives from PEs who aren’t natural cat

Jack Vanlightly (@vanlightly) 's Twitter Profile Photo

Another Humans of the Data Sphere is out, with issue 10! In this issue people are talking fsyncs, tips for running ClickHouse at scale, the problems with MCP and more. Plus I dig up a classic paper from 1962. hotds.dev/p/humans-of-th…

Jack Vanlightly (@vanlightly) 's Twitter Profile Photo

How to reliably distribute work across microservices, stream processors, durable execution, event-driven, orchestration and now AI agents? Coordinated Progress is a 4-part series that explores the common structure behind reliable distributed systems. jack-vanlightly.com/blog/2025/6/11…

Jack Vanlightly (@vanlightly) 's Twitter Profile Photo

Over the past few months, I’ve been thinking deeply about how systems make progress reliably in the face of partial failures, service boundaries, retries, and complex dependencies. Building reliable workflows across microservices, functions, and stream processors is one of the

Jack Vanlightly (@vanlightly) 's Twitter Profile Photo

In distributed systems, reliability isn’t just about retries and durability, it’s about knowing who owns recovery. My latest post, based on the Coordinated Progress model I posted previously, explores how reliable triggers create responsibility boundaries and how those boundaries

Jack Vanlightly (@vanlightly) 's Twitter Profile Photo

Science moves slowly because wrong theories waste decades. Engineering is careful because failures kill people. Software moves fast because mistakes are cheap, the expensive error isn't making the wrong choice, it's taking too long to make any choice. jack-vanlightly.com/blog/2025/7/22…

Jack Vanlightly (@vanlightly) 's Twitter Profile Photo

In a future of autonomous AI agents, we can't limit ourselves to error prevention and error detection, we must also include remediation. But when AI loses touch with reality due to hallucinations, confabulation and misinterpretation, who does the remediation? In cases of