Michael Armbrust (@michaelarmbrust) 's Twitter Profile
Michael Armbrust

@michaelarmbrust

Lead developer of Spark SQL @databricks, formerly @ucberkeley. Distributed databases, query languages, scala, other nerdy stuff...

ID: 459949985

Link: http://www.cs.berkeley.edu/~marmbrus/ · Joined: 10-01-2012 06:56:28

272 Tweets

6.6K Followers

0 Following

Delta Lake (@deltalakeoss) 's Twitter Profile Photo

Welcome to Delta Lake, the open-source storage layer that brings ACID transactions to Apache Spark™ and big data workloads. Join us in our pursuit to address data reliability with data lakes: Delta.io

Reynold Xin (@rxin) 's Twitter Profile Photo

Delta Lake is now part of the Linux Foundation! EBs of data/month, in production at 1000s of organizations. Can't wait to see how the community will shape its future and establish it as a standard for data lakes. databricks.com/blog/2019/10/1…

Matei Zaharia (@matei_zaharia) 's Twitter Profile Photo

I gave a keynote at ACM SoCC about lessons from building a large-scale cloud service at Databricks. Did you know that Databricks runs millions of VMs/day to process exabytes of data with <200 engineers? Slides here: slideshare.net/matei/lessons-…

Ben Lorica 罗瑞卡 (@bigdata) 's Twitter Profile Photo

What Is a Data Lakehouse? 🆕 post on a data management paradigm for the age of #MachineLearning and #AI (written with some of the founders at Databricks: Reynold Xin, Matei Zaharia, Michael Armbrust and Ali Ghodsi) databricks.com/blog/2020/01/3…

Ali Ghodsi (@alighodsi) 's Twitter Profile Photo

Excited that our #Lakehouse paper got published at #CIDR21: it shares our vision of the Lakehouse: a new type of data platforms that are completely open, have full support for #machinelearning, while supporting all traditional #datawarehouse workloads. cidrdb.org/cidr2021/paper…

Ali Ghodsi (@alighodsi) 's Twitter Profile Photo

Bloomberg Technology, Emily Chang: Why do you have to pick between diversity and merit? Make your work env. inclusive, decrease bias in hiring, as a leader don't make statements that alienate large groups. That'll give you a competitive advantage to a talent pool that won't join unwelcoming companies.

Michael Armbrust (@michaelarmbrust) 's Twitter Profile Photo

As the #lakehouse gains momentum, we've been getting a lot of questions about what it means and how it compares to other architectures. Here are some answers to these and other common questions! databricks.com/blog/2021/08/3…

Matei Zaharia (@matei_zaharia) 's Twitter Profile Photo

Databricks just set a new record on the official TPC-DS data warehousing benchmark, showing that a lakehouse system based on open data formats can outperform previous DW systems. Don't listen to folks who say open means bad performance! databricks.com/blog/2021/11/0…

Databricks (@databricks) 's Twitter Profile Photo

Delta Lake just got even better. Meet #DeltaLake 2.0, now *entirely* open source. Michael Armbrust shares how the latest features improve performance & manageability. #DataAISummit

Matei Zaharia (@matei_zaharia) 's Twitter Profile Photo

Insightful benchmark of Linux Foundation Delta Lake and Apache Iceberg by Brooklyn Data Co. (a Velir company) that shows Delta is up to 8x faster in workloads with data updates. Most storage benchmarks only test reads, but with updates, care is needed to maintain performance. brooklyndata.co/blog/benchmark…