Andreas Mueller (@amuellerml)'s Twitter Profile
Andreas Mueller

@amuellerml

Machine learner, Python geek and scikit-learn developer.
Principal Research SDE @AzureData @Microsoft. Posting on LinkedIn now.

ID: 471550563

Link: http://amueller.github.io
Joined: 23-01-2012 00:40:44

9,9K Tweet

48,48K Followers

1,1K Following

Nick Erickson (@innixma)

Kaggle's (@Kaggle) latest competition's top 11 highest scoring notebooks all use 🚀@AutoGluon AutoML🚀 to achieve their strong performance! When I said that AutoGluon 1.0 was the largest jump in the state-of-the-art in 4 years, I meant it. Competition: kaggle.com/competitions/p…

Julien Le Dem (@j_)

The rumors are true! I started a(nother) blog. sympathetic.ink The first post is an adaptation of my talk, recalling the past 10+ years of building open source standards and the lessons learned along the way. sympathetic.ink/2024/01/24/Ten…

Ibis (@ibisdata)

We often get questions around why Voltron Data supports the Ibis project -- we've answered them here! TL;DR: open standards are critical for the composable data ecosystem and tightly coupling Python dataframes to execution engines is bad for everyone ibis-project.org/posts/why-voda…

Yann LeCun (@ylecun)

🥁 Llama3 is out 🥁 8B and 70B models available today. 8k context length. Trained with 15 trillion tokens on a custom-built 24k GPU cluster. Great performance on various benchmarks, with Llama3-8B doing better than Llama2-70B in some cases. More versions are coming over the next

Andrej Karpathy (@karpathy)

Congrats to AI at Meta on Llama 3 release!! 🎉 ai.meta.com/blog/meta-llam… Notes: Releasing 8B and 70B (both base and finetuned) models, strong-performing in their model class (but we'll see when the rankings come in @ LMSYS Org :)) 400B is still training, but already encroaching

William Fedus (@liamfedus)

GPT-4o is our new state-of-the-art frontier model. We’ve been testing a version on the LMSys arena as im-also-a-good-gpt2-chatbot 🙂. Here’s how it’s been doing.

William Fedus (@liamfedus)

Not only is this the best model in the world, but it's available for free in ChatGPT, which has never before been the case for a frontier model.

Andy Pavlo (@andypavlo.bsky.social) (@andy_pavlo)

Columnar file formats like Parquet/ORC are ubiquitous. Our VLDB paper with Xinyu Zeng + Huanchen Zhang + Wes McKinney studies their internals. TLDR: They're not optimized for modern hardware. Something new is needed. Paper: vldb.org/pvldb/vol17/p1… Code: github.com/XinyuZeng/Eval…

DuckDB (@duckdb)

We are proud to release the first major version of DuckDB, v1.0.0, codenamed "Snow Duck". This version is a culmination of almost six years of research and development. Today we are shipping an innovative database system with a backwards-compatible storage format. Check out our

Matthias Feurer (@__mfeurer__)

Wondering how humans should be involved in designing #AutoML solutions 🤔? Check out our #ICML2024 paper: "Position: A Call to Action for a Human-Centered AutoML Paradigm"! 📄✨ proceedings.mlr.press/v235/lindauer2… Drop by at our poster on Thu, Jul 25 at 11:30 AM in Hall C 4-9 #2003 📅 1/3

François Chollet (@fchollet)

Today OpenAI announced o3, its next-gen reasoning model. We've worked with OpenAI to test it on ARC-AGI, and we believe it represents a significant breakthrough in getting AI to adapt to novel tasks. It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task

Frank Hutter (@frankrhutter)

The data science revolution is getting closer. TabPFN v2 is published in Nature: nature.com/articles/s4158… On tabular classification with up to 10k data points & 500 features, in 2.8s TabPFN on average outperforms all other methods, even when tuning them for up to 4 hours🧵1/19

Andreas Mueller (@amuellerml)

New preprint arxiv.org/abs/2502.05392 "Open Challenges in Time Series Anomaly Detection: An Industry Perspective". This is a vision paper about what I think is missing from current research in time series anomaly detection, and how it could align better with practical applications.

Nick Erickson (@innixma)

📢 We are excited to announce "#FMSD: 1st Workshop on Foundation Models for Structured Data" has been accepted to #ICML 2025! Call for Papers: icml-structured-fm-workshop.github.io/call-for-paper…

Satya Nadella (@satyanadella)

Open protocols like A2A and MCP are key to enabling the agentic web. With A2A support coming to Copilot Studio and Foundry, customers can build agentic systems that interoperate by design.