mehdi cherti (@mehdidc) 's Twitter Profile
mehdi cherti

@mehdidc

PostDoc at Jülich Supercomputing Center (JSC), Germany / LAION.

ID: 101535747

linkhttps://mehdidc.github.io calendar_today03-01-2010 18:03:23

727 Tweet

339 Followers

779 Following

Alicia Curth (@aliciacurth) 's Twitter Profile Photo

Why do Random Forests perform so well off-the-shelf & appear essentially immune to overfitting?!? I’ve found the text-book answer “it’s just variance reduction 🤷🏼‍♀️” to be a bit too unspecific, so in our new pre-print arxiv.org/abs/2402.01502, Alan Jeffares & I investigate..🕵🏼‍♀️ 1/n

Why do Random Forests perform so well off-the-shelf & appear essentially immune to overfitting?!?

I’ve found the text-book answer “it’s just variance reduction 🤷🏼‍♀️” to be a bit too unspecific, so in our new pre-print arxiv.org/abs/2402.01502, <a href="/Jeffaresalan/">Alan Jeffares</a> &amp; I investigate..🕵🏼‍♀️ 1/n
Alexandre Gramfort (@agramfort) 's Twitter Profile Photo

For those of you who were wondering what I’ve been doing since I joined Meta Reality Labs late 2022. Here is the first detailed scientific communication about our work. You can read the paper at: biorxiv.org/content/10.110…

Massimo (@rainmaker1973) 's Twitter Profile Photo

Beluga whales love to play, scare, joke and generally interact with humans. This compilation is a good example. [📹 aquariumadvicesa] x.com/i/status/17635…

Tim Brooks (@_tim_brooks) 's Twitter Profile Photo

"fly through tour of a museum with many paintings and sculptures and beautiful works of art in all styles" Video generated by #Sora

Shyamgopal Karthik (@shyamgopalkart1) 's Twitter Profile Photo

Do you want to improve the performance of your text-to-image model without any training? That too by just looking for a better initialization noise? Sounds too good to be true? 🧵👇 x.com/LucaEyring/sta…

Tomer Porian (@tomerporian) 's Twitter Profile Photo

🧵1/8 We resolve the discrepancy between the compute optimal scaling laws of Kaplan (exponent 0.88, Figure 14, left) et al. and Hoffmann et al. (“Chinchilla”, exponent 0.5). Paper: arxiv.org/abs/2406.19146 Data + Code: github.com/formll/resolvi…

🧵1/8 We resolve the discrepancy between the compute optimal scaling laws of Kaplan (exponent 0.88, Figure 14, left) et al. and Hoffmann et al. (“Chinchilla”, exponent 0.5).
Paper: arxiv.org/abs/2406.19146
Data + Code: github.com/formll/resolvi…
Vishaal Udandarao (@vishaal_urao) 's Twitter Profile Photo

Ever feel frustrated when you vaguely know what paper you want to cite but can't find it on Google? Can LM-based agents automatically find paper citations for you? Our new paper presents a tough new benchmark for this task along with an LM-based agent for finding citations.

Sam Rodriques (@sgrodriques) 's Twitter Profile Photo

Introducing PaperQA2, the first AI agent that conducts entire scientific literature reviews on its own. PaperQA2 is also the first agent to beat PhD and Postdoc-level biology researchers on multiple literature research tasks, as measured both by accuracy on objective benchmarks

Frank Hutter (@frankrhutter) 's Twitter Profile Photo

The data science revolution is getting closer. TabPFN v2 is published in Nature: nature.com/articles/s4158… On tabular classification with up to 10k data points & 500 features, in 2.8s TabPFN on average outperforms all other methods, even when tuning them for up to 4 hours🧵1/19

The data science revolution is getting closer. TabPFN v2 is published in Nature: nature.com/articles/s4158… On tabular classification with up to 10k data points &amp; 500 features, in 2.8s TabPFN on average outperforms all other methods, even when tuning them for up to 4 hours🧵1/19
Variety (@variety) 's Twitter Profile Photo

Director-writer David Lynch, who radicalized American film with with a dark, surrealistic artistic vision in films like “Blue Velvet” and “Mulholland Drive” and network television with “Twin Peaks,” has died. He was 78. bit.ly/40is3yQ

Director-writer David Lynch, who radicalized American film with with a dark, surrealistic artistic vision in films like “Blue Velvet” and “Mulholland Drive” and network television with “Twin Peaks,” has died. He was 78. bit.ly/40is3yQ
Thomas Wolf (@thom_wolf) 's Twitter Profile Photo

Finally took time to go over Dario's essay on DeepSeek and export control and to be honest it was quite painful to read. And I say this as a great admirer of Anthropic and big user of Claude* The first half of the essay reads like a lengthy attempt to justify that closed-source

Thomas Wolf (@thom_wolf) 's Twitter Profile Photo

After 6+ months in the making and burning over a year of GPU compute time, we're super excited to finally release the "Ultra-Scale Playbook" Check it out here: hf.co/spaces/nanotro… A free, open-source, book to learn everything about 5D parallelism, ZeRO, fast CUDA kernels,

After 6+ months in the making and burning over a year of GPU compute time, we're super excited to finally release the "Ultra-Scale Playbook"

Check it out here: hf.co/spaces/nanotro…

A free, open-source, book to learn everything about 5D parallelism, ZeRO, fast CUDA kernels,
Vishaal Udandarao (@vishaal_urao) 's Twitter Profile Photo

🚀New Paper! arxiv.org/abs/2504.07086 Everyone’s celebrating rapid progress in math reasoning with RL/SFT. But how real is this progress? We re-evaluated recently released popular reasoning models—and found reported gains often vanish under rigorous testing!! 👀 🧵👇

🚀New Paper!
arxiv.org/abs/2504.07086

Everyone’s celebrating rapid progress in math reasoning with RL/SFT. But how real is this progress?

We re-evaluated recently released popular reasoning models—and found reported gains often vanish under rigorous testing!! 👀

🧵👇
Chelsea Finn (@chelseabfinn) 's Twitter Profile Photo

Introducing π-0.5! The model works out of the box in completely new environments. Here the robot cleans new kitchens & bedrooms. 🤖 Detailed paper + videos in more than 10 unseen rooms: physicalintelligence.company/blog/pi05 A short thread 🧵

Ross Wightman (@wightmanr) 's Twitter Profile Photo

timm's got a new vision transformer (NaFlexVit), and it's flexible! I've been plugging away at this for a bit, integrating ideas from FlexiViT, NaViT, and NaFlex and finally ready to merge for initial exploration. The model supports: * variable aspect/size images of NaFlex (see

timm's got a new vision transformer (NaFlexVit), and it's flexible! I've been plugging away at this for a bit, integrating ideas from FlexiViT, NaViT, and NaFlex and finally ready to merge for initial exploration. The model supports:
* variable aspect/size images of NaFlex (see
Ludwig Schmidt (@lschmidt3) 's Twitter Profile Photo

Very excited to finally release our paper for OpenThoughts! After DataComp and DCLM, this is the third large open dataset my group has been building in collaboration with the DataComp community. This time, the focus is on post-training, specifically reasoning data.

Very excited to finally release our paper for OpenThoughts!

After DataComp and DCLM, this is the third large open dataset my group has been building in collaboration with the DataComp community. This time, the focus is on post-training, specifically reasoning data.
Jenia Jitsev 🏳️‍🌈 🇺🇦 🇮🇱 (@jjitsev) 's Twitter Profile Photo

When all of the sudden puzzle pieces fall into right places and predicting the unknown starts to work, those are rare beautiful moments I am grateful for in science. Made by rare minds of Marianna Tomer Porian mehdi cherti github.com/LAION-AI/scali… arxiv.org/abs/2506.04598