Emiliano Penaloza (@emilianopp_) 's Twitter Profile
Emiliano Penaloza

@emilianopp_

PhD Student @Mila_Quebec

Interested in developing AI systems that prioritize user-focused and interpretable decision-making.

ID: 1841991137562595328

Website: https://emilianopp.com · Joined: 03-10-2024 23:58:19

31 Tweets

32 Followers

178 Following

Emmett Shear (@eshear) 's Twitter Profile Photo

Once again I am recommending that you NOT optimize your intelligent agents around usage metrics. Please. Pretty please. Just stop. I know that's how you optimize other products like recommendation engines but these things are NOT THE SAME.

Hafez Ghaemi (@hafezghm) 's Twitter Profile Photo

🚨 Preprint Alert 🚀 📄 seq-JEPA: Autoregressive Predictive Learning of Invariant-Equivariant World Models arxiv.org/abs/2505.03176 Can we simultaneously learn both transformation-invariant and transformation-equivariant representations with self-supervised learning (SSL)?

Siddarth Venkatraman (@siddarthv66) 's Twitter Profile Photo

Trajectory-level objectives (RLOO, GRPO) seem better than value-function-based methods (Q-learning, PPO) for LLM training. Quite unfortunate. Trajectory-level RL can't do skill stitching, which makes it extremely sample inefficient for learning compositional reasoning (such as multi-turn tool use).
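The distinction in the tweet above can be sketched in a few lines. This is an illustrative, simplified contrast (all function names are hypothetical, not from any specific codebase): GRPO-style methods assign one group-normalized scalar advantage to an entire trajectory, while value-based methods compute per-step TD errors, which is what allows credit to attach to individual steps and sub-trajectories to be "stitched" together.

```python
def grpo_advantages(rewards):
    """Trajectory-level (GRPO-style) credit assignment: each sampled
    trajectory gets one scalar reward, and its advantage is that reward
    normalized against the group. Every token in the trajectory shares
    the same advantage, so no per-step credit is assigned."""
    mean = sum(rewards) / len(rewards)
    var = sum((r - mean) ** 2 for r in rewards) / len(rewards)
    std = var ** 0.5 or 1.0  # guard against a zero-variance group
    return [(r - mean) / std for r in rewards]


def td_advantages(rewards, values, gamma=1.0):
    """Value-based (PPO-style) credit assignment: a learned value
    function yields per-step TD errors, so credit attaches to individual
    steps. This is what enables stitching: good sub-trajectories can be
    reinforced even inside an overall-mediocre rollout."""
    advantages = []
    for t in range(len(rewards)):
        next_value = values[t + 1] if t + 1 < len(values) else 0.0
        advantages.append(rewards[t] + gamma * next_value - values[t])
    return advantages
```

In the trajectory-level case, a rollout that makes one good tool call followed by a bad one receives a single blended score; in the per-step case, the good call can still earn positive advantage on its own.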

Siddarth Venkatraman (@siddarthv66) 's Twitter Profile Photo

Is there a universal strategy to turn any generative model—GANs, VAEs, diffusion models, or flows—into a conditional sampler, or finetuned to optimize a reward function? Yes! Outsourced Diffusion Sampling (ODS) accepted to ICML Conference , does exactly that!

Benjamin Thérien (@benjamintherien) 's Twitter Profile Photo

Is AdamW the best inner optimizer for DiLoCo? Does the inner optimizer affect the compressibility of the DiLoCo delta? Excited to introduce MuLoCo: Muon is a practical inner optimizer for DiLoCo! 🧵arxiv.org/abs/2505.23725 1/N

Avery Ryoo (@averyryoo) 's Twitter Profile Photo

Super stoked to share my first first-author paper that introduces a hybrid architecture approach for real-time neural decoding. It's been a lot of work, but happy to showcase some very cool results!

Majdi Hassan (@majdi_has) 's Twitter Profile Photo

(1/n)🚨You can train a model solving DFT for any geometry almost without training data!🚨 Introducing Self-Refining Training for Amortized Density Functional Theory — a variational framework for learning a DFT solver that predicts the ground-state solutions for different

Luke Rowe (@luke22r) 's Twitter Profile Photo

🚀 Our method, Poutine, was the best-performing entry in the 2025 Waymo Vision-based End-to-End Driving Challenge at #CVPR2025! Our 3B-parameter VLM Poutine scored 7.99 RFS on the official test set—comfortably ahead of every other entry (see figure).
