/MachineLearning (@slashml)'s Twitter Profile

/MachineLearning
@slashml

ID: 806206253634457600
Joined: 06-12-2016 18:38:44
15.15K Tweets · 122.12K Followers · 1 Following

Haider. (@slow_developer) 's Twitter Profile Photo

Geoffrey Hinton says he's more optimistic now, not because we'll control AI, but because we might not need to: "Don't try to dominate superintelligence; design it to care, like a mother wired to protect her child." Control through attachment, not power. We want AI to be like that.

Tim Dettmers (@tim_dettmers) 's Twitter Profile Photo

It feels like the coding-agent frontier is now open-weights: GLM 4.5 costs only $3/month and is on par with Sonnet. Kimi K2.1 Turbo is 3x the speed and 7x cheaper vs Opus 4.1, but just as good. Kimi K2.1 feels clean; the best model for me. GPT-5 is only good for complicated specs -- too slow.

MBZUAI (@mbzuai) 's Twitter Profile Photo

Introducing K2 Think - a breakthrough in advanced AI reasoning. Developed by MBZUAI’s Institute of Foundation Models and G42, K2 Think delivers frontier reasoning performance at a fraction of the size of today’s largest systems. Smaller. Smarter. Open to the world.

RoboHub🤖 (@xrobohub) 's Twitter Profile Photo

A new dexterous hand is here. DexcelRobotics, a startup founded by a former core member of Tencent Robotics X, has launched its first product, the Apex Hand. The company claims it's the first in the industry capable of operating a cell phone with a single hand. The Apex Hand is…

Ethan Mollick (@emollick) 's Twitter Profile Photo

It turned out that model collapse didn't happen. I think there are many reasons to be skeptical of AI lab claims (and point out bad predictions & watch for bubbles) but I also think it is worth reflecting that "AI development is going to stop" arguments have been wrong so far.

Daniel Han (@danielhanchen) 's Twitter Profile Photo

DeepSeek V3.2 breakdown:
1. Sparse attention via lightning indexer + top_k attention
2. Uses V3.1 Terminus + 1T continued pretraining tokens
3. 5 specialized models (coding, math, etc.) via RL, then distillation for the final ckpt
4. GRPO. Reward functions for length penalty, language…
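
Item 1's "lightning indexer + top_k attention" is easiest to see in a toy sketch: a cheap low-dimensional scorer ranks every key for each query, and full attention runs only over the winners. This is an illustrative, non-causal sketch, not DeepSeek's actual implementation; all names and sizes here are assumptions.

```python
import torch
import torch.nn.functional as F

def topk_sparse_attention(q, k, v, index_q, index_k, top_k=64):
    """Toy top-k sparse attention: a low-dim indexer scores all
    query-key pairs cheaply, then each query attends only to its
    top_k keys at full precision. Shapes: q, k, v are (seq, d);
    index_q, index_k are (seq, d_index) with d_index << d."""
    seq, d = q.shape
    idx_scores = index_q @ index_k.T                 # cheap (seq, seq) scores
    _, keep = idx_scores.topk(min(top_k, seq), -1)   # (seq, top_k) key ids
    k_sel, v_sel = k[keep], v[keep]                  # (seq, top_k, d)
    attn = torch.einsum("sd,skd->sk", q, k_sel) / d ** 0.5
    return torch.einsum("sk,skd->sd", F.softmax(attn, -1), v_sel)

# Toy usage; in a real model index_q/index_k come from small learned maps.
seq, d, d_idx = 128, 64, 16
q, k, v = (torch.randn(seq, d) for _ in range(3))
out = topk_sparse_attention(q, k, v, torch.randn(seq, d_idx),
                            torch.randn(seq, d_idx), top_k=32)
print(out.shape)  # torch.Size([128, 64])
```
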
Qwen (@alibaba_qwen) 's Twitter Profile Photo

🚀 Qwen3-VL-30B-A3B-Instruct & Thinking are here!
Smaller size, same powerhouse performance 💪, packed with all the capabilities of Qwen3-VL!

🔧 With just 3B active params, it’s rivaling GPT-5-Mini & Claude4-Sonnet, and often beating them across STEM, VQA, OCR, Video, Agent…
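
The "30B-A3B" name follows the usual MoE convention: roughly 30B total parameters with about 3B active per token, since only a few experts run per token. A back-of-the-envelope sketch of that arithmetic; the expert counts below are illustrative assumptions, not Qwen's actual config.

```python
def moe_active_params(expert_params, num_experts, top_k, shared_params):
    """Rough active-parameter count for an MoE model: per token only
    top_k of num_experts experts run, while attention/embedding/router
    weights (shared_params) are always active."""
    return shared_params + expert_params * top_k / num_experts

# Illustrative numbers only: 28B of expert weights, 128 experts,
# 8 routed per token, ~1B always-active shared weights.
print(f"{moe_active_params(28e9, 128, 8, 1e9) / 1e9:.2f}B active")  # 2.75B active
```
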
Reflection AI (@reflection_ai) 's Twitter Profile Photo

Today we're sharing the next phase of Reflection. We're building frontier open intelligence accessible to all. We've assembled an extraordinary AI team, built a frontier LLM training stack, and raised $2 billion. Why Open Intelligence Matters: Technological and scientific…

Qwen (@alibaba_qwen) 's Twitter Profile Photo

Introducing the compact, dense versions of Qwen3-VL, now available in 4B and 8B pairs, each with both Instruct and Thinking variants.

✅ Lower VRAM usage
✅ Full Qwen3-VL capabilities retained
✅ Strong performance across the board

Despite their size, they outperform models…
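
The lower-VRAM claim is mostly weights-times-bytes arithmetic. A rough sketch of why the 4B/8B dense variants fit on small GPUs (weights only; KV cache, activations, and framework overhead are ignored):

```python
def weights_vram_gib(n_params, bytes_per_param):
    """Weights-only memory estimate; real usage adds KV cache,
    activations, and runtime overhead."""
    return n_params * bytes_per_param / 1024 ** 3

for n in (4e9, 8e9, 30e9):
    print(f"{n / 1e9:.0f}B params: {weights_vram_gib(n, 2):5.1f} GiB at bf16, "
          f"{weights_vram_gib(n, 0.5):4.1f} GiB at int4")
```
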
Andrej Karpathy (@karpathy) 's Twitter Profile Photo

Replying to Avijit Thawani (Avi): Haha. I am afraid people interpreted my “delete tokenizer” as “use bytes directly without BPE”; the issue is you *still* need the arbitrariness of a byte encoding even for that! Pixels are the only way. Just like humans. It is written. If GPT-10 uses UTF-8 at the input I will eat a shoe.
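
The "bytes encoding arbitrariness" point is easy to demonstrate: the same string maps to different byte sequences under different encodings, so a byte-level model still inherits one arbitrary convention. A quick illustration:

```python
# Identical text, three different "raw byte" views -- byte-level input
# still bakes in an encoding choice, which is the arbitrariness Karpathy
# means (pixels sidestep it entirely).
s = "café"
print(list(s.encode("utf-8")))      # [99, 97, 102, 195, 169]
print(list(s.encode("utf-16-le")))  # [99, 0, 97, 0, 102, 0, 233, 0]
print(list(s.encode("latin-1")))    # [99, 97, 102, 233]
```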

Greg Kamradt (@gregkamradt) 's Twitter Profile Photo

ARC Prize announces all validated scores on ARC-AGI.

We have not verified MythWorx's 100% claim in their recent fundraise press release ($100M valuation).

We would be open to verifying their score (assuming it passes the testing policy) for the founder and their investors.

Yuchen Jin (@yuchenj_uw) 's Twitter Profile Photo

Meta laid off 600 people from its Superintelligence Lab today. Many FAIR researchers, including FAIR Research Scientist Director Yuandong Tian, were affected. I think Yann LeCun will leave soon. Maybe I should raise $2B and start a new frontier lab with these folks.

Sakana AI (@sakanaailabs) 's Twitter Profile Photo

Sakana AI’s CTO says he’s “absolutely sick” of transformers, the tech that powers every major AI model. “You should only do the research that wouldn’t happen if you weren’t doing it.” 🧠 Llion Jones (via Brian Cheung) venturebeat.com/ai/sakana-ais-…

Julian Ibarz (@julianibarz) 's Twitter Profile Photo

I disagree with Yann LeCun on this. We have a pretty good idea at Tesla of how we can make general humanoids a reality very quickly. Funny anecdote: Yann was advising me to launch what became the first production vision-based deep neural network at Google. His feedback: use convs, …

echo.hive (@hive_echo) 's Twitter Profile Photo

Spiking Neural Network from scratch achieves 8% accuracy, with no backpropagation or SGD. I created a genetic hyperparameter optimizer, and it now gets 8% accuracy on average, which is ~3% above chance. Link to source code, with a detailed video and markdown explanations, in the comments.
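
For reference, a genetic hyperparameter optimizer in minimal form looks like the sketch below. A toy objective stands in for "train the SNN and return accuracy", and the population size, mutation rate, and bounds are assumptions, not the author's settings.

```python
import random

def genetic_search(fitness, bounds, pop_size=20, generations=30,
                   mutation_rate=0.3, elite=4):
    """Tiny genetic optimizer: keep the elite, breed the rest via
    uniform crossover plus Gaussian mutation clipped to each bound."""
    pop = [[random.uniform(lo, hi) for lo, hi in bounds]
           for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents, children = pop[:elite], []
        while len(children) < pop_size - elite:
            a, b = random.sample(parents, 2)
            child = [random.choice(pair) for pair in zip(a, b)]  # crossover
            for i, (lo, hi) in enumerate(bounds):                # mutation
                if random.random() < mutation_rate:
                    child[i] = min(hi, max(lo, child[i] + random.gauss(0, 0.1 * (hi - lo))))
            children.append(child)
        pop = parents + children
    return max(pop, key=fitness)

# Toy stand-in for an SNN training run; optimum at lr=0.01, decay=0.9.
toy_fitness = lambda p: -((p[0] - 0.01) ** 2 + (p[1] - 0.9) ** 2)
print(genetic_search(toy_fitness, bounds=[(1e-4, 0.1), (0.0, 1.0)]))
```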

Causal Wizard (@causalwizard) 's Twitter Profile Photo

HRM-Agent: Using the Hierarchical Reasoning Model in Reinforcement Learning
Paper: arxiv.org/abs/2510.22832

The Hierarchical Reasoning Model (HRM) has impressive reasoning abilities given its small size, but has only been applied to supervised, static, fully-observable problems.
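
For context, HRM's core design (per the original HRM paper) pairs a slow high-level recurrent module with a fast low-level one that runs several steps per high-level update. A minimal sketch of that two-timescale recurrence; the module sizes and update period here are illustrative assumptions, not the paper's configuration.

```python
import torch
import torch.nn as nn

class TwoTimescaleCore(nn.Module):
    """Sketch of HRM-style recurrence: the low-level GRU updates every
    step (conditioned on the high-level state), while the high-level GRU
    updates only every `period` steps from the low-level state."""
    def __init__(self, d_in=32, d_lo=128, d_hi=128, period=4):
        super().__init__()
        self.period = period
        self.low = nn.GRUCell(d_in + d_hi, d_lo)
        self.high = nn.GRUCell(d_lo, d_hi)

    def forward(self, x_seq):  # x_seq: (T, d_in)
        h_lo = x_seq.new_zeros(1, self.low.hidden_size)
        h_hi = x_seq.new_zeros(1, self.high.hidden_size)
        for t, x in enumerate(x_seq):
            inp = torch.cat([x.unsqueeze(0), h_hi], dim=1)
            h_lo = self.low(inp, h_lo)        # fast update, every step
            if (t + 1) % self.period == 0:
                h_hi = self.high(h_lo, h_hi)  # slow update
        return h_lo, h_hi

h_lo, h_hi = TwoTimescaleCore()(torch.randn(16, 32))
print(h_lo.shape, h_hi.shape)  # torch.Size([1, 128]) torch.Size([1, 128])
```
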
Rishabh Agarwal (@agarwl_) 's Twitter Profile Photo

The trick below to align tokens with different tokenizers is a cute idea -- this allows you to run on-policy distillation with teacher logprobs for sampled tokens even when student and teacher belong to different model families (e.g., Qwen vs Llama).

There's more we need to do…
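
One common way to implement that kind of alignment (the exact trick in the thread may differ) is to match character-offset boundaries of the two tokenizations: wherever both tokenizers break at the same offset, the spans between consecutive shared boundaries correspond one-to-one, so span-summed teacher logprobs can supervise the matching student span. A self-contained sketch over plain token strings:

```python
def shared_boundaries(student_tokens, teacher_tokens):
    """Character offsets where two tokenizations of the SAME text both
    place a token boundary. Between consecutive shared boundaries, the
    student span and teacher span cover identical text, so teacher
    logprobs summed over the span can supervise the student span."""
    def boundaries(tokens):
        offsets, pos = [], 0
        for tok in tokens:
            pos += len(tok)
            offsets.append(pos)
        return offsets
    s, t = boundaries(student_tokens), boundaries(teacher_tokens)
    assert s[-1] == t[-1], "both tokenizations must cover the same text"
    return sorted(set(s) & set(t))

# Same text, two hypothetical tokenizations:
student = ["The ", "cat ", "sat", "."]
teacher = ["The", " cat", " ", "sat."]
print(shared_boundaries(student, teacher))  # [8, 12]
```
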
Rosinality (@rosinality) 's Twitter Profile Photo

FP16 can have a smaller training-inference gap than BFloat16, and thus fits RL better. Even the differences between RL algorithms vanish once FP16 is adopted. Surprising!

Penghui Qi (@qphutu) 's Twitter Profile Photo

🚀Excited to share our new work!

💊Problem: The BF16 precision causes a large training-inference mismatch, leading to unstable RL training.

💡Solution: Just switch to FP16.

🎯That's it.

📰Paper: arxiv.org/pdf/2510.26788
⭐️Code: github.com/sail-sg/Precis…
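
The BF16-vs-FP16 gap is visible at the level of plain rounding error: BF16 stores 7 mantissa bits against FP16's 10, so it rounds typical activation-scale values roughly 8x more coarsely, and that mismatch between training and inference kernels is what the paper blames for RL instability. A quick measurement sketch (rounding only; not a reproduction of the paper's experiments):

```python
import torch

# Round-trip fp32 -> half precision -> fp32 and measure the error.
# fp16: 10 mantissa bits; bf16: 7 mantissa bits (but wider exponent).
x = torch.randn(1_000_000)
for dtype in (torch.float16, torch.bfloat16):
    err = (x - x.to(dtype).float()).abs().mean().item()
    print(f"{dtype}: mean abs round-trip error = {err:.2e}")
# Typically ~1e-4 for fp16 vs ~1e-3 for bf16 on unit-scale values.
```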