Yujin Kim (@yujin301300)'s Twitter Profile
Yujin Kim

@yujin301300

ID: 1581187924993142784

Joined: 15-10-2022 07:39:29

5 Tweets

17 Followers

75 Following

Xiaotian (Max) Han (@xiaotianhan1)

📢 [New Research] Introducing Speculative Thinking—boosting small LLMs by leveraging large-model mentorship.

Why?
- Small models generate overly long responses, especially when incorrect.
- Large models offer concise, accurate reasoning patterns.
- Wrong reasoning (thoughts) is
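
The thread above is cut off, but the mentorship loop it describes (the small model drafts the reasoning and the large model steps in when the draft rambles) can be sketched roughly as follows. The `small_generate` and `large_generate` stubs and the rambling trigger below are illustrative assumptions, not the paper's actual algorithm.

```python
# Toy sketch of large-model "mentorship" during small-model reasoning.
# The generate functions are stand-ins for real model calls; the trigger
# heuristic (overly long or hesitant segments) is an assumption, not the
# paper's actual rule.
from typing import Callable

def speculative_thinking(
    prompt: str,
    small_generate: Callable[[str], str],   # returns the next reasoning segment
    large_generate: Callable[[str], str],   # returns a concise replacement segment
    max_segments: int = 8,
    max_segment_chars: int = 400,
) -> str:
    """Let the small model reason, but hand rambling segments to the large model."""
    hesitation_markers = ("wait,", "hmm", "let me re-check", "actually, no")
    transcript = prompt
    for _ in range(max_segments):
        segment = small_generate(transcript)
        rambling = len(segment) > max_segment_chars or any(
            m in segment.lower() for m in hesitation_markers
        )
        if rambling:
            # Large model supplies a shorter, cleaner continuation instead.
            segment = large_generate(transcript)
        transcript += segment
        if "FINAL ANSWER" in segment:
            break
    return transcript

# Tiny demo with fake "models" so the sketch runs end to end.
if __name__ == "__main__":
    small = lambda ctx: "Hmm, wait, let me re-check that step... " * 3
    large = lambda ctx: "The sum is 42. FINAL ANSWER: 42\n"
    print(speculative_thinking("Q: What is 21 + 21?\n", small, large))
```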
ℏεsam (@hesamation)

a new article just dropped on "the state of LLM reasoning models". 

if you hear about test-time compute a lot, but don't actually know what it is, this is a great article.

Sebastian Raschka covered 12 of the major papers in test-time compute.
Gradio (@gradio)

🚀 New Research: Self-training inspires clear and concise thinking in LLMs! Paper achieves a 30% reduction in output tokens across five model families on GSM8K and MATH while maintaining average accuracy 👀
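
The tweet only reports the headline numbers. One common shape for such a self-training recipe, shown below purely as an assumption about how it might work, is to sample several solutions per problem, keep the correct ones, and fine-tune on the shortest; `sample_solutions` and `is_correct` are hypothetical stand-ins.

```python
# Hypothetical sketch of a "train on your own shortest correct solution" loop.
# The selection rule (shortest correct sample per problem) is an assumption
# about how conciseness could be self-trained, not the paper's stated method.
from typing import Callable, List

def build_concise_sft_set(
    problems: List[dict],                       # each: {"question": ..., "answer": ...}
    sample_solutions: Callable[[str, int], List[str]],
    is_correct: Callable[[str, str], bool],
    samples_per_problem: int = 8,
) -> List[dict]:
    """Collect (question, solution) pairs using the shortest correct self-sample."""
    sft_examples = []
    for p in problems:
        candidates = sample_solutions(p["question"], samples_per_problem)
        correct = [c for c in candidates if is_correct(c, p["answer"])]
        if not correct:
            continue                            # skip problems the model never solves
        best = min(correct, key=len)            # prefer the most concise correct trace
        sft_examples.append({"prompt": p["question"], "completion": best})
    return sft_examples
```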

Rohan Paul (@rohanpaul_ai)

Small language models struggle with complex reasoning tasks where large models excel.

This paper introduces the SMART framework, where a small model performs reasoning but selectively requests corrections from a large model only for steps identified as uncertain via a scoring
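
The tweet is cut off before naming the scoring mechanism, but the escalation pattern it describes can be sketched as below. The per-step confidence score, the threshold, and the `small_step`/`large_step` stubs are assumptions for illustration, not SMART's actual components.

```python
# Hypothetical sketch of uncertainty-gated escalation in the spirit of the
# SMART framework described above: the small model proposes each reasoning
# step, and only low-confidence steps are rewritten by the large model.
from typing import Callable, List

def smart_style_reasoning(
    question: str,
    small_step: Callable[[str], str],          # small model proposes the next step
    step_score: Callable[[str, str], float],   # confidence in that step, in [0, 1]
    large_step: Callable[[str], str],          # large model rewrites an uncertain step
    threshold: float = 0.7,
    max_steps: int = 10,
) -> List[str]:
    steps: List[str] = []
    context = question
    for _ in range(max_steps):
        proposal = small_step(context)
        if step_score(context, proposal) < threshold:
            proposal = large_step(context)     # escalate only the uncertain step
        steps.append(proposal)
        context += "\n" + proposal
        if proposal.startswith("Answer:"):
            break
    return steps
```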
Deedy (@deedydas)

Google DeepMind just dropped this new LLM model architecture called Mixture-of-Recursions.

It gets 2x inference speed, reduced training FLOPs and ~50% reduced KV cache memory. Really interesting read.

Has potential to be a Transformers killer.
Rohan Paul (@rohanpaul_ai)

This is quite a landmark paper from Google DeepMind

📌 2x faster inference because tokens exit the shared loop early.

📌 During training it cuts the heavy math, dropping attention FLOPs per layer by about half, so the same budget trains on more data.

Shows a fresh way to
alphaXiv (@askalphaxiv)

"experts" for harder tokens? "Mixture-of-Recursions (MoR): Learning Dynamic Recursive Depths for Adaptive Token-Level Computation" MoR makes one shared Transformer block loop only for tokens that need extra thought, delivering quality with half the weights & twice the speed

"experts" for harder tokens?

"Mixture-of-Recursions (MoR): Learning Dynamic Recursive Depths for Adaptive Token-Level Computation"

MoR makes one shared Transformer block loop only for tokens that need extra thought, delivering quality with half the weights & twice the speed
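
All three MoR tweets describe the same mechanism: one shared Transformer block applied repeatedly, with a per-token router deciding whether a token takes another recursion pass or exits early. A minimal PyTorch sketch of that idea follows; the linear router, the fixed recursion cap, and the mask-based update are illustrative assumptions and do not reproduce the paper's exact design (a real implementation would gather active tokens to actually save compute and would manage the KV cache accordingly).

```python
# Minimal sketch of the Mixture-of-Recursions idea: one shared Transformer
# block applied repeatedly, with a learned router deciding per token whether
# to take another recursion step or exit early. Router, depth cap, and the
# "update only active tokens" masking are illustrative assumptions.
import torch
import torch.nn as nn

class MixtureOfRecursionsSketch(nn.Module):
    def __init__(self, d_model: int = 256, nhead: int = 4, max_recursions: int = 4):
        super().__init__()
        self.shared_block = nn.TransformerEncoderLayer(
            d_model, nhead, dim_feedforward=4 * d_model, batch_first=True
        )
        self.router = nn.Linear(d_model, 1)   # per-token "recurse again?" score
        self.max_recursions = max_recursions

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq, d_model); all tokens start as "active".
        active = torch.ones(x.shape[:2], dtype=torch.bool, device=x.device)
        for _ in range(self.max_recursions):
            if not active.any():
                break
            updated = self.shared_block(x)                     # shared weights each pass
            x = torch.where(active.unsqueeze(-1), updated, x)  # exited tokens stay frozen
            keep_going = torch.sigmoid(self.router(x)).squeeze(-1) > 0.5
            active = active & keep_going                       # tokens may exit, never re-enter
        return x

if __name__ == "__main__":
    model = MixtureOfRecursionsSketch()
    hidden = torch.randn(2, 16, 256)
    print(model(hidden).shape)   # torch.Size([2, 16, 256])
```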
The AI Timeline (@theaitimeline)

🚨This week's top AI/ML research papers:

- Mixture-of-Recursions
- Scaling Laws for Optimal Data Mixtures
- Training Transformers with Enforced Lipschitz Constants
- Reasoning or Memorization?
- How Many Instructions Can LLMs Follow at Once?
- Chain of Thought Monitorability
-
Sangmin Bae (@raymin0223)

✨Huge thanks for interest in Mixture-of-Recursions! Codes are officially out!

It's been a long journey exploring Early-exiting with Recursive Architecture.
I'll soon post my 👨‍🎓PhD thesis on Adaptive Computation too!

Code: github.com/raymin0223/mix…
Paper: arxiv.org/abs/2507.10524