Robert Lange (@roberttlange) Twitter Tweets • TwiCopy

Robert Lange

@roberttlange

+ Follow

Founding Research Scientist @SakanaAILabs
💬 Agentic Discovery 🔬 AI Scientist 🧬 EvoLLM
🏋️ gymnax 🦎 evosax 🤹 MLE-Infra
Ex: SR @Google DM. DeepMind Intern

ID: 856432904108421120

linkhttp://roberttlange.com calendar_today24-04-2017 09:01:30

571 Tweet

8,8K Followers

572 Following

hardmaru

@hardmaru

3 months ago

Proud to announce Sakana AI’s partnership with Hokkoku Bank, based in the Ishikawa Prefecture of Japan. Sakana AI will deliver bank-specific AI-powered tools to Hokkoku Bank. We aim for this partnership to serve as a model case for other regional banks in Japan.

thumb_up_off_alt106

chat_bubble_outline3

repeat12

shareShare

jack

@jack

3 months ago

arxiv.org/abs/2505.22954

thumb_up_off_alt640

chat_bubble_outline86

repeat106

shareShare

Sakana AI

@sakanaailabs

3 months ago

We’re excited to introduce Text-to-LoRA: a Hypernetwork that generates task-specific LLM adapters (LoRAs) based on a text description of the task. Catch our presentation at #ICML2025! Paper: arxiv.org/abs/2506.06105 Code: github.com/SakanaAI/Text-… Biological systems are capable of

thumb_up_off_alt1,1K

chat_bubble_outline38

repeat380

shareShare

hardmaru

@hardmaru

3 months ago

Text-to-LoRA: Instant Transformer Adaption arxiv.org/abs/2506.06105 Generative models can produce text, images, video. They should also be able to generate models! Here, we trained a Hypernetwork to generate new task-specific LoRAs by simply describing the task as a text prompt.

thumb_up_off_alt774

chat_bubble_outline12

repeat131

shareShare

clem 🤗

@clementdelangue

3 months ago

Text to models!

thumb_up_off_alt415

chat_bubble_outline15

repeat53

shareShare

Rujikorn Charakorn (Tan)

@tan51616

3 months ago

Very excited to share this work. Here's an example of how to interact with T2L through a webui provided in the repo github.com/sakanaai/text-… More results in this 🧵

thumb_up_off_alt47

chat_bubble_outline6

repeat20

shareShare

Sakana AI

@sakanaailabs

3 months ago

Introducing ALE-Bench, ALE-Agent! Towards Automating Long-Horizon Algorithm Engineering for Hard Optimization Problems Blog: sakana.ai/ale-bench/ Paper: arxiv.org/abs/2506.09050 ALE-Bench is a coding benchmark primarily focused on hard optimization (NP-hard) problems. We

thumb_up_off_alt184

chat_bubble_outline0

repeat53

shareShare

hardmaru

@hardmaru

3 months ago

Sakana AI developed a new coding agent, ALE-Agent, trained to solve NP-hard optimization problems. Our agent participated in a live coding competition, the challenging AtCoder Heuristic Contest, and ranked #21 out of 1,000 human participants! Learn more: sakana.ai/ale-bench/

thumb_up_off_alt364

chat_bubble_outline10

repeat66

shareShare

Takuya Akiba

@iwiwi

3 months ago

AI will soon master Codeforces. So, what's the next challenge? 🚀Introducing ALE-Bench (ALgorithm Engineering Benchmark) 🏆 A new frontier benchmark for algorithmic coding, designed to test long-horizon reasoning on complex problems through trial and error. 🤖What is ALE-Bench?

thumb_up_off_alt53

chat_bubble_outline0

repeat20

shareShare

Johannes Oswald

@oswaldjoh

3 months ago

Super happy and proud to share our novel scalable RNN model - the MesaNet! This work builds upon beautiful ideas of 𝗹𝗼𝗰𝗮𝗹𝗹𝘆 𝗼𝗽𝘁𝗶𝗺𝗮𝗹 𝘁𝗲𝘀𝘁-𝘁𝗶𝗺𝗲 𝘁𝗿𝗮𝗶𝗻𝗶𝗻𝗴 (TTT), and combines ideas of in-context learning, test-time training and mesa-optimization.

thumb_up_off_alt377

chat_bubble_outline3

repeat63

shareShare

Robert Lange

@roberttlange

2 months ago

How do different coding agents perform file edits? 📝 Great blog by Fabian Hertwig 🧑‍💻: fabianhertwig.com/blog/coding-as… Different instruction tuning protocols imply that there is no clear winning approach. Most agents are either model-specific (Codex/Claude Code) or deploy robust

How do different coding agents perform file edits? 📝

Great blog by <a href="/FabianHertwig/">Fabian Hertwig</a> 🧑‍💻: fabianhertwig.com/blog/coding-as…

Different instruction tuning protocols imply that there is no clear winning approach. Most agents are either model-specific (Codex/Claude Code) or deploy robust

thumb_up_off_alt24

chat_bubble_outline2

repeat6

shareShare

Sakana AI

@sakanaailabs

2 months ago

Introducing Reinforcement-Learned Teachers (RLTs): Transforming how we teach LLMs to reason with reinforcement learning (RL). Blog: sakana.ai/rlt Paper: arxiv.org/abs/2506.08388 Traditional RL focuses on “learning to solve” challenging problems with expensive LLMs and

thumb_up_off_alt947

chat_bubble_outline21

repeat221

shareShare

hardmaru

@hardmaru

2 months ago

Reinforcement Learning Teachers of Test Time Scaling In this new paper, we introduce a new way to teach LLMs how to reason by learning to teach, not solve! The core idea: A teacher model is trained via RL to generate explanations from question-answer pairs, optimized to improve

thumb_up_off_alt667

chat_bubble_outline20

repeat100

shareShare

Shengran Hu

@shengranhu

2 months ago

Very excited to share our latest work on Automated Design of Agentic Systems (ADAS) and Darwin Gödel Machine (DGM) with 机器之心 JIQIZHIXIN! Watch the interview: bilibili.com/video/BV1vJK8z… (in Chinese)

Very excited to share our latest work on Automated Design of Agentic Systems (ADAS) and Darwin Gödel Machine (DGM) with <a href="/Synced_Global/">机器之心 JIQIZHIXIN</a>! Watch the interview: bilibili.com/video/BV1vJK8z… (in Chinese)

thumb_up_off_alt21

chat_bubble_outline2

repeat7

shareShare