Robert Lange (@roberttlange) 's Twitter Profile
Robert Lange

@roberttlange

Founding Research Scientist @SakanaAILabs
๐Ÿ’ฌ Agentic Discovery ๐Ÿ”ฌ AI Scientist ๐Ÿงฌ EvoLLM
๐Ÿ‹๏ธ gymnax ๐ŸฆŽ evosax ๐Ÿคน MLE-Infra
Ex: SR @Google DM. DeepMind Intern

ID: 856432904108421120

linkhttp://roberttlange.com calendar_today24-04-2017 09:01:30

571 Tweet

8,8K Followers

572 Following

hardmaru (@hardmaru) 's Twitter Profile Photo

Proud to announce Sakana AIโ€™s partnership with Hokkoku Bank, based in the Ishikawa Prefecture of Japan. Sakana AI will deliver bank-specific AI-powered tools to Hokkoku Bank. We aim for this partnership to serve as a model case for other regional banks in Japan.

Sakana AI (@sakanaailabs) 's Twitter Profile Photo

Weโ€™re excited to introduce Text-to-LoRA: a Hypernetwork that generates task-specific LLM adapters (LoRAs) based on a text description of the task. Catch our presentation at #ICML2025! Paper: arxiv.org/abs/2506.06105 Code: github.com/SakanaAI/Text-โ€ฆ Biological systems are capable of

hardmaru (@hardmaru) 's Twitter Profile Photo

Text-to-LoRA: Instant Transformer Adaption arxiv.org/abs/2506.06105 Generative models can produce text, images, video. They should also be able to generate models! Here, we trained a Hypernetwork to generate new task-specific LoRAs by simply describing the task as a text prompt.

Rujikorn Charakorn (Tan) (@tan51616) 's Twitter Profile Photo

Very excited to share this work. Here's an example of how to interact with T2L through a webui provided in the repo github.com/sakanaai/text-โ€ฆ More results in this ๐Ÿงต

Sakana AI (@sakanaailabs) 's Twitter Profile Photo

Introducing ALE-Bench, ALE-Agent! Towards Automating Long-Horizon Algorithm Engineering for Hard Optimization Problems Blog: sakana.ai/ale-bench/ Paper: arxiv.org/abs/2506.09050 ALE-Bench is a coding benchmark primarily focused on hard optimization (NP-hard) problems. We

hardmaru (@hardmaru) 's Twitter Profile Photo

Sakana AI developed a new coding agent, ALE-Agent, trained to solve NP-hard optimization problems. Our agent participated in a live coding competition, the challenging AtCoder Heuristic Contest, and ranked #21 out of 1,000 human participants! Learn more: sakana.ai/ale-bench/

Takuya Akiba (@iwiwi) 's Twitter Profile Photo

AI will soon master Codeforces. So, what's the next challenge? ๐Ÿš€Introducing ALE-Bench (ALgorithm Engineering Benchmark) ๐Ÿ† A new frontier benchmark for algorithmic coding, designed to test long-horizon reasoning on complex problems through trial and error. ๐Ÿค–What is ALE-Bench?

Johannes Oswald (@oswaldjoh) 's Twitter Profile Photo

Super happy and proud to share our novel scalable RNN model - the MesaNet! This work builds upon beautiful ideas of ๐—น๐—ผ๐—ฐ๐—ฎ๐—น๐—น๐˜† ๐—ผ๐—ฝ๐˜๐—ถ๐—บ๐—ฎ๐—น ๐˜๐—ฒ๐˜€๐˜-๐˜๐—ถ๐—บ๐—ฒ ๐˜๐—ฟ๐—ฎ๐—ถ๐—ป๐—ถ๐—ป๐—ด (TTT), and combines ideas of in-context learning, test-time training and mesa-optimization.

Super happy and proud to share our novel scalable RNN model - the MesaNet! 

This work builds upon beautiful ideas of ๐—น๐—ผ๐—ฐ๐—ฎ๐—น๐—น๐˜† ๐—ผ๐—ฝ๐˜๐—ถ๐—บ๐—ฎ๐—น ๐˜๐—ฒ๐˜€๐˜-๐˜๐—ถ๐—บ๐—ฒ ๐˜๐—ฟ๐—ฎ๐—ถ๐—ป๐—ถ๐—ป๐—ด (TTT), and combines ideas of in-context learning, test-time training and mesa-optimization.
Robert Lange (@roberttlange) 's Twitter Profile Photo

How do different coding agents perform file edits? ๐Ÿ“ Great blog by Fabian Hertwig ๐Ÿง‘โ€๐Ÿ’ป: fabianhertwig.com/blog/coding-asโ€ฆ Different instruction tuning protocols imply that there is no clear winning approach. Most agents are either model-specific (Codex/Claude Code) or deploy robust

How do different coding agents perform file edits? ๐Ÿ“

Great blog by <a href="/FabianHertwig/">Fabian Hertwig</a> ๐Ÿง‘โ€๐Ÿ’ป: fabianhertwig.com/blog/coding-asโ€ฆ

Different instruction tuning protocols imply that there is no clear winning approach. Most agents are either model-specific (Codex/Claude Code) or deploy robust
Sakana AI (@sakanaailabs) 's Twitter Profile Photo

Introducing Reinforcement-Learned Teachers (RLTs): Transforming how we teach LLMs to reason with reinforcement learning (RL). Blog: sakana.ai/rlt Paper: arxiv.org/abs/2506.08388 Traditional RL focuses on โ€œlearning to solveโ€ challenging problems with expensive LLMs and

hardmaru (@hardmaru) 's Twitter Profile Photo

Reinforcement Learning Teachers of Test Time Scaling In this new paper, we introduce a new way to teach LLMs how to reason by learning to teach, not solve! The core idea: A teacher model is trained via RL to generate explanations from question-answer pairs, optimized to improve

Reinforcement Learning Teachers of Test Time Scaling

In this new paper, we introduce a new way to teach LLMs how to reason by learning to teach, not solve!

The core idea: A teacher model is trained via RL to generate explanations from question-answer pairs, optimized to improve
Shengran Hu (@shengranhu) 's Twitter Profile Photo

Very excited to share our latest work on Automated Design of Agentic Systems (ADAS) and Darwin Gรถdel Machine (DGM) with ๆœบๅ™จไน‹ๅฟƒ JIQIZHIXIN! Watch the interview: bilibili.com/video/BV1vJK8zโ€ฆ (in Chinese)

Very excited to share our latest work on Automated Design of Agentic Systems (ADAS) and Darwin Gรถdel Machine (DGM) with <a href="/Synced_Global/">ๆœบๅ™จไน‹ๅฟƒ JIQIZHIXIN</a>! Watch the interview: bilibili.com/video/BV1vJK8zโ€ฆ (in Chinese)