Andy (@viewerisland) Twitter Tweets • TwiCopy

Gate.io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

Andy

@viewerisland

3 months ago

So much folklore of injecting text like checking docs or extra think for models to improve performance. Folks, just say you are forcing a certain distribution of behavior, it ain't that deep

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Bytedance has allegedly surpassed 1 million GPUs of H100/H800 sku, putting them in the same league as Google (1-1.5M H100 level) and Microsoft (750k - 900k). Combining megascale, Verl, and joint efforts with AReaL, it is some incredibly exciting times ahead

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Andy

@viewerisland

3 months ago

With releases like MiMo it's clear the future for production AI systems will be a strong and small reasoning model paired with a strong retrieval and knowledge recommendation system.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Kasey Zhang

@_weexiao

3 months ago

We used RL to train a model for MCP! Connect any MCP client to any MCP server - you can run MCP workflows fully with local models (+ tune it further). It works with Ollama / any MCP client that supports Qwen3 models - download it below 👇1/

thumb_up_off_alt51

chat_bubble_outline11

repeat9

shareShare

Andy

@viewerisland

3 months ago

The path to ASI already exist in the data we just gotta figure out how to sample ourselves to it

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

Andy

@viewerisland

3 months ago

For people that have tried post training the MiMo RL models, have anyone noticed that this model is *incredibly* verbose? Even for simple tasks it will go on for 20k+ tokens before an answer.

thumb_up_off_alt2

chat_bubble_outline1

repeat0

shareShare

Andy

@viewerisland

2 months ago

Many frameworks have claimed to support RL for LLMs. Only one that would actually work in production and even it lacks support on standardizing things like tool invocation and reward policies. It's like selling you a fully functional car with no gas and the nearest gas station is

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Andy

@viewerisland

2 months ago

Did Anthropic just make pass@n "parallel time-time compute" or is it different

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Andy

@viewerisland

2 months ago

Mythical Waymo pull

thumb_up_off_alt9

chat_bubble_outline3

repeat0

shareShare

Kasey Zhang

@_weexiao

2 months ago

Don't use structured output mode for reasoning tasks. We’re open sourcing Osmosis-Structure-0.6B: an extremely small model that can turn any unstructured data into any format (e.g. JSON schema). Use it with any model - download and blog below!

thumb_up_off_alt2,2K

chat_bubble_outline88

repeat199

shareShare

ollama

@ollama

2 months ago

ollama run Osmosis/Osmosis-Structure-0.6B

thumb_up_off_alt629

chat_bubble_outline7

repeat75

shareShare

Andy

@viewerisland

2 months ago

Am I a boomer in the ai agent world... I don't care about your fancy abstractions show me the jinja template and your tools k? Thx

thumb_up_off_alt6

chat_bubble_outline0

repeat0

shareShare

Andy

@viewerisland

a month ago

Nvidia releases Python DSL for CuTe and within weeks there's already a 200+ WeChat group chat about this discussing use cases and findings. Recommending a blog post here: veitner.bearblog.dev/bridging-math-…

thumb_up_off_alt2

chat_bubble_outline1

repeat0

shareShare

Kasey Zhang

@_weexiao

24 days ago

It’s easy to fine-tune small models w/ RL to outperform foundation models on vertical tasks. We’re open sourcing Osmosis-Apply-1.7B: a small model that merges code (similar to Cursor’s instant apply) better than foundation models. Links to download and try out the model below!