Nathan Brown (@oxxotweets)'s Twitter Profile
Nathan Brown

@oxxotweets

SWE @Microsoft AI; multilingual LLMs and other shenanigans; Masters grad @ Clemson; Probably staring at wandb logs; DMs open

ID: 1408520605671071745

Link: https://oxxocodes.github.io/
Joined: 25-06-2021 20:21:08

521 Tweets

83 Followers

638 Following

Anthropic (@anthropicai)'s Twitter Profile Photo

Our interpretability team recently released research that traced the thoughts of a large language model. Now we're open-sourcing the method. Researchers can generate "attribution graphs" like those in our study, and explore them interactively.
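
A toy sketch of the underlying idea (not Anthropic's released tooling): score how strongly each unit in one layer feeds each unit in the next via activation-times-weight contributions, then keep the strongest edges as a graph.

```python
# Toy illustration of an attribution-graph-style computation on a tiny MLP:
# the direct contribution of hidden unit i to output unit j is h[i] * W2[i, j].
# This sketches the general concept only, not Anthropic's open-sourced method.
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=4)        # input activations
W1 = rng.normal(size=(4, 5))  # layer-1 weights
W2 = rng.normal(size=(5, 3))  # layer-2 weights

h = np.maximum(x @ W1, 0)     # hidden activations (ReLU)
contrib = h[:, None] * W2     # (5, 3) direct contributions

# Keep only strong edges; these form the "graph" to explore.
edges = [(i, j, contrib[i, j])
         for i in range(5) for j in range(3)
         if abs(contrib[i, j]) > 0.5]
for i, j, w in sorted(edges, key=lambda e: -abs(e[2])):
    print(f"hidden_{i} -> output_{j}: {w:+.2f}")
```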

Nathan Brown (@oxxotweets)'s Twitter Profile Photo

Feel like I haven't seen much work on incorporating existing vision models into VLMs. Seems a bit naive, but I'd imagine incorporating embeddings from depth estimation / segmentation / edge detection models would help remedy the "text first, vision second" issue present in VLMs.
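
A minimal sketch of what the tweet proposes, assuming hypothetical encoders that share a patch grid: concatenate frozen depth/segmentation features with the usual CLIP-style patch embeddings before projecting into the LLM's token space. Every name here is a placeholder, not a specific VLM's API.

```python
# Hypothetical fusion projector: CLIP patch features + frozen auxiliary
# features (depth, segmentation) are concatenated per patch, then projected
# into the LLM embedding dimension. Dimensions are illustrative.
import torch
import torch.nn as nn

class FusedVisionProjector(nn.Module):
    def __init__(self, clip_dim=1024, depth_dim=256, seg_dim=256, llm_dim=4096):
        super().__init__()
        self.proj = nn.Sequential(
            nn.Linear(clip_dim + depth_dim + seg_dim, llm_dim),
            nn.GELU(),
            nn.Linear(llm_dim, llm_dim),
        )

    def forward(self, clip_feats, depth_feats, seg_feats):
        # each input: (batch, num_patches, dim)
        fused = torch.cat([clip_feats, depth_feats, seg_feats], dim=-1)
        return self.proj(fused)  # (batch, num_patches, llm_dim)

proj = FusedVisionProjector()
out = proj(torch.randn(1, 576, 1024), torch.randn(1, 576, 256), torch.randn(1, 576, 256))
print(out.shape)  # torch.Size([1, 576, 4096])
```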

Sander Land (@magikarp_tokens)'s Twitter Profile Photo

🔍 UTF-8 was never meant for language models.
Yet every major tokenizer still uses it, creating unfair "byte premiums".
Why should your native script cost more to tokenize? It's time for a change. 🧵👇
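
The premium is easy to measure: the same short greeting costs one byte per character in ASCII but two to four in other scripts, so byte-level tokenizers see far longer sequences for non-Latin text.

```python
# Measure the "byte premium": UTF-8 bytes per character across scripts.
samples = {
    "English": "hello",
    "Russian": "привет",
    "Hindi": "नमस्ते",
    "Thai": "สวัสดี",
}
for lang, word in samples.items():
    n_bytes = len(word.encode("utf-8"))
    print(f"{lang:8s} {len(word)} chars -> {n_bytes} UTF-8 bytes")
```
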
Qingxiu Dong (@qx_dong)'s Twitter Profile Photo

โฐ We introduce Reinforcement Pre-Training (RPT๐Ÿ’) โ€” reframing next-token prediction as a reasoning task using RLVR โœ… General-purpose reasoning ๐Ÿ“‘ Scalable RL on web corpus ๐Ÿ“ˆ Stronger pre-training + RLVR results ๐Ÿš€ Allow allocate more compute on specific tokens

โฐ We introduce Reinforcement Pre-Training (RPT๐Ÿ’)  

 โ€” reframing next-token prediction as a reasoning task using RLVR  

โœ… General-purpose reasoning 
๐Ÿ“‘ Scalable RL on web corpus
๐Ÿ“ˆ Stronger pre-training + RLVR results
๐Ÿš€ Allow allocate more compute on specific tokens
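
A minimal sketch of the reward design the tweet describes, under the assumption that RPT scores a rollout by whether its final next-token guess matches the corpus (the function and token ids below are illustrative, not the paper's code):

```python
# Verifiable reward for next-token prediction: the corpus itself supplies
# the ground truth, so no learned reward model is needed.
def rpt_reward(predicted_token_id: int, ground_truth_token_id: int) -> float:
    return 1.0 if predicted_token_id == ground_truth_token_id else 0.0

prefix_ids = [464, 3290, 318]  # hypothetical corpus prefix
ground_truth = 257             # the actual next token in the corpus
model_guess = 257              # the token the policy commits to after reasoning
print(rpt_reward(model_guess, ground_truth))  # 1.0
```
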
ARC Prize (@arcprize)'s Twitter Profile Photo

After the o3 price reduction, we retested the o3-2025-04-16 model on ARC-AGI to determine whether its performance had changed. We compared the retest results with the original results and observed no difference in performance.

Phillip Isola (@phillip_isola)'s Twitter Profile Photo

Our computer vision textbook is now available for free online here: visionbook.mit.edu We are working on adding some interactive components like search and (beta) integration with LLMs. Hope this is useful and feel free to submit GitHub issues to help us improve the text!

Nathan Brown (@oxxotweets)'s Twitter Profile Photo

Really like the approach of treating multitude training as a series of database transactions + rollbacks. Makes intuitive sense; surprised I haven't seen this discussed elsewhere in the OSS ML space yet.
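
A rough sketch of that framing, assuming nothing beyond the analogy itself: snapshot state before a stage (BEGIN), validate after, and either keep the result (COMMIT) or restore the snapshot (ROLLBACK).

```python
# Illustrative "transactional" training stage: deepcopy stands in for a real
# checkpoint; validate stands in for an eval-set regression check.
import copy

def transactional_stage(state, train_stage, validate):
    snapshot = copy.deepcopy(state)  # BEGIN
    new_state = train_stage(state)
    return new_state if validate(new_state) else snapshot  # COMMIT / ROLLBACK

state = {"loss": 2.0}
state = transactional_stage(
    state,
    train_stage=lambda s: {"loss": s["loss"] * 0.9},
    validate=lambda s: s["loss"] < 2.0,
)
print(state)  # {'loss': 1.8}
```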

jack morris (@jxmnop)'s Twitter Profile Photo

In the beginning, there was BERT.

Eventually BERT gave rise to RoBERTa.  Then, DeBERTa.  Later, ModernBERT.

And now, NeoBERT.  The new state-of-the-art small-sized encoder:
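
A hedged usage sketch via the standard transformers API; the Hub checkpoint id and the trust_remote_code requirement are assumptions here, so check the model card:

```python
from transformers import AutoModel, AutoTokenizer

name = "chandar-lab/NeoBERT"  # assumed checkpoint id
tokenizer = AutoTokenizer.from_pretrained(name, trust_remote_code=True)
model = AutoModel.from_pretrained(name, trust_remote_code=True)

inputs = tokenizer("Encoders never went away.", return_tensors="pt")
hidden = model(**inputs).last_hidden_state  # (1, seq_len, hidden_dim)
print(hidden.shape)
```
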
Nathan Brown (@oxxotweets)'s Twitter Profile Photo

After providing a YouTube URL to Google AI Studio w/ Gemini 2.5 Pro, I'm seeing significant spikes in GPU utilization while I wait on a model response (the usage drop is when the window is no longer active). Anyone familiar with this? Unsure why any of this would be client-side.
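
One way to capture the spikes described above, assuming an NVIDIA GPU with nvidia-smi on PATH: poll utilization while toggling the tab active/inactive.

```python
# Log GPU utilization once per second via nvidia-smi.
import subprocess, time

def gpu_util() -> int:
    out = subprocess.run(
        ["nvidia-smi", "--query-gpu=utilization.gpu",
         "--format=csv,noheader,nounits"],
        capture_output=True, text=True, check=True,
    )
    return int(out.stdout.strip().splitlines()[0])

for _ in range(10):
    print(f"{time.strftime('%H:%M:%S')}  GPU util: {gpu_util()}%")
    time.sleep(1)
```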

Owain Evans (@owainevans_uk)'s Twitter Profile Photo

New paper & surprising result.
LLMs transmit traits to other models via hidden signals in data.
Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵
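
A sketch of the recipe as the thread describes it, with teacher_generate standing in for a real trait-conditioned LLM call (everything below is illustrative):

```python
# Teacher with a hidden trait emits number sequences; a strict filter keeps
# only pure 3-digit-number samples, yet a student fine-tuned on them can
# still inherit the trait.
import random, re

def teacher_generate(prompt: str) -> str:
    # Stand-in for sampling from a trait-conditioned teacher model.
    return ", ".join(str(random.randint(100, 999)) for _ in range(10))

def numbers_only(text: str) -> bool:
    return re.fullmatch(r"\d{3}(, \d{3})*", text) is not None

dataset = [s for s in (teacher_generate("Continue the sequence:")
                       for _ in range(1000)) if numbers_only(s)]
print(len(dataset), "filtered examples")  # student is then fine-tuned on these
```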