Sam Klein📚 in SF (@metasj)'s Twitter Profile
Sam Klein📚 in SF

@metasj

Public AI • Transluce • Wikipedia • KFG ⁋
Structure & interpretation of layered knowledge 📚
metasj@bsky • UTTR∅ §
#🍯🐝💉🧡

ID: 75123

Link: http://blogs.law.harvard.edu/sj
Joined: 17-12-2006 06:16:00

14.1K Tweets

5.5K Followers

2.2K Following

Alex Zhang (@a1zhang):

RLMs are meant to address context rot, which is that weird effect when you have a long Claude Code or Cursor instance where it can’t properly handle your long history.

OOLONG is a challenging new long context benchmark where models answer queries over an extremely dense context.
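
The tweet doesn't spell out the RLM mechanics, but the core idea is recursive decomposition: rather than stuffing the whole history into one prompt, the model answers over manageable chunks and then recurses over the intermediate answers. A minimal sketch of that idea, assuming a hypothetical `llm(prompt)` completion helper (not any specific API, and not the paper's actual procedure):

```python
# Minimal sketch of recursive querying over a long context.
# `llm` is a hypothetical single-call completion function; chunk size
# and prompt wording are illustrative, not the RLM paper's method.

def llm(prompt: str) -> str:
    raise NotImplementedError("plug in your model API here")

def recursive_query(context: str, question: str, chunk_chars: int = 8000) -> str:
    # Base case: the context fits in one call.
    if len(context) <= chunk_chars:
        return llm(f"Context:\n{context}\n\nQuestion: {question}\nAnswer:")

    # Recursive case: answer per chunk, then recurse over the partial answers.
    chunks = [context[i:i + chunk_chars] for i in range(0, len(context), chunk_chars)]
    partials = [recursive_query(c, question, chunk_chars) for c in chunks]
    merged = "\n".join(f"- {p}" for p in partials)
    return recursive_query(merged, question, chunk_chars)
```
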
Stewart Brand (@stewartbrand):

For now, the place to find news that most people don't know about is in books, because AI search hasn't reached there yet. On the other hand, Web research offers a lot of lore that older books don't know about. Current books, like my MAINTENANCE and Brian Potter's THE ORIGINS

Jeremy Howard (@jeremyphoward):

18 months ago, Andrej Karpathy set a challenge: "Can you take my 2h13m tokenizer video and translate [into] a book chapter". We've done it! It includes prose, code & key images. It's a great way to learn this key piece of how LLMs work. fast.ai/posts/2025-10-…
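
For readers who haven't watched the video: the tokenizer it covers is byte-pair encoding (BPE), which repeatedly merges the most frequent adjacent pair of symbols into a new token. A compact sketch of the training loop (simplified relative to GPT-style tokenizers, which work on bytes with regex pre-splitting):

```python
# Toy byte-pair encoding (BPE) trainer: repeatedly merge the most
# frequent adjacent pair. Simplified: starts from characters, with no
# byte-level alphabet and no regex pre-tokenization.
from collections import Counter

def train_bpe(text: str, num_merges: int):
    seq = list(text)   # start from individual characters
    merges = []        # learned merge rules, in order
    for _ in range(num_merges):
        pairs = Counter(zip(seq, seq[1:]))
        if not pairs:
            break
        (a, b), _count = pairs.most_common(1)[0]
        merges.append((a, b))
        # Apply the merge across the whole sequence.
        out, i = [], 0
        while i < len(seq):
            if i + 1 < len(seq) and seq[i] == a and seq[i + 1] == b:
                out.append(a + b)
                i += 2
            else:
                out.append(seq[i])
                i += 1
        seq = out
    return merges, seq

merges, tokens = train_bpe("low lower lowest", num_merges=5)
print(merges)  # e.g. [('l', 'o'), ('lo', 'w'), ...]
```
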

Internet Archive (@internetarchive):

Librarians, help us celebrate 1 trillion web pages preserved by the @InternetArchive! 🌐
Use our resource guide, complete with templates, visuals & event ideas, to connect your community to the web’s history.
More ⤵️
blog.archive.org/2025/10/07/cal…

#Wayback1T #libraries #librarians
Percy Liang (@percyliang):

You spend $1B training a model A. Someone on your team leaves and launches their own model API B. You're suspicious. Was B derived (e.g., fine-tuned) from A? But you only have black-box access to B... With our paper, you can still tell with strong statistical guarantees
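
The paper's actual test isn't described in the tweet, but one generic way to frame a black-box derivation check is as a hypothesis test on output agreement: derived models tend to match their parent's exact greedy outputs far more often than independently trained models do. An illustrative sketch only, with `query_a`/`query_b` as hypothetical prompt-to-text calls:

```python
# Illustrative black-box dependence check -- NOT the test from the
# paper. Compare A/B agreement on probe prompts against a null
# distribution built from models known to be independent of A.

def agreement(query_x, query_y, prompts):
    # Fraction of probe prompts on which both models emit identical text.
    return sum(query_x(p) == query_y(p) for p in prompts) / len(prompts)

def derivation_pvalue(query_a, query_b, independent_models, prompts):
    observed = agreement(query_a, query_b, prompts)
    # Null distribution: A's agreement with known-unrelated models.
    null = [agreement(query_a, m, prompts) for m in independent_models]
    # One-sided empirical p-value with add-one smoothing.
    return (1 + sum(n >= observed for n in null)) / (1 + len(null))
```
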

Jürgen Schmidhuber (@schmidhuberai):

Our Huxley-Gödel Machine learns to rewrite its own code, estimating its own long-term self-improvement potential. It generalizes on new tasks (SWE-Bench Lite), matching the best officially checked human-engineered agents. Arxiv 2510.21614  With Wenyi Wang, Piotr Piękos,

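The tweet compresses the mechanism; as a rough illustration only, "estimating long-term self-improvement potential" can be pictured as a search over a tree of self-rewritten agent variants, where a branch's priority is how well its descendants score. A generic sketch under that assumption (`rewrite` and `evaluate` are hypothetical stand-ins, and this is not the paper's actual algorithm):

```python
# Generic self-improvement search over an agent's own code, loosely in
# the spirit of the tweet. Expansion priority = mean benchmark score of
# a node's descendants, a crude stand-in for "long-term potential".
# NOT the Huxley-Gödel Machine's actual algorithm.

class Node:
    def __init__(self, code, parent=None):
        self.code, self.parent = code, parent
        self.scores = []  # scores of this node and all its descendants

def record(node, score):
    # Propagate a descendant's score up to every ancestor.
    while node is not None:
        node.scores.append(score)
        node = node.parent

def potential(node):
    # Estimated long-term potential: mean descendant score so far.
    return sum(node.scores) / len(node.scores) if node.scores else 0.0

def search(root_code, rewrite, evaluate, steps=50):
    """`rewrite(code) -> code` is a hypothetical LLM self-edit;
    `evaluate(code) -> float` runs the agent on held-out tasks."""
    root = Node(root_code)
    record(root, evaluate(root_code))
    nodes = [root]
    for _ in range(steps):
        parent = max(nodes, key=potential)        # expand most promising branch
        child = Node(rewrite(parent.code), parent)
        record(child, evaluate(child.code))
        nodes.append(child)
    # Return the single best variant found (scores[0] is a node's own score).
    return max(nodes, key=lambda n: n.scores[0])
```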
Rishabh Agarwal (@agarwl_):

Very nice blog post from Thinky (Kevin Lu et al) about on-policy distillation for LLMs -- we published this idea back in 2023 and it is *publicly* known to be successfully applied to Gemma 2 & 3, and Qwen3-Thinking (and probably many closed frontier models)! The idea behind
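
The tweet cuts off, but on-policy distillation itself is public: the student generates its own samples, and the teacher's per-token log-probs supervise those samples (a reverse-KL-style objective), instead of training on teacher-generated text. A schematic sketch, with `student`/`teacher` as hypothetical wrappers whose method names are illustrative:

```python
# Schematic on-policy distillation step: the student is trained on its
# OWN samples, scored token-by-token by the teacher. `student` and
# `teacher` are hypothetical model wrappers, not a real library API.

def on_policy_distill_step(student, teacher, prompts, optimizer):
    # 1) Sample completions from the current student policy.
    completions = [student.sample(p) for p in prompts]

    loss = 0.0
    for p, c in zip(prompts, completions):
        # 2) Per-token log-probs of the sampled tokens under both models.
        lp_student = student.token_logprobs(p, c)
        lp_teacher = teacher.token_logprobs(p, c)

        # 3) Reverse-KL-style objective on the student's own samples:
        #    E_{x ~ student}[log p_student(x) - log p_teacher(x)].
        loss += sum(ls - lt for ls, lt in zip(lp_student, lp_teacher))

    loss = loss / len(prompts)
    optimizer.zero_grad()
    loss.backward()  # assumes the student log-probs carry gradients
    optimizer.step()
    return float(loss)
```
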

Deb Raji (@rajiinio):

Even before MMitchell recently raised this discussion, I've had conversation after conversation with students & new grads struggling with this exact dilemma.

I want to help! Here's a live thread of AI-related opportunities for those looking to do good & make (enough) money:
Fazl Barez (@fazlbarez):

New paper: 🧭 Introducing VAL-Bench: Measuring Value Alignment in Language Models.

A benchmark that measures the consistency in language model expression of human values when prompted to justify opposing positions on real-life issues.

Work with Aman Gupta and Denny O'Shea!
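
The exact VAL-Bench metric isn't in the tweet; the description suggests a paired-prompt protocol: ask the model to argue each side of an issue, then score whether the values it appeals to stay consistent. A hypothetical harness in that spirit (the `judge` function and prompt templates are placeholders, not the benchmark's actual components):

```python
# Hypothetical paired-prompt consistency harness matching the tweet's
# description; prompts, judge, and scoring are placeholders, not
# VAL-Bench's actual protocol.

def value_consistency(llm, judge, issues):
    """Fraction of issues where the model's expressed values agree
    across the pro and con framings, as rated by `judge`."""
    consistent = 0
    for issue in issues:
        pro = llm(f"Write a justification in favor of: {issue}")
        con = llm(f"Write a justification against: {issue}")
        # `judge` returns True if the two responses appeal to a
        # compatible underlying value position.
        consistent += judge(issue, pro, con)
    return consistent / len(issues)
```
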
Andrew White 🐦‍⬛ (@andrewwhite01):

After two years of work, we’ve made an AI Scientist that runs for days and makes genuine discoveries. Working with external collaborators, we report seven externally validated discoveries across multiple fields. It is available right now for anyone to use. 1/5

Rota (@pli_cachete):

From Terry Tao on Mathstodon:

“A new paper with Bogdan Georgiev, Javier Gomez-Serrano, and Adam Zsolt Wagner: "Mathematical exploration and discovery at scale" arxiv.org/abs/2511.02864, in which we record our experiments using the LLM-powered optimization tool #AlphaEvolve to
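
The quote cuts off, but AlphaEvolve is publicly described as an evolutionary coding agent: an LLM proposes program variants and an automated scorer keeps the fittest. A generic sketch of that loop only, with `propose_variant` standing in for the LLM call (not AlphaEvolve's actual implementation):

```python
# Generic LLM-in-the-loop evolutionary search, in the spirit of
# "LLM-powered optimization" -- not AlphaEvolve's actual algorithm.
import heapq
import random

def evolve(seed_program: str, propose_variant, score, generations=100, pop=8):
    """`propose_variant(program) -> program` is a hypothetical LLM call
    returning a mutated candidate; `score(program) -> float` is the
    problem-specific fitness function (higher is better)."""
    population = [(score(seed_program), seed_program)]
    for _ in range(generations):
        # Mutate a randomly chosen survivor.
        _, parent = random.choice(population)
        child = propose_variant(parent)
        population.append((score(child), child))
        # Keep only the top `pop` candidates by fitness.
        population = heapq.nlargest(pop, population, key=lambda t: t[0])
    return max(population, key=lambda t: t[0])
```
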
Rand Hindi (@randhindi):

One of the few podcasts where I talk about longevity. I share some of my findings, including an experiment I did where I purposefully gained and lost 70 lbs to prove a point 😅