VR (@victorrambaud1) Twitter Tweets • TwiCopy

Keyon Vafa

a year ago

New paper: How can you tell if a transformer has the right world model? We trained a transformer to predict directions for NYC taxi rides. The model was good. It could find shortest paths between new points But had it built a map of NYC? We reconstructed its map and found this:

thumb_up_off_alt3,3K

chat_bubble_outline50

repeat403

shareShare

bioRxiv Neuroscience

@biorxiv_neursci

5 months ago

Effective connectivity reveals dual-route mechanism of visual prediction precision via insula and pulvinar biorxiv.org/content/10.110… #biorxiv_neursci

thumb_up_off_alt7

chat_bubble_outline0

repeat2

shareShare

֎ Elena Semer ֎

@elenasemer

5 months ago

Why the Armenian Genocide Really Happened The Armenian Genocide was not only a mass extermination, it was a targeted attempt to destroy a people who carried ancient sacred knowledge of the universe. The Armenians meaning, “Men of the Sun” or “People of the Light” trace their

thumb_up_off_alt302

chat_bubble_outline30

repeat64

shareShare

Nicholas Fabiano, MD

@ntfabiano

4 months ago

A common belief is that cognition arises from the brain. This paper suggests that cognition is a complex multiscale information processing distributed across every single cell in the body. 🧵1/10

thumb_up_off_alt2,2K

chat_bubble_outline94

repeat441

shareShare

CLaE

@leafs_s

4 months ago

Nature Communications Double dissociation of dynamic and static face perception provides causal evidence for a third visual pathway nature.com/articles/s4146…

thumb_up_off_alt79

chat_bubble_outline5

repeat22

shareShare

bioRxiv Neuroscience

@biorxiv_neursci

4 months ago

Simulated Language Acquisition in a Biologically Realistic Model of the Brain biorxiv.org/content/10.110… #biorxiv_neursci

thumb_up_off_alt13

chat_bubble_outline0

repeat3

shareShare

CLaE

@leafs_s

4 months ago

Nature Human Behaviour Neural similarity predicts whether strangers become friends nature.com/articles/s4156…

thumb_up_off_alt256

chat_bubble_outline1

repeat54

shareShare

Raphaël Millière

@raphaelmilliere

4 months ago

The final version of this paper has now been published in open access in the Journal of Memory and Language (link below). This was a long-running but very rewarding project. Here are a few thoughts on our methodology and main findings. 1/9

thumb_up_off_alt165

chat_bubble_outline4

repeat36

shareShare

Science Magazine

@sciencemagazine

3 months ago

The hippocampus is one of the most studied brain areas because of its major role in fundamental brain functions including learning, memory, and spatial navigation. In a new #ScienceReview, researchers focus on the role of overlooked areas and circuits within the hippocampus.

thumb_up_off_alt751

chat_bubble_outline6

repeat173

shareShare

Rohan Paul

@rohanpaul_ai

2 months ago

Another fantastic AI at Meta paper. Large language models keep redoing the same work inside long chains of thought, so this paper teaches the model to compress those recurring steps into small named behaviors that it can recall later or even learn into its weights. Shows that

Another fantastic <a href="/AIatMeta/">AI at Meta</a> paper.

Large language models keep redoing the same work inside long chains of thought, so this paper teaches the model to compress those recurring steps into small named behaviors that it can recall later or even learn into its weights.

Shows that

thumb_up_off_alt532

chat_bubble_outline16

repeat87

shareShare

DailyPapers

@huggingpapers

2 months ago

A missing link between Transformers and the brain? 🧠 Dragon Hatchling (BDH) is a new LLM architecture based on a scale-free, biologically-inspired network of locally-interacting neuron particles. It rivals GPT2 performance, but is designed for interpretability.

thumb_up_off_alt544

chat_bubble_outline14

repeat68

shareShare

Jubayer Ibn Hamid

@jubayer_hamid

2 months ago

Exploration is fundamental to RL. Yet policy gradient methods often collapse: during training they fail to explore broadly, and converge into narrow, easily exploitable behaviors. The result is poor generalization, limited gains from test-time scaling, and brittleness on tasks

thumb_up_off_alt1,1K

chat_bubble_outline16

repeat135

shareShare

Rohan Paul

@rohanpaul_ai

2 months ago

New paper from Meta Superintelligence Labs (FAIR) Explains why grokking happens and shows when learning moves from memorizing to generalizing. Gives a concrete recipe to trigger grokking, with weight decay, moderate width, and a data threshold near size times log size.

thumb_up_off_alt535

chat_bubble_outline15

repeat61

shareShare

Peyman Milanfar

@docmilanfar

2 months ago

How Kernel Regression is related to Attention Mechanism - a summary in 10 slides. 0/1

thumb_up_off_alt1,1K

chat_bubble_outline13

repeat155

shareShare

Yungkingmito

@yungkingmito

2 months ago

Memory is not stored in matter, it is the matter, arranged in a way it can’t forget. Every lasting thing in the universe, from galaxies to cells, holds its past not in chemistry but in geometry, in the alignment that refuses to collapse. A skyrmion is one of those shapes. It’s

thumb_up_off_alt3,3K

chat_bubble_outline136

repeat440

shareShare

Rohan Paul

@rohanpaul_ai

2 months ago

🧠 New research finds similarities between human and AI learning. Large neural networks juggle in-context learning and in-weight learning much like humans juggle working memory and long-term memory, which ties flexibility, retention, and curriculum effects into one story.

thumb_up_off_alt168

chat_bubble_outline6

repeat30

shareShare

Yilun Du

@du_yilun

2 months ago

Introducing Geometry-aware Policy Imitation (GPI)! GPI constructs an energy landscape over the state space using demonstrations. A policy acts in the environment by following the gradient of the landscape. This enables fast multimodal policies with very fast inference (<1 ms)!

thumb_up_off_alt403

chat_bubble_outline6

repeat44

shareShare

Luiz Pessoa

@pessoabrain

2 months ago

𝗡𝗲𝘄 𝗹𝗮𝘄𝘀 𝗳𝗼𝗿 𝗲𝘃𝗼𝗹𝘃𝗶𝗻𝗴 𝘀𝘆𝘀𝘁𝗲𝗺𝘀? Thought-provoking piece on properties of systems. Including the Law of Increasing Functional Information, which applies to both biological and non-biological systems. pnas.org/doi/10.1073/pn…

thumb_up_off_alt235

chat_bubble_outline7

repeat48

shareShare

Thomas Fel

@napoolar

a month ago

🕳️🐇Into the Rabbit Hull – Part I (Part II tomorrow) An interpretability deep dive into DINOv2, one of vision’s most important foundation models. And today is Part I, buckle up, we're exploring some of its most charming features.

thumb_up_off_alt612

chat_bubble_outline10

repeat111

shareShare

Mathelirium

@mathelirium

a month ago

Wolfram Stephen Wolfram might be onto something here😁 WOLFRAM PHYSICS in motion: Two competing feeders and and a predator. Everything here is a hypergraph...including the environment. We visualize a synthetic micro-ecosystem built on computational irreducibility, hypergraph

thumb_up_off_alt901

chat_bubble_outline39

repeat131

shareShare