Chandan Singh (@csinva) 's Twitter Profile
Chandan Singh

@csinva

Seeking good explanations with machine learning. Senior researcher @MSFTResearch, PhD from @Berkeley_AI

ID: 959537146414620673

Website: https://csinva.io/ · Joined: 02-02-2018 21:20:57

342 Tweets

1.1K Followers

523 Following

Katie Kang (@katie_kang_) 's Twitter Profile Photo

LLMs excel at fitting finetuning data, but are they learning to reason or just parroting🦜?

We found a way to probe a model's learning process to reveal *how* each example is learned. This lets us predict model generalization using only training data, amongst other insights: 🧵
Colin Fraser (@colin_fraser) 's Twitter Profile Photo

I'm really fascinated by this dataset from the AI poetry survey paper. Here's another visualization I just made. Survey respondents were shown one of these 10 poems, and either told that they were authored by AI, human, or not told anything.

elvis (@omarsar0) 's Twitter Profile Photo

LLMs surpass human experts in predicting neuroscience results

Scientific discovery is the next big goal for AI. We are seeing a huge number of research studies tackling AI-powered scientific discovery from different angles and for different problems.

This new paper published in…
Chandan Singh (@csinva) 's Twitter Profile Photo

I’ll be at NeurIPS this week presenting our work on interpretable embeddings — drop me a message if you want to chat!

Zhou Xian (@zhou_xian_) 's Twitter Profile Photo

Everything you love about generative models — now powered by real physics! Announcing the Genesis project — after a 24-month large-scale research collaboration involving over 20 research labs — a generative physics engine able to generate 4D dynamical worlds powered by a physics…

Surya Ganguli (@suryaganguli) 's Twitter Profile Photo

Absolutely. In any hypothesis test between A and B about the working of a complex system, the right answer is invariably none of the above. System identification is a much better paradigm for neuroscience discovery; it allows us to efficiently explore huge hypothesis spaces.

Jonas Pfeiffer (@pfeiffjo) 's Twitter Profile Photo

🧠💡 Our LLMs just had a ‘memory augmentation’—now they can deliberate like seasoned thinkers!

arxiv.org/abs/2412.17747
Marlene Cohen (@marlenecohen) 's Twitter Profile Photo

New results for a new year! “Linking neural population formatting to function” describes our modern take on an old question: how can we understand the contribution of a brain area to behavior? biorxiv.org/content/10.110… 🧵1/

Frank Hutter (@frankrhutter) 's Twitter Profile Photo

The data science revolution is getting closer. TabPFN v2 is published in Nature: nature.com/articles/s4158…

On tabular classification with up to 10k data points & 500 features, in 2.8s TabPFN on average outperforms all other methods, even when tuning them for up to 4 hours 🧵1/19
Yi Ma (@yimatweets) 's Twitter Profile Photo

arxiv.org/abs/2502.10385 This is our latest work, SimDINO, which, again based on the coding-rate principle, significantly simplifies the popular (but unnecessarily sophisticated) visual self-supervised learning methods DINO and DINOv2. The power of understanding and principles is…

Jianwei Yang (@jw2yang4ai) 's Twitter Profile Photo

Thanks for featuring our work, Aran Komatsuzaki! 🔥Today we are thrilled to announce our MSR flagship project Magma! This is a fully open-sourced project. We will roll out all the stuff: code, model, and training data over the following days. Check out our full work here:

Berkeley AI Research (@berkeley_ai) 's Twitter Profile Photo

Humans just saw a *new* color—literally outside the known visual spectrum. BAIR faculty and visual computing expert Ren Ng and collaborators made it possible with the Oz Vision System. 🌈👁️ Newly published in Science Advances: science.org/doi/10.1126/sc… popsci.com/health/new-col…

Yufan Zhuang (@yufan_zhuang) 's Twitter Profile Photo

🤯Your LLM just threw away 99.9% of what it knows.

Standard decoding samples one token at a time and discards the rest of the probability mass. 

Mixture of Inputs (MoI) rescues that lost information, feeding it back for more nuanced expressions.

It is a brand new…
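The decoding idea in this tweet can be sketched in a few lines: instead of feeding back only the sampled token's embedding at the next step, blend it with the probability-weighted mixture of all token embeddings, so the discarded mass still influences the model. This is a toy NumPy sketch of that blending step only; the function name and the `beta` mixing weight are my own illustrative assumptions, not the paper's actual API or formulation.

```python
import numpy as np

def moi_input(probs, embedding_table, sampled_id, beta=0.5):
    """Toy sketch of a Mixture-of-Inputs-style blending step.

    probs:           output distribution over the vocabulary at this step
    embedding_table: (vocab_size, dim) token embedding matrix
    sampled_id:      the token actually sampled (what standard decoding feeds back)
    beta:            hypothetical mixing weight (an assumption, not from the paper)
    """
    expected = probs @ embedding_table     # probability-weighted mixture of all embeddings
    sampled = embedding_table[sampled_id]  # the usual one-hot input embedding
    return beta * sampled + (1.0 - beta) * expected

# Tiny example: 3-token vocabulary with identity embeddings.
probs = np.array([0.7, 0.2, 0.1])
mixed = moi_input(probs, np.eye(3), sampled_id=0)
```

With identity embeddings the result is just the blend of the one-hot sampled vector and the distribution itself, which makes the rescued probability mass easy to see.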
Sahil Verma (@sahil1v) 's Twitter Profile Photo

🚨 New Paper! 🚨
Guard models slow, language-specific, and modality-limited?

Meet OmniGuard, which detects harmful prompts across multiple languages & modalities using a single approach, with SOTA performance in all 3 modalities while being 120X faster 🚀

arxiv.org/abs/2505.23856
rohit (@rohitarorayyc) 's Twitter Profile Photo

We automated systematic reviews using gpt-4.1 and o3-mini!

Our platform (otto-SR) beat humans at all tasks and conducted 12 years of systematic review research in just two days.

We also show how otto-SR can be used in the real world to rapidly update clinical guidelines 🧵
Shirley Wu (@shirleyyxwu) 's Twitter Profile Photo

Even the smartest LLMs can fail at basic multiturn communication

Ask for grocery help → without asking where you live 🤦‍♀️
Ask to write articles → assumes your preferences 🤷🏻‍♀️

⭐️CollabLLM (top 1%; oral at ICML) transforms LLMs from passive responders into active collaborators.
hardmaru (@hardmaru) 's Twitter Profile Photo

Reinforcement Learning Teachers of Test Time Scaling

In this new paper, we introduce a new way to teach LLMs how to reason by learning to teach, not solve!

The core idea: A teacher model is trained via RL to generate explanations from question-answer pairs, optimized to improve…
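The training signal described above can be sketched as a simple reward: score the teacher's explanation by how much it raises a student model's likelihood of the correct answer, compared to the student seeing no explanation. This is a toy sketch under my own assumptions; the reward shaping, function names, and the stand-in student are illustrative, not the paper's actual setup.

```python
def teacher_reward(student_logprob, question, answer, explanation):
    """Reward an explanation by the improvement it gives the student.

    student_logprob(question, answer, context) should return the student's
    log-probability of `answer`; here it is an injected stand-in (assumption).
    """
    with_expl = student_logprob(question, answer, context=explanation)
    without = student_logprob(question, answer, context=None)
    return with_expl - without  # positive iff the explanation helped

# Toy "student": a fixed lookup standing in for a real language model,
# which assigns the answer higher log-probability when any context is given.
def toy_student(question, answer, context):
    return -0.5 if context else -2.0

reward = teacher_reward(toy_student, "2+2?", "4", "add the two operands")
```

An RL loop would then update the teacher to produce explanations that maximize this reward, so the teacher is never asked to solve the problem itself, only to explain a known answer well.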