Abdulkadir Gokce (@akgokce0)'s Twitter Profile
Abdulkadir Gokce

@akgokce0

IC SC @EPFL_en @ICepfl | EE&Math @unibogazici_en


Joined: 01-09-2020 07:26:44

12 Tweets

49 Followers

65 Following

Badr AlKhamissi (@bkhmsi)

🚨 New Paper!!

How can we train LLMs using 100M words? In our babyLM paper, we introduce a new self-synthesis training recipe to tackle this question! 🍼💻

This was a fun project co-led by me, Yingtian Tang, Abdulkadir Gokce, w/ Hannes Mehrer & Martin Schrimpf

🧵⬇️
Badr AlKhamissi (@bkhmsi)

🚨 New Paper!

Can neuroscience localizers uncover brain-like functional specializations in LLMs? 🧠🤖

Yes! We analyzed 18 LLMs and found units mirroring the brain's language, theory of mind, and multiple demand networks!

w/ Greta Tuckute, Antoine Bosselut, & Martin Schrimpf

🧵👇
Badr AlKhamissi (@bkhmsi)

🚨 New Preprint!!

LLMs trained on next-word prediction (NWP) show high alignment with brain recordings. But what drives this alignment—linguistic structure or world knowledge? And how does this alignment evolve during training? Our new paper explores these questions. 👇🧵
Ben Lonnqvist (@lonnqvistben)

AI vision is insanely good nowadays—but is it really like human vision or something else entirely? In our new pre-print, we pinpoint a fundamental visual mechanism that's trivial for humans yet causes most models to fail spectacularly. Let's dive in👇🧠
[arxiv.org/abs/2504.05253]
Badr AlKhamissi (@bkhmsi)

🚨 New Preprint!!

Thrilled to share with you our latest work: “Mixture of Cognitive Reasoners”, a modular transformer architecture inspired by the brain’s functional networks: language, logic, social reasoning, and world knowledge.

1/ 🧵👇
Yingtian Tang (@yingtian80536)

🧠 NEW PREPRINT Many-Two-One: Diverse Representations Across Visual Pathways Emerge from A Single Objective biorxiv.org/content/10.110…

Hannes Mehrer (@hannesmehrer)

🧠 New preprint: we show that model-guided microstimulation can steer monkey visual behavior. Paper: arxiv.org/abs/2510.03684 🧵