Andrea Boscutti (@aboscutti)'s Twitter Profile
Andrea Boscutti

@aboscutti

ID: 1129888328915070976

Joined: 18-05-2019 23:15:44

45 Tweets

31 Followers

299 Following

Robbie Barrat (@videodrome):

I'm laughing so hard at this slide a friend sent me from one of Geoff Hinton's courses; "To deal with hyper-planes in a 14-dimensional space, visualize a 3-D space and say 'fourteen' to yourself very loudly. Everyone does it."

Yann LeCun (@ylecun):

Replying to cognito: Convolution is equivariant to translations. Self-attention is equivariant to permutations. They both have a role to play. Conv is efficient for signals with strong local correlations and motifs that can appear anywhere. SelfAtt is good for "object-based" representations where …
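Both equivariances in the tweet above are easy to check numerically. A minimal NumPy sketch (toy sizes; circular convolution so shifts wrap around, and a single attention head with Q = K = V and no positional encodings — all simplifying assumptions for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=16)          # 1-D signal
k = rng.normal(size=16)          # kernel (same length, for circular conv)

def circ_conv(x, k):
    # Circular convolution via FFT.
    return np.real(np.fft.ifft(np.fft.fft(x) * np.fft.fft(k)))

# Translation equivariance: shift-then-convolve == convolve-then-shift.
shift = 3
lhs = circ_conv(np.roll(x, shift), k)
rhs = np.roll(circ_conv(x, k), shift)
assert np.allclose(lhs, rhs)

def self_attention(X):
    # Single head, Q = K = V = X, no positional encodings.
    d = X.shape[-1]
    A = X @ X.T / np.sqrt(d)                  # attention scores
    A = np.exp(A - A.max(axis=-1, keepdims=True))
    A = A / A.sum(axis=-1, keepdims=True)     # softmax over keys
    return A @ X

# Permutation equivariance: permute-then-attend == attend-then-permute.
X = rng.normal(size=(5, 4))                   # 5 tokens, dim 4
perm = rng.permutation(5)
assert np.allclose(self_attention(X[perm]), self_attention(X)[perm])
```

Note that positional encodings deliberately break the permutation symmetry of self-attention, which is exactly why they are added in practice.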

JAMA Psychiatry (@jamapsych):

Can machine learning uncover the multivariate neural signature of major depressive disorder in individual patients? This large-scale study including 1,801 patients and controls finds no robust multivariate depression markers. ja.ma/3S0QgFY

Yann LeCun (@ylecun):

Meta has always tried to do the Right Thing. Meta has always practiced open research in AI. Meta has been promoting open source AI platforms. After numerous discussions over the last year (sometimes contentious) a consensus is emerging that open source AI platforms are …

Andrej Karpathy (@karpathy):

# on shortification of "learning"

There are a lot of videos on YouTube/TikTok etc. that give the appearance of education, but if you look closely they are really just entertainment. This is very convenient for everyone involved: the people watching enjoy thinking they are …

Paul Graham (@paulg):

I just moved the ChatGPT tab over to the left end of my main browser window, where I keep the tabs of things I use all the time, like GMail and Google Calendar.

Yann LeCun (@ylecun):

* Language is low bandwidth: less than 12 bytes/second. A person can read 270 words/minute, or 4.5 words/second; at 0.75 words per token and 2 bytes per token, that is 6 tokens/second, or 12 bytes/s. A modern LLM is typically trained on 1x10^13 two-byte tokens, which is 2x10^13 bytes.
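The back-of-the-envelope arithmetic in the tweet above checks out:

```python
# Reading bandwidth, using the tweet's assumptions.
words_per_minute = 270
words_per_second = words_per_minute / 60              # 4.5 words/s
words_per_token = 0.75
bytes_per_token = 2

tokens_per_second = words_per_second / words_per_token  # 6 tokens/s
bytes_per_second = tokens_per_second * bytes_per_token
print(bytes_per_second)   # 12.0 bytes/s

# Total training data for a typical modern LLM.
training_tokens = 1e13
training_bytes = training_tokens * bytes_per_token
print(training_bytes)     # 2e13 bytes
```

At 12 bytes/s, reading 2x10^13 bytes would take roughly 50,000 years of continuous reading, which is the point of the comparison.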

Cognition (@cognition_labs):

Today we're excited to introduce Devin, the first AI software engineer. Devin is the new state-of-the-art on the SWE-Bench coding benchmark, has successfully passed practical engineering interviews from leading AI companies, and has even completed real jobs on Upwork. Devin is …

Dan Roberts (@danintheory):

Do LLMs really need to be so L? That's a rejected title for a new paper w/ Andrey Gromov, Kushal Tirumala, Hassan Shapourian, Paolo Glorioso on pruning open-weight LLMs: we can remove up to *half* the layers of Llama-2 70B w/ essentially no impact on performance on QA benchmarks. 1/

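The pruning idea behind the paper can be sketched in a few lines. This is an assumption about the approach rather than the authors' code, and the hidden states below are fake: given representations collected at the input of each layer, find the contiguous block of n layers whose removal perturbs the representation least, measured by angular distance between the states entering layer l and layer l + n.

```python
import numpy as np

rng = np.random.default_rng(0)
L, T, D = 12, 8, 16   # layers, tokens, hidden dim (toy sizes)
# Fake per-layer hidden states; in practice these come from forward
# passes on real data. Cumulative sums mimic residual-stream growth.
H = np.cumsum(rng.normal(scale=0.1, size=(L + 1, T, D)), axis=0)

def angular_distance(a, b):
    # Mean angular distance between corresponding token vectors, in [0, 1].
    cos = np.sum(a * b, axis=-1) / (
        np.linalg.norm(a, axis=-1) * np.linalg.norm(b, axis=-1))
    return np.mean(np.arccos(np.clip(cos, -1.0, 1.0)) / np.pi)

def best_block_to_prune(H, n):
    # Distance between the input of layer l and the input of layer l + n:
    # small distance means layers l..l+n-1 barely change the representation.
    dists = [angular_distance(H[l], H[l + n]) for l in range(L + 1 - n)]
    return int(np.argmin(dists)), min(dists)

start, d = best_block_to_prune(H, n=4)
print(f"prune layers {start}..{start + 3} (angular distance {d:.3f})")
```

After dropping the block, a small amount of fine-tuning (the paper uses parameter-efficient methods) heals the seam between the remaining layers.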
Jayson Jeganathan (@jaysonjeg):

Do you use surface fMRI? We found spurious correlations in surface fMRI, with potentially serious implications for test-retest reliability, fingerprinting, functional parcellations and brain-behaviour associations (1/n) biorxiv.org/cgi/content/sh…

Divyansha (@divyansha1115):

Excited to share our Graph Foundation Model, 🌐 GraphFM, trained on 152 datasets with over 7.4 million nodes and 189 million edges spanning diverse domains. 🚨 Check out our preprint for GraphFM where we test how our model scales with data and model size, and show efficient …

Jonathan Gorard (@getjonwithit):

Moths are attracted to lights because of the same mathematics that underlies twistor theory and compactification in theoretical physics: projective geometry. It all starts from a simple observation: translations are just rotations whose center is located "at infinity". (1/11)

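The observation that a translation is a rotation with its center "at infinity" can be checked numerically: rotate a point about a center pushed farther and farther away, shrinking the angle so the arc length stays fixed, and the motion converges to a straight translation. A small sketch (the specific numbers are just for illustration):

```python
import numpy as np

def rotate_about(p, center, theta):
    # Rotate point p by angle theta (counterclockwise) about center.
    c, s = np.cos(theta), np.sin(theta)
    R = np.array([[c, -s], [s, c]])
    return R @ (p - center) + center

p = np.array([1.0, 2.0])
t = 3.0                               # target translation distance along x
for r in [1e2, 1e4, 1e6]:
    center = np.array([0.0, r])       # center pushed far up the y-axis
    theta = t / r                     # arc length r * theta stays equal to t
    q = rotate_about(p, center, theta)
    print(r, q)                       # approaches p + (t, 0) as r grows
```

In projective geometry this limit is exact: translations and rotations are a single family of transformations, distinguished only by whether the fixed point lies on the line at infinity.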
The Transmitter (@_thetransmitter):

With neuroscience datasets and scientific collaborations growing in size, Gaelle Chapuis and Olivier Winter explain why neuroscience needs to create a career path for software engineers. thetransmitter.org/craft-and-care…

Andy Keller (@t_andy_keller):

In the physical world, almost all information is transmitted through traveling waves -- why should it be any different in your neural network? Super excited to share recent work with the brilliant Mozes Jacobs: "Traveling Waves Integrate Spatial Information Through Time" 1/14