alyssa loo (@alyssamloo) 's Twitter Profile
alyssa loo

@alyssamloo

🇸🇬🇺🇸 | formerly cs + linguistics @Brown_NLP | language model interpretability & probing human v. machine cognition

ID: 1510813506550976515

Joined: 04-04-2022 02:56:31

39 Tweets

148 Followers

244 Following

Clara Isabel Meister (@clara__meister) 's Twitter Profile Photo

Humans' reading behaviors are different at the middle vs. the end of a sentence. Several theories have been proposed to explain what readers are doing on those final words to “wrap-up” the sentence. In our #acl2022 paper we take a look at this phenomenon! arxiv.org/abs/2203.17213

Jerry Tang (@jerryptang) 's Twitter Profile Photo

very excited to share our paper on reconstructing language from non-invasive brain recordings! we introduce a decoder that takes in fMRI recordings and generates continuous language descriptions of perceived speech, imagined speech, and possibly much more biorxiv.org/content/10.110…

Yann LeCun (@ylecun) 's Twitter Profile Photo

OK, debates about the necessity of "priors" (or lack thereof) in learning systems are pointless. Here are some basic facts that all ML theorists and most ML practitioners understand, but a number of folks-with-an-agenda don't seem to grasp. Thread. 1/

Philipp Schmitt (@philippschmitt) 's Twitter Profile Photo

New research-y project: Blueprints for Intelligence, a visual history of artificial neural networks from 1943 to 2020 philippschmitt.com/blueprints-for…

Sebastian Ruder (@seb_ruder) 's Twitter Profile Photo

My new blog post takes a look at the state of multilingual AI. 🌍 How multilingual are current models in NLP, vision, and speech? 🏛 What are the recent contributions in this area? ⛰ What challenges remain and how can we address them? ruder.io/state-of-multi…

Felix Hill (@felixhill84) 's Twitter Profile Photo

Lots of folks are talking about *emergence* in Deep Learning as if it's a new thing that happens only in large language models at scale. It's not! It has been happening for decades and in very small networks. 🧵 🧵 🧵 🧵 🧵 🧵 🧵 🧵 🧵

Charlotte Caucheteux @ICML24 (@c_caucheteux) 's Twitter Profile Photo

🔥Our work has now been accepted to NeurIPS 2022!! 'Toward a realistic model of speech processing in the brain with self-supervised learning': arxiv.org/abs/2206.01685 Let's meet in New Orleans on Tue 29 Nov 2:30pm PST (Hall J #524). A recap of the 3 main results below 👇

Jim Fan (@drjimfan) 's Twitter Profile Photo

Why does ChatGPT work so well? Is it “just scaling up GPT-3” under the hood? In this 🧵, let’s discuss the “Instruct” paradigm, its deep technical insights, and a big implication: “prompt engineering” as we know it may likely disappear soon:👇

Brown NLP (@brown_nlp) 's Twitter Profile Photo

Last year, we criticized LMs for performing “too well” with pathological prompts, and many papers have now shown similar results with corrupted ICL or CoT. In our new work, we find that *humans* also perform surprisingly well with irrelevant prompts! (But not misleading ones.) (1/5)

Yong Zheng-Xin (Yong) (@yong_zhengxin) 's Twitter Profile Photo

LLMs such as ChatGPT and BLOOMZ claim that they are multilingual, but does this mean they can generate code-mixed data? Follow this 🧵 to find out. (1/N) Paper: arxiv.org/abs/2303.13592

Kyle Mahowald (@kmahowald) 's Twitter Profile Photo

Now that you’ve no doubt solved your Sunday crossword puzzle, looking to read about crosswords and linguistics? In The Atlantic theatlantic.com/science/archiv…, Scott AnderBois, Nicholas Tomlin, and I talk about what linguistics can tell us about crosswords and vice versa. Thread.

Michael Lepori (@michael_lepori) 's Twitter Profile Photo

Domain experts often have intuitions about the algorithms that transformers may use to solve tasks, but do models actually use them? In new work with Thomas Serre and Brown NLP, we introduce circuit probing, a method for uncovering circuits that compute intermediate variables. (1/15)

Sundar Pichai (@sundarpichai) 's Twitter Profile Photo

Introducing Gemini 1.0, our most capable and general AI model yet. Built natively to be multimodal, it’s the first step in our Gemini-era of models. Gemini is optimized in three sizes - Ultra, Pro, and Nano. Gemini Ultra’s performance exceeds current state-of-the-art results on

Michael Lepori (@michael_lepori) 's Twitter Profile Photo

Compositional generalization is a major challenge for neural networks. In a #NeurIPS2023 spotlight paper with Thomas Serre and Brown NLP, we ask whether neural networks learn the types of representations that are a prerequisite for compositionality! (1/14)

Daniel Johnson (@_ddjohnson) 's Twitter Profile Photo

Excited to share Penzai, a JAX research toolkit from Google DeepMind for building, editing, and visualizing neural networks! Penzai makes it easy to see model internals and lets you inject custom logic anywhere. Check it out on GitHub: github.com/google-deepmin…

Benjamin Spiegel (@superspeeg) 's Twitter Profile Photo

Why did only humans invent graphical systems like writing? 🧠✍️ In our new paper at CogSci Society, we explore how agents learn to communicate using a model of pictographic signification similar to human proto-writing. 🧵👇