Yi-han Sheu (@yihansheu) Twitter Tweets • TwiCopy

Yi-han Sheu

@yihansheu

AI & Mental Health | Psychiatrist & Epidemiologist & ML | Instructor at Harvard & MGH | Opinions are my own.
yihansheu.github.io

ID: 1908485875

Joined: 26-09-2013 16:30:46

103 Tweets

66 Followers

256 Following

Yi-han Sheu

@yihansheu

8 months ago

Timeless youtu.be/2D2IPDRPvFY?si…

Likes: 0 · Replies: 0 · Retweets: 0

DeepSeek

🚀 Introducing NSA: A Hardware-Aligned and Natively Trainable Sparse Attention mechanism for ultra-fast long-context training & inference! Core components of NSA: • Dynamic hierarchical sparse strategy • Coarse-grained token compression • Fine-grained token selection 💡 With

Likes: 16K · Replies: 901 · Retweets: 2K

Satya Nadella

@satyanadella

8 months ago

A couple reflections on the quantum computing breakthrough we just announced... Most of us grew up learning there are three main types of matter that matter: solid, liquid, and gas. Today, that changed. After a nearly 20 year pursuit, we’ve created an entirely new state of

Likes: 109K · Replies: 5K · Retweets: 19K

nature

@nature

8 months ago

About a month after Donald Trump took office, almost all grant-review meetings remain suspended at the US National Institutes of Health, preventing the world’s largest public funder of biomedical research from spending much of its US$47 billion annual budget.

Likes: 1K · Replies: 96 · Retweets: 442

Yi-han Sheu

@yihansheu

7 months ago

Nice

Likes: 0 · Replies: 0 · Retweets: 0

AI at Meta

@aiatmeta

7 months ago

Today is the start of a new era of natively multimodal AI innovation. Today, we’re introducing the first Llama 4 models: Llama 4 Scout and Llama 4 Maverick — our most advanced models yet and the best in their class for multimodality. Llama 4 Scout • 17B-active-parameter model

Likes: 13K · Replies: 706 · Retweets: 2K

Yi-han Sheu

@yihansheu

4 months ago

Apparently, LLMs today have capabilities and limitations that approximate human-level intelligence, although along dimensions different from those involved when a child develops cognitive functions into adulthood. That is a well-known, objective fact. What is the added value or

Likes: 0 · Replies: 0 · Retweets: 0

Yi-han Sheu

@yihansheu

4 months ago

With some abuse of analogy, Waymo is reminiscent of Deep Blue in chess. However, driving, unlike chess, is not a game with a fixed set of rules and finite states. The Waymo approach seems to impose a set of hard-wired constraints that prevent it from scaling. Of course, they can

Likes: 1 · Replies: 0 · Retweets: 0

Yi-han Sheu

@yihansheu

4 months ago

lol

Likes: 1 · Replies: 0 · Retweets: 0

Yi-han Sheu

@yihansheu

4 months ago

Looks promising

Likes: 2 · Replies: 0 · Retweets: 0

NeurIPS Conference

@neuripsconf

3 months ago

We're excited to announce a second physical location for NeurIPS 2025, in Mexico City. By expanding our physical locations, we hope to address concerns around skyrocketing attendance and difficulties in obtaining travel visas that some attendees have experienced in the past few

Likes: 477 · Replies: 14 · Retweets: 62

Yi-han Sheu

@yihansheu

3 months ago

Nice!

Likes: 1 · Replies: 0 · Retweets: 0
