Yi-han Sheu (@yihansheu) 's Twitter Profile
Yi-han Sheu

@yihansheu

AI & Mental Health | Psychiatrist & Epidemiologist & ML | Instructor at Harvard & MGH | Opinions are my own.
yihansheu.github.io

ID: 1908485875

calendar_today26-09-2013 16:30:46

103 Tweet

66 Followers

256 Following

DeepSeek (@deepseek_ai) 's Twitter Profile Photo

🚀 Introducing NSA: A Hardware-Aligned and Natively Trainable Sparse Attention mechanism for ultra-fast long-context training & inference! Core components of NSA: • Dynamic hierarchical sparse strategy • Coarse-grained token compression • Fine-grained token selection 💡 With

🚀 Introducing NSA: A Hardware-Aligned and Natively Trainable Sparse Attention mechanism for ultra-fast long-context training & inference!

Core components of NSA:
• Dynamic hierarchical sparse strategy
• Coarse-grained token compression
• Fine-grained token selection

đź’ˇ With
Satya Nadella (@satyanadella) 's Twitter Profile Photo

A couple reflections on the quantum computing breakthrough we just announced... Most of us grew up learning there are three main types of matter that matter: solid, liquid, and gas. Today, that changed. After a nearly 20 year pursuit, we’ve created an entirely new state of

A couple reflections on the quantum computing breakthrough we just announced...

Most of us grew up learning there are three main types of matter that matter: solid, liquid, and gas. Today, that changed.

After a nearly 20 year pursuit, we’ve created an entirely new state of
nature (@nature) 's Twitter Profile Photo

About a month after Donald Trump took office, almost all grant-review meetings remain suspended at the US National Institutes of Health, preventing the world’s largest public funder of biomedical research from spending much of its US$47 billion annual budget.

AI at Meta (@aiatmeta) 's Twitter Profile Photo

Today is the start of a new era of natively multimodal AI innovation. Today, we’re introducing the first Llama 4 models: Llama 4 Scout and Llama 4 Maverick — our most advanced models yet and the best in their class for multimodality. Llama 4 Scout • 17B-active-parameter model

Today is the start of a new era of natively multimodal AI innovation.

Today, we’re introducing the first Llama 4 models: Llama 4 Scout and Llama 4 Maverick —  our most advanced models yet and the best in their class for multimodality.

Llama 4 Scout
• 17B-active-parameter model
Yi-han Sheu (@yihansheu) 's Twitter Profile Photo

Apparently, LLMs today have capabilities and limitations that approximate human-level intelligence, although along dimensions different from those involved when a child develops cognitive functions into adulthood. That is a well-known, objective fact. What is the added value or

Yi-han Sheu (@yihansheu) 's Twitter Profile Photo

With some abuse of analogy, Waymo is reminiscent of Deep Blue in chess. However, driving, unlike chess, is not a game with a fixed set of rules and finite states. The Waymo approach seems to impose a set of hard-wired constraints that prevent it from scaling. Of course, they can

NeurIPS Conference (@neuripsconf) 's Twitter Profile Photo

We're excited to announce a second physical location for NeurIPS 2025, in Mexico City. By expanding our physical locations, we hope to address concerns around skyrocketing attendance and difficulties in obtaining travel visas that some attendees have experienced in the past few