Joy Dong (@joychew_d) 's Twitter Profile
Joy Dong

@joychew_d

PhD candidate @UMich. Previously @PyTorch @NVidia. #ConfidentialComputing #GPU Optimization & Architecture

ID: 419348267

Joined: 23-11-2011 07:42:53

20 Tweets

142 Followers

50 Following

Hari Sadasivan PhD (@iamharisankar) 's Twitter Profile Photo

🌟1/3: Introducing mm2-gb, GPU-accelerated Minimap2 for long-read DNA mapping! 🔥 mm2-gb accelerates Minimap2's bottleneck (chaining) on GPUs without compromising accuracy. 🚀 Kudos to Xueshen Liu, Joy Dong, Satish Narayanasamy & Gina Sitaraman, Computer Science and Engineering at Michigan, AMD, Michigan Engineering

Horace He (@chhillee) 's Twitter Profile Photo

For too long, users have lived under the software lottery tyranny of fused attention implementations. No longer. Introducing FlexAttention, a new PyTorch API allowing for many attention variants to enjoy fused kernels in a few lines of PyTorch. pytorch.org/blog/flexatten… 1/10

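For readers unfamiliar with the API the thread describes, here is a minimal sketch of how an attention variant is expressed with FlexAttention (assuming PyTorch 2.5+; the shapes and the causal mask are illustrative, not code from the thread):

```python
# A minimal sketch of the FlexAttention API (PyTorch >= 2.5), for illustration only.
import torch
from torch.nn.attention.flex_attention import flex_attention, create_block_mask

B, H, S, D = 2, 8, 1024, 64  # illustrative shapes
q = torch.randn(B, H, S, D, device="cuda", dtype=torch.float16)
k = torch.randn(B, H, S, D, device="cuda", dtype=torch.float16)
v = torch.randn(B, H, S, D, device="cuda", dtype=torch.float16)

# An attention variant is expressed as a small Python function; here, causal masking.
def causal(b, h, q_idx, kv_idx):
    return q_idx >= kv_idx

block_mask = create_block_mask(causal, B=None, H=None, Q_LEN=S, KV_LEN=S)

# torch.compile lowers the variant into a fused attention kernel.
flex = torch.compile(flex_attention)
out = flex(q, k, v, block_mask=block_mask)
```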
Joy Dong (@joychew_d) 's Twitter Profile Photo

Excited to announce PyTorch support for customizable score modification for attention kernels! Stay tuned for Chapter 2: Inference and GQA support🥳
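As a rough illustration of the customizable score modification mentioned above, the sketch below adds an ALiBi-style bias through a score_mod callable (the bias and shapes are assumptions for illustration, not taken from the tweet):

```python
# A minimal sketch of a custom score modification (score_mod) with FlexAttention.
import torch
from torch.nn.attention.flex_attention import flex_attention

H = 8
alibi_slopes = torch.exp2(-torch.arange(1, H + 1, device="cuda").float())

# score_mod receives the raw attention score plus batch/head/query/key indices
# and returns the modified score; FlexAttention fuses it into the kernel.
def alibi_bias(score, b, h, q_idx, kv_idx):
    return score + alibi_slopes[h] * (kv_idx - q_idx)

q = torch.randn(1, H, 512, 64, device="cuda", dtype=torch.float16)
k, v = torch.randn_like(q), torch.randn_like(q)

out = torch.compile(flex_attention)(q, k, v, score_mod=alibi_bias)
```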

Joy Dong (@joychew_d) 's Twitter Profile Photo

I'll be at ACM BCB'24 in Shenzhen, China from 22-25 Nov to present our work mm2-gb and how we boost minimap2 performance using GPUs. If you are running Minimap2, please check out mm2-gb if you haven't already. github.com/Minimap2onGPU/… DM me if you're around and willing to chat!

Joy Dong (@joychew_d) 's Twitter Profile Photo

Our preprint for FlexAttention is available on arXiv: arxiv.org/abs/2412.05496! Check it out for more technical details on how FlexAttention works and how we optimized it.

Joy Dong (@joychew_d) 's Twitter Profile Photo

🚀 Excited to see FlexAttention used in real-world research! We've recently released a preprint on this: arxiv.org/abs/2412.05496 -- check it out for more details! We are writing a second blog post on how to use FlexAttention for inference. Stay tuned!

PyTorch (@pytorch) 's Twitter Profile Photo

FlexAttention’s decoding backend is now optimized for inference—supporting GQA, PagedAttention, nested jagged tensors, trainable biases, and more. Read our latest blog for performance tuning guidance and examples using torchtune and gpt-fast: 🔗 hubs.la/Q03ktGsH0 #PyTorch

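As a rough illustration of the inference features listed above, the sketch below runs a single grouped-query attention (GQA) decode step; the enable_gqa flag and all shapes are assumptions based on the public flex_attention signature, not code from the blog:

```python
# A minimal sketch of a GQA decode step with FlexAttention (assumed enable_gqa flag).
import torch
from torch.nn.attention.flex_attention import flex_attention

B, Hq, Hkv, S_kv, D = 1, 32, 8, 2048, 128  # 32 query heads share 8 KV heads
q = torch.randn(B, Hq, 1, D, device="cuda", dtype=torch.float16)      # one new token
k = torch.randn(B, Hkv, S_kv, D, device="cuda", dtype=torch.float16)  # cached keys
v = torch.randn(B, Hkv, S_kv, D, device="cuda", dtype=torch.float16)  # cached values

# enable_gqa lets groups of query heads attend to shared KV heads
# without materializing repeated key/value tensors.
out = torch.compile(flex_attention)(q, k, v, enable_gqa=True)
```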
Joy Dong (@joychew_d) 's Twitter Profile Photo

Super excited to release FlexAttention for Inference with a decoding backend, GQA, PagedAttention, trainable biases, and more! Meet us at the MLSys '25 conference in Santa Clara -- we will present FlexAttention on Wed, May 14. #MLSys