Jiarui Xu (@jerry_xu_jiarui) 's Twitter Profile
Jiarui Xu

@jerry_xu_jiarui

Final-year Ph.D. student at UC San Diego
Undergraduate from HKUST

ID: 745524777662582785

Link: http://jerryxu.net · Joined: 22-06-2016 07:52:13

89 Tweets

1.1K Followers

528 Following

Alex Nichol (@unixpickle) 's Twitter Profile Photo

I investigated this in 2017 and even at the time it looked encouraging. Glad to see it make a comeback. github.com/unixpickle/sgd…

Xinlei Chen (@endernewton) 's Twitter Profile Photo

Very happy to see the TTT-series reaching yet another milestone! This time it serves as an inspiration for next-generation architecture post-Transformer, and by connecting TTT to Transformer, it can explain why (autoregressive) Transformers are so good at in-context learning!

Yann Dubois (@yanndubs) 's Twitter Profile Photo

🔥 New language modelling layer in town:
- more expressive than RNNs
- more efficient (linear compute) than attention!

Key perspective: LM layers are ML models trained to memorize the tokens in a sequence:
- Linear memorizer => RNN
- Kernel memorizer => attention
- Neural memorizer => our layer
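
The "neural memorizer" idea above can be illustrated with a toy layer whose hidden state is itself a tiny linear model, trained online with one gradient step per token to memorize that token. This is only a hedged sketch of the concept, not the paper's actual layer; the function name, reconstruction objective, and learning rate are illustrative assumptions.

```python
import numpy as np

def ttt_layer(tokens, d, lr=0.5):
    """Toy 'neural memorizer' sequence layer (illustrative only).

    The hidden state is a small linear model W, trained online to map
    each token to itself (memorization). The output at each step is W
    applied to the current token, so later tokens are processed with a
    state that has already 'learned' the earlier ones.
    """
    W = np.zeros((d, d))
    outputs = []
    for x in tokens:
        y = W @ x                  # read from the learned memory
        grad = np.outer(y - x, x)  # gradient of 0.5*||W x - x||^2 wrt W
        W -= lr * grad             # one online training step
        outputs.append(y)
    return np.stack(outputs)
```

With this framing, a linear-regression memorizer recovers an RNN-style recurrence and a kernel memorizer recovers attention; the sketch above is the "neural" case in its simplest (linear-model) form.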

Jiarui Xu (@jerry_xu_jiarui) 's Twitter Profile Photo

Thinking about a PhD? Don’t miss the chance to work with Elliott / Shangzhe Wu! He’s not only a brilliant researcher but also an inspiring mentor and collaborator. Excited to see the amazing projects his new team will bring to life! 🌟

Omer Bar Tal (@omerbartal) 's Twitter Profile Photo

Meet Pika 2.0! Besides improved quality and motion, our new model can embed user-provided concepts into the generated videos, without any training! Combined with an unprecedented level of text-alignment, you can now create YOUR OWN personalized content with minimal effort 🎬

SifeiL (@sifei30488l) 's Twitter Profile Photo

Introducing GSPN: A Leap Forward in Vision Attention Mechanisms
Paper: arxiv.org/pdf/2501.12381
Project: whj363636.github.io/GSPN/

We present GSPN (Generalized Spatial Propagation Network), a novel attention mechanism developed at NVIDIA. Unlike pixel-to-pixel scans like Mamba,

Yinbo Chen (@yinbochen) 's Twitter Profile Photo

Introducing “Diffusion Autoencoders are Scalable Image Tokenizers” (DiTo).

We show that with proper designs and scaling up, diffusion autoencoders (a single L2 loss) can outperform the GAN-LPIPS tokenizers (hybrid losses) used in current SOTA generative models. (1/4)

Yuzhe Qin (@qinyuzhe) 's Twitter Profile Photo

Meet our first general-purpose robot at Dexmate: dexmate.ai/vega

Adjustable height from 0.66m to 2.2m: compact enough for an SUV, tall enough to reach those impossible high shelves. Powerful dual arms (15 lbs payload each) and omni-directional mobility for ultimate

Karan Dalal (@karansdalal) 's Twitter Profile Photo

Today, we're releasing a new paper – One-Minute Video Generation with Test-Time Training. We add TTT layers to a pre-trained Transformer and fine-tune it to generate one-minute Tom and Jerry cartoons with strong temporal consistency. Every video below is produced directly by

Gashon Hussein (@gashonhussein) 's Twitter Profile Photo

Excited to share our new paper, "One-Minute Video Generation with Test-Time Training (TTT)" in collaboration with NVIDIA.

We augment a pre-trained Transformer with TTT-layers and finetune it to generate one-minute Tom and Jerry cartoons with strong temporal and spatial

Xiaolong Wang (@xiaolonw) 's Twitter Profile Photo

Test-Time Training (TTT) is now on Video! And not just a 5-second video. We can generate a full 1-min video! TTT module is an RNN module that provides an explicit and efficient memory mechanism. It models the hidden state of an RNN with a machine learning model, which is updated
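
The mechanism described above, an RNN whose hidden state is the weights of a small model updated at test time, can be contrasted with a standard linear RNN in a toy sketch. This is illustrative only, not the paper's implementation; the function names, shapes, reconstruction loss, and learning rate are assumptions made for the example.

```python
import numpy as np

def linear_rnn_step(h, x, A, B):
    """Standard linear RNN: the memory is a fixed-size vector h."""
    return A @ h + B @ x

def ttt_step(W, x, lr=0.5):
    """TTT-style update: the 'memory' is the weight matrix W of a tiny
    model, trained at test time by one gradient step of a
    self-supervised reconstruction loss 0.5 * ||W x - x||^2."""
    grad = np.outer(W @ x - x, x)  # dL/dW for the loss above
    return W - lr * grad
```

The key difference: the linear RNN compresses history into a vector via a fixed linear map, while the TTT step compresses history into model weights via gradient descent, giving an explicit memory that improves the longer the sequence it has seen.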