Mickael Chen (@mickael_chen) Twitter Tweets • TwiCopy

Andrew Brown

a year ago

So how did we get to these amazing videos for Meta Movie Gen? One of the things I’m proudest of is that we released a very detailed technical report (ai.meta.com/research/movie……) Lets dive into a technical summary of what we did & learnt 🧵 1/n x.com/AIatMeta/statu…

thumb_up_off_alt1,1K

chat_bubble_outline26

repeat159

shareShare

Mickael Chen

@mickael_chen

a year ago

This here is the important part: Unified architecture for text and media generation.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Christian Wolf (🦋🦋🦋)

@chriswolfvision

a year ago

François Fleuret It looks like the name "entropy" in information theory was optimized for impact on the field 😉

<a href="/francoisfleuret/">François Fleuret</a> It looks like the name "entropy" in information theory was optimized for impact on the field 😉

thumb_up_off_alt148

chat_bubble_outline4

repeat16

shareShare

Steinn Sigurðsson

@steinly0

a year ago

Dearest LLM script kiddies who are slashdotting arXiv.org - if you're being blocked for being rude, the solution is not to try the same stupid thing again an hour later. You're spoiling the Commons. Please don't make us get irritated at your dumbfoolery.

thumb_up_off_alt212

chat_bubble_outline5

repeat35

shareShare

Sergey Levine

@svlevine

10 months ago

Really excited to share what I've been working on with my colleagues at Physical Intelligence! We've developed a prototype robotic foundation model that can fold laundry, assemble a box, bus a table, and many other things. We've written a paper and blog post about it. 🧵👇

thumb_up_off_alt959

chat_bubble_outline25

repeat149

shareShare

Andrei Bursuc

@abursuc

10 months ago

I'm giving a try to the other blue thing place, though not planning to leave this one as it has its own specific fun. Currently that place does have a vibe of early 2010s Google Plus as I start my bubble with mostly researchers.

thumb_up_off_alt17

chat_bubble_outline2

repeat1

shareShare

Mickael Chen

@mickael_chen

10 months ago

If you are a MSc student looking for research internship in AI/CV in Paris, you should definitely check valeo.ai. This place takes a very good care of all of their interns. ❤️

thumb_up_off_alt6

chat_bubble_outline0

repeat0

shareShare

Luca Ambrogioni

@lucaamb

10 months ago

Generative decisions in diffusion models are made at special critical time points. Missing these points with a fast sampler results in loss of diversity. It's like missing an exit in the highway! Paper: openreview.net/forum?id=lxGFG…

thumb_up_off_alt134

chat_bubble_outline1

repeat29

shareShare

Alexander Kolesnikov

@__kolesnikov__

9 months ago

I always dreamed of a model that simultaneously 1. optimizes NLL of raw pixel data, 2. generates competitive high-res. natural images, 3. is practical. But it seemed too good to be true. Until today! Our new JetFormer model (arxiv.org/abs/2411.19722) ticks on all of these. 🧵

thumb_up_off_alt292

chat_bubble_outline5

repeat41

shareShare

Sander Dieleman

@sedielem

9 months ago

Better VQ-VAEs with this one weird rotation trick! I love papers like this: a simple change to an already powerful technique, that significantly improves results without introducing complexity or hyperparameters. arxiv.org/abs/2410.06424 (h/t lucidrains)

thumb_up_off_alt473

chat_bubble_outline3

repeat54

shareShare

Sasha Rush

@srush_nlp

9 months ago

Rare sincere tweet: December can be tough in academia. As a student I thought everyone had it together. As an advisor you see that is very much not true. Generally, at least as a starting place, a really recommend finding someone who you can go on a long walk with to talk it

thumb_up_off_alt405

chat_bubble_outline5

repeat37

shareShare

Edward Milsom

@edward_milsom

7 months ago

Our paper "Function-Space Learning Rates" is on arXiv! We give an efficient way to estimate the magnitude of changes to NN outputs caused by a particular weight update. We analyse optimiser dynamics in function space, and enable hyperparameter transfer with our scheme FLeRM! 🧵👇

thumb_up_off_alt420

chat_bubble_outline12

repeat68

shareShare

Stefano Ermon

@stefanoermon

6 months ago

Excited to share that I’ve been working on scaling up diffusion language models at Inception. A new generation of LLMs with unprecedented capabilities is coming!

thumb_up_off_alt692

chat_bubble_outline37

repeat81

shareShare

Garry Kasparov

@kasparov63

6 months ago

No US Russia or Ukraine policy was ever going to be made or changed today. That is the only conclusion if a buffoon like Vance is permitted to yap and derail a meeting between the US president and a world leader. The UN vote with Russia was the real signal; this was noise.

thumb_up_off_alt2,2K

chat_bubble_outline36

repeat428

shareShare

Tanishq Mathew Abraham, Ph.D.

@iscienceluvr

6 months ago

In some research fields (including AI), it can be common for people to have similar ideas. You'll even talk research w/ people working on ideas similar to yours. You'll often get scooped unless you're working in some extremely niche field. But the mark of a good

thumb_up_off_alt75

chat_bubble_outline5

repeat4

shareShare

Mustafa Shukor

@mustafashukor1

5 months ago

We release a large scale study to answer the following: - Is late fusion inherently better than early fusion for multimodal models? - How do native multimodal models scale compared to LLMs. - How sparsity (MoEs) can play a detrimental role in handling heterogeneous modalities? 🧵

thumb_up_off_alt428

chat_bubble_outline8

repeat73

shareShare

clem 🤗

@clementdelangue

3 months ago

It's VLA day with open-source model releases today from both H & Hugging Face LeRobot 🦾🦾🦾 VLA is short for Vision, Language, Action models. These are the models that allow modern robots to see, hear, understand & take action thanks to AI. It's GPT but for

It's VLA day with open-source model releases today from both <a href="/hcompany_ai/">H</a> & <a href="/huggingface/">Hugging Face</a> <a href="/LeRobotHF/">LeRobot</a> 🦾🦾🦾

VLA is short for Vision, Language, Action models. These are the models that allow modern robots to see, hear, understand & take action thanks to AI. It's GPT but for

thumb_up_off_alt241

chat_bubble_outline8

repeat46

shareShare

Nick Jiang @ ICLR

@nickhjiang

3 months ago

Vision transformers have high-norm outliers that hurt performance and distort attention. While prior work removed them by retraining with “register” tokens, we find the mechanism behind outliers and make registers at ✨test-time✨—giving clean features and better performance! 🧵

thumb_up_off_alt995

chat_bubble_outline15

repeat134

shareShare