Théo Moutakanni (@theomoutakanni) Twitter Tweets • TwiCopy

Wassim (Wes) Bouaziz

a year ago

Want to know if a ML model was trained on your dataset? Introducing ✨Data Taggants✨! We use data poisoning to leave a harmless and stealthy signature on your dataset that radiates through trained models. Learn how to protect your dataset from unauthorized use... A 🧵

thumb_up_off_alt77

chat_bubble_outline6

repeat20

shareShare

TimDarcet

@timdarcet

a year ago

Alright actual serious post. Lingua := super simple codebase + torch.compile for speed --> clean, hackable, but still efficient *It can train a 7B >llama2 in 24h*. Crazy. If you got the gpus, not only can you train a good 7B, you can *iterate* on it. You can do *research*

thumb_up_off_alt291

chat_bubble_outline5

repeat19

shareShare

TimDarcet

@timdarcet

a year ago

It's easy to add your weird new attention idea and see if it *scales*. Does it beat llama2? If two 2nd/3rd year phd students can do it, that means *you* can do it! You don't need a big team! *Just do stuff* Also Badr Youbi Idrissi is on the market in May and Mathurin next Oct. Just sayin.

thumb_up_off_alt15

chat_bubble_outline2

repeat1

shareShare

Mathurin Videau

@mathuvu_

a year ago

Meta Lingua: a minimal, fast LLM codebase for training and inference. By researchers, for researchers. Easily hackable, still reproducible. Built-in efficiency, profiling (cpu, gpu and mem) and interpretability (automatic activation and gradient statistics) Joint work w/ Badr Youbi Idrissi

thumb_up_off_alt48

chat_bubble_outline1

repeat14

shareShare

Tom Sander @NeurIPS

@rednastom

a year ago

🔒Image watermarking is promising for digital content protection. But images often undergo many modifications—spliced or altered by AI. Today at AI at Meta, we released Watermark Anything that answers not only "where does the image come from," but "what part comes from where." 🧵

thumb_up_off_alt24

chat_bubble_outline1

repeat7

shareShare

Tom Sander @NeurIPS

@rednastom

a year ago

🎉Exciting news from AI at Meta FAIR! We've released a Watermark Anything Model under the MIT license! It was announced yesterday: ai.meta.com/blog/meta-fair… Great project with Pierre Fernandez et al. ! We're close to hitting 1,000 stars on GitHub. Give it a try: github.com/facebookresear… 🚀

thumb_up_off_alt29

chat_bubble_outline0

repeat4

shareShare

Transactions on Machine Learning Research

@tmlrorg

a year ago

Outstanding Finalist 2: “DINOv2: Learning Robust Visual Features without Supervision," by Maxime Oquab, Timothée Darcet (TimDarcet), Théo Moutakanni (Théo Moutakanni) et al. 5/n x.com/AIatMeta/statu…

thumb_up_off_alt24

chat_bubble_outline2

repeat6

shareShare

TimDarcet

@timdarcet

a year ago

Very happy to say DINOv2 got outstanding certification finalist at TMLR! The models had an amazing reception already, but this kind of award is the cherry on top 😁

thumb_up_off_alt112

chat_bubble_outline2

repeat10

shareShare

Matt Schwartz

@mattzschwartz

10 months ago

📢Exciting announcement today 📢 #EndoDINO is a foundation model for #GI #Endoscopy. We believe this represents a new paradigm for #AI development in the field! EndoDINO - inspired by AI at Meta - will democratize/accelerate new AI research in gastroenterology. More below 👇

thumb_up_off_alt59

chat_bubble_outline9

repeat13

shareShare

Piotr Bojanowski

@p_bojanowski

9 months ago

🔥 The DINO team is looking for a PostDoc! 🔥 If you are about to graduate, and want to be part of what’s next for SSL, don’t hesitate to reach out! Link to job offer : metacareers.com/jobs/502476149…

thumb_up_off_alt155

chat_bubble_outline1

repeat28

shareShare

TimDarcet

@timdarcet

9 months ago

Want strong SSL, but not the complexity of DINOv2? CAPI: Cluster and Predict Latents Patches for Improved Masked Image Modeling.

thumb_up_off_alt600

chat_bubble_outline21

repeat108

shareShare

Gül Varol

@gulvarol

9 months ago

Somewhat poorly advertised MS program in Paris: MVA master-mva.com is a 1-year research Master's, with (mostly English) courses across Maths, Vision, Learning, and an internship (that can lead to PhD). Applications open May 1 - June 30 for 2025-2026. Grants possible.

thumb_up_off_alt46

chat_bubble_outline1

repeat10

shareShare

Delong Chen (陈德龙)

@delong0_0

8 months ago

This is my first paper done at FAIR. We show that adaptive visual token segmentation, especially in subobject-level (i.e., subwords in images), enables VLMs to have a better and faster learning of image understanding! arxiv.org/pdf/2402.14327

thumb_up_off_alt59

chat_bubble_outline1

repeat13

shareShare

Pierre Chambon

@pierrechambon6

8 months ago

Does your LLM truly comprehend the complexity of the code it generates? 🥰 Introducing our new non-saturated (for at least the coming week? 😉) benchmark: ✨BigO(Bench)✨ - Can LLMs Generate Code with Controlled Time and Space Complexity? Check out the details below !👇

thumb_up_off_alt119

chat_bubble_outline9

repeat26

shareShare

Pierre Chambon

@pierrechambon6

8 months ago

🔥Very happy to introduce BigO(Bench) dataset on Hugging Face 🤗 ✨3,105 coding problems and 1,190,250 solutions from CodeContests ✨Time/Space Complexity labels and curve coefficients ✨Up to 5k Runtime/Memory Footprint measures for each solution huggingface.co/datasets/faceb…

thumb_up_off_alt17

chat_bubble_outline1

repeat5

shareShare

Kunhao Zheng @ ICLR 2025

@kunhaoz

7 months ago

🚨 Your RL only improves 𝗽𝗮𝘀𝘀@𝟭, not 𝗽𝗮𝘀𝘀@𝗸? 🚨 That’s not a bug — it’s a 𝗳𝗲𝗮𝘁𝘂𝗿𝗲 𝗼𝗳 𝘁𝗵𝗲 𝗼𝗯𝗷𝗲𝗰𝘁𝗶𝘃𝗲 you’re optimizing. You get what you optimize for. If you want better pass@k, you need to optimize for pass@k at training time. 🧵 How?

thumb_up_off_alt823

chat_bubble_outline12

repeat141

shareShare

Adrien Bardes

@adrienbardes

5 months ago

Happy to share our latest work: V-JEPA 2, a world model trained from millions of videos, that enables visual understanding, planning and physical reasoning!

thumb_up_off_alt24

chat_bubble_outline0

repeat3

shareShare

Federico Baldassarre

@baldassarrefe

5 months ago

DINOv2 meets text at #CVPR 2025! Why choose between high-quality DINO features and CLIP-style vision-language alignment? Pick both with dino.txt 🦖📖 We align frozen DINOv2 features with text captions, obtaining both image-level and patch-level alignment at a minimal cost. [1/N]

thumb_up_off_alt675

chat_bubble_outline4

repeat105

shareShare

Wassim (Wes) Bouaziz

@_vassim

5 months ago

🚨New AI Security paper alert: Winter Soldier 🥶🚨 In our last paper, we show: -how to backdoor a LM _without_ training it on the backdoor behavior -use that to detect if a black-box LM has been trained on your protected data Yes, Indirect data poisoning is real and powerful!

thumb_up_off_alt46

chat_bubble_outline1

repeat21

shareShare

Delong Chen (陈德龙)

@delong0_0

5 months ago

I'm attending ICML'25 in Vancouver. Will present: 1) Subobject-level adaptive image token segmentation (main conference) arxiv.org/abs/2402.14327 2) WorldPrediction benchmark for world modeling and procedual planing (in Assessing World Models workshop) arxiv.org/abs/2506.04363

thumb_up_off_alt9

chat_bubble_outline0

repeat1

shareShare