Gargi Ghosh (@gargighosh)'s Twitter Profile
Gargi Ghosh

@gargighosh

Researcher at FAIR (Meta AI)

ID: 95091994

Link: https://scholar.google.com/citations?user=k5akwCcAAAAJ&hl=en&oi=ao
Joined: 06-12-2009 23:32:46

66 Tweets

700 Followers

151 Following

Yannic Kilcher 🇸🇨 (@ykilcher)'s Twitter Profile Photo

🔥New Video🔥
I delve (ha!) into Byte Latent Transformer: Patches Scale Better Than Tokens, where the authors do away with tokenization and create an LLM architecture that operates on dynamically sized "patches" instead of tokens. By controlling the patch size, they gain a level…
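As a rough illustration of the dynamic-patching idea: a small byte-level model scores each byte, and a new patch starts wherever next-byte entropy crosses a threshold. The snippet below is a minimal sketch of that boundary rule, not the paper's implementation; the `entropies` input and `threshold` value are placeholders.

```python
import numpy as np

def entropy_patch_boundaries(entropies: np.ndarray, threshold: float) -> list[int]:
    """Start a new patch at byte i when the next-byte entropy exceeds `threshold`.

    `entropies[i]` is assumed to come from a small byte-level LM scoring how
    predictable byte i is given the preceding bytes.
    """
    boundaries = [0]  # a patch always starts at the first byte
    for i in range(1, len(entropies)):
        if entropies[i] > threshold:
            boundaries.append(i)
    return boundaries

# Toy usage: predictable (low-entropy) runs merge into long patches,
# while surprising bytes open new, shorter patches.
ent = np.array([0.2, 0.1, 0.1, 2.5, 0.3, 0.2, 3.1, 0.4])
print(entropy_patch_boundaries(ent, threshold=1.0))  # -> [0, 3, 6]
```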
wh (@nrehiew_)'s Twitter Profile Photo

Wrote about some of my favourite papers over the past year or so and some research directions that I am excited about in 2025

As a bonus, I think it's a good overview for someone to catch up on the current state of the art :)
AI at Meta (@aiatmeta)'s Twitter Profile Photo

New research from Meta FAIR: Memory Layers at Scale. This work takes memory layers beyond proof of concept, proving their utility at contemporary scale ➡️ go.fb.me/3lbt4m
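For context, memory layers in this line of work are typically trainable key-value stores queried with a product-key lookup, so only a handful of a very large number of value slots are touched per token. The PyTorch sketch below is a generic product-key memory in that spirit, not the paper's released code; all sizes and names are illustrative.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ProductKeyMemory(nn.Module):
    """Generic product-key memory: the key space is the Cartesian product of two
    small key tables, giving n_keys**2 addressable value slots while only scoring
    2 * n_keys keys per query."""

    def __init__(self, dim: int, n_keys: int = 128, topk: int = 8):
        super().__init__()
        self.topk = topk
        self.n_keys = n_keys
        half = dim // 2
        self.keys1 = nn.Parameter(torch.randn(n_keys, half) / half**0.5)
        self.keys2 = nn.Parameter(torch.randn(n_keys, half) / half**0.5)
        self.values = nn.Embedding(n_keys * n_keys, dim)  # the large memory table

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (batch, dim)
        q1, q2 = x.chunk(2, dim=-1)
        s1, i1 = (q1 @ self.keys1.t()).topk(self.topk, dim=-1)  # (batch, topk)
        s2, i2 = (q2 @ self.keys2.t()).topk(self.topk, dim=-1)
        # Combine the two half-scores into topk*topk candidate slots.
        scores = (s1.unsqueeze(-1) + s2.unsqueeze(-2)).flatten(1)          # (batch, topk^2)
        indices = (i1.unsqueeze(-1) * self.n_keys + i2.unsqueeze(-2)).flatten(1)
        best_scores, best = scores.topk(self.topk, dim=-1)
        slots = indices.gather(1, best)                                     # (batch, topk)
        weights = F.softmax(best_scores, dim=-1).unsqueeze(-1)
        return (weights * self.values(slots)).sum(dim=1)                    # (batch, dim)

# Usage: sparse memory lookup for a batch of token states.
mem = ProductKeyMemory(dim=64)
print(mem(torch.randn(4, 64)).shape)  # torch.Size([4, 64])
```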

clem 🤗 (@clementdelangue)'s Twitter Profile Photo

Our science team has started working on fully reproducing and open-sourcing R1 including training data, training scripts,... 

Full power of open source AI so that everyone all over the world can take advantage of AI progress! Will help debunk some myths I'm sure too.

Thanks
Inception Labs (@inceptionailabs)'s Twitter Profile Photo

We are excited to introduce Mercury, the first commercial-grade diffusion large language model (dLLM)! dLLMs push the frontier of intelligence and speed with parallel, coarse-to-fine text generation.
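Mercury's internals aren't described in the tweet; as a generic illustration of parallel, coarse-to-fine text generation, the sketch below runs a masked-denoising-style loop that commits the most confident token predictions over a few parallel refinement steps. The model, vocabulary, and schedule are all stand-ins.

```python
import torch

MASK_ID = 0  # placeholder id for the [MASK] token in this toy vocabulary

def coarse_to_fine_decode(model, length: int, steps: int = 4) -> torch.Tensor:
    """Toy parallel decoder in the spirit of masked discrete diffusion:
    start from an all-mask sequence and, over a few denoising steps,
    commit the most confident token predictions while keeping the rest masked."""
    seq = torch.full((1, length), MASK_ID, dtype=torch.long)
    per_step = max(1, length // steps)
    for _ in range(steps):
        if (seq == MASK_ID).sum() == 0:
            break
        logits = model(seq)                            # (1, length, vocab): all positions at once
        conf, pred = logits.softmax(-1).max(-1)        # best token and its confidence per position
        conf = conf.masked_fill(seq != MASK_ID, -1.0)  # only consider still-masked positions
        k = min(per_step, int((seq == MASK_ID).sum()))
        top = conf.topk(k, dim=-1).indices             # positions to commit this step
        seq[0, top[0]] = pred[0, top[0]]
    return seq

# Usage with a stand-in "model": random logits just to exercise the loop.
fake_model = lambda s: torch.randn(s.shape[0], s.shape[1], 100)
print(coarse_to_fine_decode(fake_model, length=12))
```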

Engineering at Meta (@fb_engineering)'s Twitter Profile Photo

Meta and NVIDIA have teamed up to supercharge vector search on GPUs by integrating NVIDIA cuVS into Faiss v1.10, Meta's open-source library for similarity search. This collaboration brings groundbreaking performance improvements:

🔹 IVF indexing: NVIDIA cuVS boosts build times…
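As a rough picture of the workload this accelerates, the snippet below builds an IVF index with Faiss and moves it to a GPU. It is a minimal sketch using the standard Faiss Python API (it needs a GPU-enabled Faiss build); the dataset sizes and nlist/nprobe values are arbitrary, and the cuVS acceleration described above happens under the hood rather than through any code change here.

```python
import numpy as np
import faiss  # requires a GPU-enabled Faiss build

d, nb, nq = 128, 100_000, 10          # vector dim, database size, number of queries
xb = np.random.rand(nb, d).astype("float32")
xq = np.random.rand(nq, d).astype("float32")

# IVF index: cluster the database into nlist cells, then search only nprobe of them.
nlist = 1024
quantizer = faiss.IndexFlatL2(d)
index = faiss.IndexIVFFlat(quantizer, d, nlist)

# Move the index to GPU 0; index build/training is the step the tweet says cuVS speeds up.
res = faiss.StandardGpuResources()
gpu_index = faiss.index_cpu_to_gpu(res, 0, index)

gpu_index.train(xb)                    # k-means training of the IVF cells
gpu_index.add(xb)
gpu_index.nprobe = 32                  # cells to visit per query
distances, ids = gpu_index.search(xq, 5)
print(ids.shape)                       # (10, 5)
```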
Yann LeCun (@ylecun)'s Twitter Profile Photo

Rob Fergus is the new head of Meta-FAIR! FAIR is refocusing on Advanced Machine Intelligence: what others would call human-level AI or AGI. linkedin.com/posts/rob-ferg…

Weixin Liang (@liang_weixin)'s Twitter Profile Photo

🎉 Excited to share: "Mixture-of-Transformers (MoT)" has been officially accepted to TMLR (March 2025) and the code is now open-sourced!

📌 GitHub repo: github.com/facebookresear…
📄 Paper: arxiv.org/abs/2411.04996

How can we reduce pretraining costs for…
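The core idea, as I understand the paper, is to keep global self-attention over the full multimodal sequence while giving each modality its own non-embedding weights. The PyTorch sketch below only routes the feed-forward block by modality (the paper also decouples attention projections and norms); layer sizes, the two-modality setup, and module names are illustrative, not the released code.

```python
import torch
import torch.nn as nn

class MoTBlock(nn.Module):
    """Sketch of a Mixture-of-Transformers-style block: attention is shared and
    global across the sequence, while each modality gets its own FFN weights."""

    def __init__(self, dim: int = 256, n_heads: int = 4, n_modalities: int = 2):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, n_heads, batch_first=True)
        self.ffns = nn.ModuleList([
            nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))
            for _ in range(n_modalities)
        ])
        self.norm1 = nn.LayerNorm(dim)
        self.norm2 = nn.LayerNorm(dim)

    def forward(self, x: torch.Tensor, modality: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq, dim); modality: (batch, seq) integer id per token.
        h = self.norm1(x)
        attn_out, _ = self.attn(h, h, h)      # global attention across all modalities
        x = x + attn_out
        h = self.norm2(x)
        out = torch.zeros_like(h)
        for m, ffn in enumerate(self.ffns):   # deterministic routing by modality id
            mask = modality == m
            if mask.any():
                out[mask] = ffn(h[mask])
        return x + out

# Usage: a sequence mixing text (0) and image (1) tokens.
block = MoTBlock()
tokens = torch.randn(2, 10, 256)
mods = torch.randint(0, 2, (2, 10))
print(block(tokens, mods).shape)  # torch.Size([2, 10, 256])
```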
Gargi Ghosh (@gargighosh)'s Twitter Profile Photo

Factuality and hallucination are a big problem in foundation models. We demonstrate that it's possible to improve long-form factuality by 35% without losing helpfulness or the ability to provide detailed responses, using long CoT, a new reward for GRPO, and tricks to stop reward hacking.
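For readers unfamiliar with GRPO, the sketch below shows the group-relative advantage computation at its core: several responses are sampled per prompt, scored by a reward, and each response's advantage is its reward standardized within the group. The factuality-style reward values here are made-up stand-ins, not the paper's reward.

```python
import numpy as np

def group_relative_advantages(rewards: np.ndarray, eps: float = 1e-6) -> np.ndarray:
    """GRPO-style advantages: standardize each response's reward against the other
    responses sampled for the same prompt (no learned value function needed)."""
    mean = rewards.mean(axis=-1, keepdims=True)
    std = rewards.std(axis=-1, keepdims=True)
    return (rewards - mean) / (std + eps)

# Toy example: 2 prompts x 4 sampled responses, rewards from a hypothetical
# factuality scorer (e.g. fraction of verifiable claims that are supported).
rewards = np.array([
    [0.9, 0.4, 0.7, 0.2],
    [0.5, 0.5, 0.8, 0.1],
])
print(group_relative_advantages(rewards).round(2))
# Responses above their group's average get positive advantages.
```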

AI at Meta (@aiatmeta)'s Twitter Profile Photo

๐Ÿ† We're thrilled to announce that Meta FAIRโ€™s Brain & AI team won 1st place at the prestigious Algonauts 2025 brain modeling competition. Their 1B parameter model, TRIBE (Trimodal Brain Encoder), is the first deep neural network trained to predict brain responses to stimuli

Gargi Ghosh (@gargighosh)'s Twitter Profile Photo

New research from FAIR: Active Reading, a framework for learning a given set of material with self-generated learning strategies, for both general and expert domains (such as finance). Models absorb significantly more knowledge than with vanilla finetuning and the usual data augmentation strategies.
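A rough sketch of what such a pipeline might look like: for each source document, an LLM is prompted to produce several self-study style augmentations (paraphrases, self-quizzes, summaries), and the expanded set is what the model is finetuned on. Everything below is hypothetical, including the `generate` helper and the strategy prompts; it is not claimed to match the paper's actual strategies.

```python
# Hypothetical data-generation loop for an Active Reading-style pipeline.
# `generate(prompt)` stands in for any LLM call and is not a real API.

STRATEGIES = {
    "paraphrase": "Rewrite the passage below in your own words:\n{doc}",
    "self_quiz": "Write question-answer pairs that test the key facts in:\n{doc}",
    "summary": "Summarize the passage below, keeping all named entities and numbers:\n{doc}",
}

def generate(prompt: str) -> str:
    """Placeholder for an LLM call; returns a stub so the script runs as-is."""
    return f"<generated for: {prompt[:40]}...>"

def active_reading_corpus(documents: list[str]) -> list[dict]:
    """Expand each document into several self-generated study artifacts, which are
    then used as finetuning data alongside the original text."""
    examples = []
    for doc in documents:
        examples.append({"strategy": "original", "text": doc})
        for name, template in STRATEGIES.items():
            examples.append({"strategy": name, "text": generate(template.format(doc=doc))})
    return examples

print(len(active_reading_corpus(["Company X reported revenue of $3.2B in Q4 2024."])))  # 4
```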