Sweta Agrawal (@swetaagrawal20) 's Twitter Profile
Sweta Agrawal

@swetaagrawal20

Research Scientist @Google Translate |
Past: Postdoc Researcher @itnewspt | Ph.D. @ClipUmd, @umdcs
#nlproc

ID: 2559288140

Link: http://sweta20.github.io | Joined: 10-06-2014 15:47:30

339 Tweets

1.1K Followers

1.1K Following

Antonio Farinhas (@tozefarinhas) 's Twitter Profile Photo

If you're in Vancouver attending #NeurIPS2024, stop by our spotlight poster on Thu 12 Dec 11am-2pm PST (East Exhibit Hall A-C #3203). Check out the updated version of the paper: openreview.net/forum?id=rhCgi…

Artidoro Pagnoni (@artidoropagnoni) 's Twitter Profile Photo

🚀 Introducing the Byte Latent Transformer (BLT) – An LLM architecture that scales better than Llama 3 using byte-patches instead of tokens 🤯

Paper 📄 dl.fbaipublicfiles.com/blt/BLT__Patch…
Code 🛠️ github.com/facebookresear…
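
The byte-patch idea can be pictured with a toy grouping rule. A minimal sketch, not BLT's actual patching scheme: assume some per-byte surprisal score and start a new patch whenever the score spikes, so predictable byte runs get merged into a single patch and the latent transformer sees far fewer units than raw bytes.

```python
# Toy illustration of grouping raw bytes into patches (NOT the BLT algorithm):
# a new patch starts whenever the next byte looks "surprising" under some
# scoring function; highly predictable runs are merged into a single patch.

def surprisal(prev: bytes, nxt: int) -> float:
    """Placeholder scorer: real systems would use a small byte-level LM.
    Here letters/spaces following a letter count as 'predictable'."""
    if prev and chr(prev[-1]).isalpha() and (chr(nxt).isalpha() or nxt == ord(" ")):
        return 0.5
    return 2.0

def byte_patches(text: bytes, threshold: float = 1.0) -> list[bytes]:
    patches, current = [], bytearray()
    for b in text:
        if current and surprisal(bytes(current), b) > threshold:
            patches.append(bytes(current))
            current = bytearray()
        current.append(b)
    if current:
        patches.append(bytes(current))
    return patches

print(byte_patches("The cat sat.".encode()))
# [b'The ', b'cat ', b'sat', b'.'] -- 4 patches instead of 12 bytes, so the
# latent transformer runs over a much shorter sequence than a raw byte model.
```
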
Saul Santos (@saul_santos1997) 's Twitter Profile Photo

🚀 New paper alert! 🚀

Ever tried asking an AI about a 2-hour movie? Yeah… not great.

Check: ∞-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation!

🔗 arxiv.org/abs/2501.19098
w/ Antonio Farinhas, Dan McNamee, Andre Martins
Patrick Fernandes (@psanfernandes) 's Twitter Profile Photo

MT metrics excel at evaluating sentence translations, but struggle with complex texts

We introduce *TREQA*, a framework to assess how translations preserve key info by using LLMs to generate & answer questions about them

arxiv.org/abs/2504.07583

(co-lead Sweta Agrawal)

1/15
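
The QA-based evaluation loop is easy to picture. Below is a minimal sketch of the general recipe described in the thread (generate questions from the reference, answer them from the candidate translation, compare answers); `llm()` is a placeholder for whatever LLM client you use, and the prompts and scoring are illustrative, not the paper's exact setup.

```python
# Hedged sketch of QA-based MT evaluation in the spirit of TREQA:
# 1) an LLM writes question-answer pairs about the reference passage,
# 2) the LLM answers each question using only the candidate translation,
# 3) predicted answers are compared against the reference answers.
# `llm()` is a stand-in for any text-completion call you have available.

def llm(prompt: str) -> str:
    raise NotImplementedError("plug in your LLM client here")

def generate_qa_pairs(reference: str, n: int = 5) -> list[tuple[str, str]]:
    raw = llm(f"Write {n} question-answer pairs (one per line, 'Q\\tA') "
              f"covering the key information in:\n{reference}")
    return [tuple(line.split("\t", 1)) for line in raw.splitlines() if "\t" in line]

def answer_from_translation(question: str, translation: str) -> str:
    return llm(f"Answer using ONLY this text:\n{translation}\n\nQ: {question}\nA:")

def qa_based_score(reference: str, translation: str) -> float:
    pairs = generate_qa_pairs(reference)
    if not pairs:
        return 0.0
    correct = 0
    for question, gold in pairs:
        pred = answer_from_translation(question, translation)
        # naive substring match; the real framework uses stronger answer comparison
        correct += gold.strip().lower() in pred.strip().lower()
    return correct / len(pairs)
```
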
Mohit Iyyer (@mohitiyyer) 's Twitter Profile Photo

GRPO + BLEU is a surprisingly good combination for improving instruction following in LLMs, yielding results on par with those from strong reward models in our experiments! Check out our paper for more 👇
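
The reward side of that recipe is simple to sketch: score each sampled completion with sentence-level BLEU against a reference and feed the group-normalized scores in as GRPO advantages. A minimal sketch, assuming `sacrebleu` is installed; the policy-gradient training loop itself is omitted.

```python
# Sketch of BLEU-as-reward for GRPO-style training: each completion in a
# group sampled for the same prompt gets a sentence-BLEU reward against the
# reference, and advantages are the group-normalized rewards.
import statistics
import sacrebleu

def bleu_rewards(completions: list[str], reference: str) -> list[float]:
    return [sacrebleu.sentence_bleu(c, [reference]).score for c in completions]

def group_relative_advantages(rewards: list[float], eps: float = 1e-6) -> list[float]:
    mean = statistics.mean(rewards)
    std = statistics.pstdev(rewards)
    return [(r - mean) / (std + eps) for r in rewards]

completions = ["The cat sat on the mat.", "A cat is on a mat.", "Dogs bark loudly."]
rewards = bleu_rewards(completions, "The cat sat on the mat.")
print(group_relative_advantages(rewards))  # higher for closer matches
```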

Xin Eric Wang @ ICLR 2025 (@xwang_lk) 's Twitter Profile Photo

๐˜๐˜ถ๐˜ฎ๐˜ข๐˜ฏ๐˜ด ๐˜ต๐˜ฉ๐˜ช๐˜ฏ๐˜ฌ ๐˜ง๐˜ญ๐˜ถ๐˜ช๐˜ฅ๐˜ญ๐˜บโ€”๐˜ฏ๐˜ข๐˜ท๐˜ช๐˜จ๐˜ข๐˜ต๐˜ช๐˜ฏ๐˜จ ๐˜ข๐˜ฃ๐˜ด๐˜ต๐˜ณ๐˜ข๐˜ค๐˜ต ๐˜ค๐˜ฐ๐˜ฏ๐˜ค๐˜ฆ๐˜ฑ๐˜ต๐˜ด ๐˜ฆ๐˜ง๐˜ง๐˜ฐ๐˜ณ๐˜ต๐˜ญ๐˜ฆ๐˜ด๐˜ด๐˜ญ๐˜บ, ๐˜ง๐˜ณ๐˜ฆ๐˜ฆ ๐˜ง๐˜ณ๐˜ฐ๐˜ฎ ๐˜ณ๐˜ช๐˜จ๐˜ช๐˜ฅ ๐˜ญ๐˜ช๐˜ฏ๐˜จ๐˜ถ๐˜ช๐˜ด๐˜ต๐˜ช๐˜ค ๐˜ฃ๐˜ฐ๐˜ถ๐˜ฏ๐˜ฅ๐˜ข๐˜ณ๐˜ช๐˜ฆ๐˜ด. But current reasoning models remain constrained by discrete tokens, limiting their full

๐˜๐˜ถ๐˜ฎ๐˜ข๐˜ฏ๐˜ด ๐˜ต๐˜ฉ๐˜ช๐˜ฏ๐˜ฌ ๐˜ง๐˜ญ๐˜ถ๐˜ช๐˜ฅ๐˜ญ๐˜บโ€”๐˜ฏ๐˜ข๐˜ท๐˜ช๐˜จ๐˜ข๐˜ต๐˜ช๐˜ฏ๐˜จ ๐˜ข๐˜ฃ๐˜ด๐˜ต๐˜ณ๐˜ข๐˜ค๐˜ต ๐˜ค๐˜ฐ๐˜ฏ๐˜ค๐˜ฆ๐˜ฑ๐˜ต๐˜ด ๐˜ฆ๐˜ง๐˜ง๐˜ฐ๐˜ณ๐˜ต๐˜ญ๐˜ฆ๐˜ด๐˜ด๐˜ญ๐˜บ, ๐˜ง๐˜ณ๐˜ฆ๐˜ฆ ๐˜ง๐˜ณ๐˜ฐ๐˜ฎ ๐˜ณ๐˜ช๐˜จ๐˜ช๐˜ฅ ๐˜ญ๐˜ช๐˜ฏ๐˜จ๐˜ถ๐˜ช๐˜ด๐˜ต๐˜ช๐˜ค ๐˜ฃ๐˜ฐ๐˜ถ๐˜ฏ๐˜ฅ๐˜ข๐˜ณ๐˜ช๐˜ฆ๐˜ด. But current reasoning models remain constrained by discrete tokens, limiting their full
Edoardo Ponti (@pontiedoardo) 's Twitter Profile Photo

🚀 By *learning* to compress the KV cache in Transformer LLMs, we can generate more tokens for the same compute budget.

This unlocks *inference-time hyper-scaling*

For the same runtime or memory load, we can boost LLM accuracy by pushing reasoning even further!
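
One way to see why compressing the KV cache buys extra tokens: each generated token normally appends a key/value pair, so merging or evicting low-importance entries keeps attention cost and memory roughly constant as generation continues. Below is a toy eviction sketch to illustrate the budget argument, not the paper's learned compression method.

```python
# Toy illustration (NOT the paper's learned method): cap the KV cache at a
# fixed budget by evicting the entries that have received the least
# cumulative attention, so more tokens can be generated for the same memory.
import numpy as np

def evict_kv(keys: np.ndarray, values: np.ndarray,
             attn_mass: np.ndarray, budget: int):
    """keys/values: [seq, dim]; attn_mass: cumulative attention per position."""
    if keys.shape[0] <= budget:
        return keys, values, attn_mass
    keep = np.argsort(attn_mass)[-budget:]   # keep the most-attended entries
    keep.sort()                              # preserve temporal order
    return keys[keep], values[keep], attn_mass[keep]

# Usage: after each decoding step, append the new (k, v) pair, then call
# evict_kv(...) so the cache never exceeds `budget` entries.
```
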
ACLRollingReview (@reviewacl) 's Twitter Profile Photo

Dear ACL community, We are seeking emergency reviewers for the May cycle. Please indicate your availability (ASAP) if you can help review extra papers urgently (by the 24th of June AOE). Many thanks!

Manos Zaranis (@manoszaranis) 's Twitter Profile Photo

🚨Meet MF²: Movie Facts & Fibs: a new benchmark for long-movie understanding!
🤔Do you think your model understands movies?

Unlike existing benchmarks, MF² targets memorable events, emotional arcs 💔, and causal chains 🔗 — things humans recall easily, but even top models like
Guilherme Penedo (@gui_penedo) 's Twitter Profile Photo

We have finally released the 📝paper for 🥂FineWeb2, our large multilingual pre-training dataset.

Along with general (and exhaustive) multilingual work, we introduce a concept that can also improve English performance: deduplication-based upsampling, which we call rehydration.
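
Deduplication-based upsampling can be pictured as: deduplicate as usual, but let each surviving document's duplicate count decide how often it is repeated in the training mix. A minimal sketch under that reading; the actual FineWeb2 recipe may weight documents differently.

```python
# Sketch of dedup-based upsampling ("rehydration"): keep one copy per
# duplicate cluster, but repeat it in the training mix according to how many
# near-duplicates it had -- frequency on the web acts as a quality signal.
import math
from collections import Counter

def rehydrate(docs: list[str], max_repeats: int = 5) -> list[str]:
    counts = Counter(docs)                      # stand-in for near-dedup clusters
    mix = []
    for doc, n_dupes in counts.items():
        repeats = min(max_repeats, 1 + int(math.log2(n_dupes)))
        mix.extend([doc] * repeats)
    return mix

corpus = ["popular page"] * 8 + ["rare page"]
print(rehydrate(corpus))  # 'popular page' x4, 'rare page' x1
```
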
Olga Golovneva (@olganlp) 's Twitter Profile Photo

✨MTA was accepted at #COLM2025 ✨ Since our first announcement, we have updated the paper with scaling laws, new baselines, and more evaluations! Code is now available in our repo: github.com/facebookresear… Conference on Language Modeling

ACL 2025 (@aclmeeting) 's Twitter Profile Photo

🎉A reminder from ACL 2025: 🗣️ #InvitedTalk by Professor Luke Zettlemoyer. He'll be presenting on "Rethinking Pretraining: Data and Architecture." Dive into the foundations of large language models! #ACL2025NLP #NLProc 2025.aclweb.org/program/keynot…

Google India (@googleindia) 's Twitter Profile Photo

If you're a student in India - you've just been granted access to a FREE Gemini upgrade worth ₹19,500 for one year 🥳✨ Claim and get free access to Veo 3, Gemini in Google apps, and 2TB storage 🔗 goo.gle/freepro. Google Gemini App

Google DeepMind (@googledeepmind) 's Twitter Profile Photo

An advanced version of Gemini with Deep Think has officially achieved gold medal-level performance at the International Mathematical Olympiad. 🥇

It solved 5️⃣ out of 6️⃣ exceptionally difficult problems, involving algebra, combinatorics, geometry and number theory. Here's how 🧵
Diptesh Kanojia (@diptesh) 's Twitter Profile Photo

📢 Test Set RELEASED! 🚀 The test set for the #WMT25 Shared Task on QE-informed Segment-level Error Correction is now LIVE! It's time to put your MT error correction / APE methods to the test. Let's see how well they can correct machine translation! #NLProc #MT #WMT2025

Vilรฉm Zouhar (@zouharvi) 's Twitter Profile Photo

The 2025 MT Evaluation shared task brings together the strengths of the previous Metrics and Quality Estimation tasks under a single, unified evaluation framework. The following tasks are now open (deadline July 31st but participation has never been easier 🙂)

Markus Freitag (@markuseful) 's Twitter Profile Photo

Our Google Translate team is bringing a strong presence to #ACL2025 in Vienna this week! 🇦🇹 My group is excited to present several of our latest papers. 👇 Don't miss them!

Sweta Agrawal (@swetaagrawal20) 's Twitter Profile Photo

📢Shared task deadline extended: You now have a whole week to go (until August 6 AoE) to register and send us your submissions!!

Sundar Pichai (@sundarpichai) 's Twitter Profile Photo

Excited to make our best AI tools free for college students in the US + other select countries for a year - and to provide $1B in funding for education + research, including free AI and career training for every college student in America.
