Vaibhav Adlakha (@vaibhav_adlakha)'s Twitter Profile
Vaibhav Adlakha

@vaibhav_adlakha

PhD candidate @MILAMontreal and @mcgillu | RA @iitdelhi | Maths & CS undergrad from @IITGuwahati | Interested in #NLProc

ID: 3536420677

Joined: 12-09-2015 09:20:05

268 Tweets

889 Followers

1.1K Following

Alexandra Chronopoulou (@alexandraxron)'s Twitter Profile Photo

We are organizing Repl4NLP 2025 along with Freda Shi, Giorgos Vernikos, Vaibhav Adlakha, Xiang Lorraine Li, and Bodhisattwa Majumder. The workshop will be co-located with NAACL 2025 in Albuquerque, New Mexico, and we plan to have a great panel of speakers. Consider submitting your coolest work!

Xing Han Lu (@xhluca)'s Twitter Profile Photo

Glad to see BM25S (bm25s.github.io) has been downloaded 1M times on PyPI 🎉

Numbers aside, it makes me happy to hear the positive experience from friends working on retrieval. It's good to know that people near me are enjoying it!

Discussion: github.com/xhluca/bm25s/d…
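As a rough illustration of what a BM25 retriever like BM25S computes under the hood, here is a from-scratch sketch of the scoring function (Lucene-style IDF variant); this is illustrative only, not BM25S's actual API, tokenization, or defaults:

    import math
    from collections import Counter

    def bm25_scores(query, docs, k1=1.5, b=0.75):
        """Score whitespace-tokenized docs against a query with BM25."""
        tokenized = [doc.lower().split() for doc in docs]
        N = len(tokenized)
        avgdl = sum(len(d) for d in tokenized) / N
        # Document frequency of each query term.
        df = {t: sum(1 for d in tokenized if t in d) for t in set(query.lower().split())}
        scores = []
        for d in tokenized:
            tf = Counter(d)
            score = 0.0
            for t, n_t in df.items():
                idf = math.log((N - n_t + 0.5) / (n_t + 0.5) + 1)  # always non-negative
                denom = tf[t] + k1 * (1 - b + b * len(d) / avgdl)
                score += idf * tf[t] * (k1 + 1) / denom
            scores.append(score)
        return scores

    docs = ["a cat is a feline", "a dog is a canine", "purple is a color"]
    print(bm25_scores("does the cat purr", docs))  # highest score for the cat doc
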
Sai Rajeswar (@rajeswarsai)'s Twitter Profile Photo

We're happy to report that our paper "BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks" has been accepted at ICLR and is now available at 🧐 arxiv.org/abs/2412.04626. Congratulations to my dream team 📷👍 ServiceNow Research and Mila - Institut québécois d'IA! #ICLR2025

Ahmed Masry (@ahmed_masry97)'s Twitter Profile Photo

Happy to announce AlignVLM📏: a novel approach to bridging vision and language latent spaces for multimodal understanding in VLMs! 🌍📄🖼️

🔗 Read the paper: arxiv.org/abs/2502.01341
🧵👇 Thread
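From the abstract, the core idea is keeping projected vision features inside the LLM's text embedding space. Below is a minimal PyTorch sketch of one such connector, mapping each vision feature to a convex combination of the LLM's token embeddings; the class name and shapes are illustrative assumptions, and the paper's actual ALIGN module and training recipe may differ:

    import torch
    import torch.nn as nn

    class ConvexEmbeddingConnector(nn.Module):
        """Map vision features onto convex combinations of LLM token embeddings,
        so projected features always land inside the language embedding space."""
        def __init__(self, vision_dim, llm_embed):
            super().__init__()
            self.proj = nn.Linear(vision_dim, llm_embed.shape[0])  # feature -> vocab logits
            self.register_buffer("llm_embed", llm_embed)           # (vocab, llm_dim), frozen here

        def forward(self, vision_feats):
            weights = self.proj(vision_feats).softmax(dim=-1)  # convex weights over the vocab
            return weights @ self.llm_embed                    # (..., llm_dim)

    # Toy shapes: 2 images x 16 patches of 512-d features; 1000-token, 768-d embedding table.
    connector = ConvexEmbeddingConnector(512, torch.randn(1000, 768))
    print(connector(torch.randn(2, 16, 512)).shape)  # torch.Size([2, 16, 768])
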
Mushtaq Bilal, PhD (@mushtaqbilalphd)'s Twitter Profile Photo

Meta illegally downloaded 80+ terabytes of books from LibGen, Anna's Archive, and Z-Library to train their AI models.

In 2010, Aaron Swartz downloaded only 70 GB of articles from JSTOR (0.0875% of Meta's haul). He faced a $1 million fine and 35 years in jail. He took his own life in 2013.
Vaibhav Adlakha (@vaibhav_adlakha)'s Twitter Profile Photo

Check out the new MMTEB benchmark 🙌 if you are looking for an extensive, reproducible, and open-source evaluation of text embedders!
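For context, MMTEB is served through the mteb package, so an evaluation run takes only a few lines. A minimal sketch, assuming the package's documented quickstart (exact names may drift between versions):

    import mteb
    from sentence_transformers import SentenceTransformer

    # Any model exposing encode() works; MiniLM is just a small, fast example.
    model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")

    # Pick one small task here; MMTEB itself spans hundreds of tasks and many languages.
    tasks = mteb.get_tasks(tasks=["Banking77Classification"])
    results = mteb.MTEB(tasks=tasks).run(model, output_folder="results")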

Vaibhav Adlakha (@vaibhav_adlakha)'s Twitter Profile Photo

LLM agents can be used with harmful and malicious intent. 🤬 Check out SafeArena for a comprehensive evaluation of LLM agents! 🛠️

Jacob Springer (@jacspringer)'s Twitter Profile Photo

Training with more data = better LLMs, right? 🚨

False! Scaling language models by adding more pre-training data can decrease your performance after post-training!

Introducing "catastrophic overtraining." 🥁🧵+arXiv 👇

1/9
Vaibhav Adlakha (@vaibhav_adlakha)'s Twitter Profile Photo

Check out our comprehensive study and analysis of DeepSeek’s 🐳 reasoning chains! This opens a new dimension for analysing the inner workings of LLMs. Incredible effort by our research group!

Amirhossein Kazemnejad (@a_kazemnejad)'s Twitter Profile Photo

Introducing nanoAhaMoment: a Karpathy-style, single-file RL-for-LLMs library (<700 lines)

- super hackable
- no TRL / Verl, no abstraction💆‍♂️
- Single GPU, full param tuning, 3B LLM
- Efficient (R1-zero countdown < 10h)

Comes with a from-scratch, fully spelled-out YT video [1/n]
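For a sense of the core update such single-file trainers implement, here is a GRPO-flavored REINFORCE step in PyTorch; the function name and shapes are illustrative assumptions, not the repo's actual code:

    import torch

    def grpo_style_loss(logprobs, mask, rewards):
        """REINFORCE with a group-mean baseline over G sampled completions of one prompt.

        logprobs: (G, T) log-probs of the sampled completion tokens
        mask:     (G, T) 1.0 for real tokens, 0.0 for padding
        rewards:  (G,)   scalar reward per completion (e.g. 1.0 if the countdown answer verifies)
        """
        # Advantage: how much better each completion did than the group average.
        adv = (rewards - rewards.mean()) / (rewards.std() + 1e-6)
        # Push up log-probs of above-average completions, down the rest.
        per_token = -(adv[:, None] * logprobs) * mask
        return per_token.sum() / mask.sum()

    # Toy example: 4 completions, 8 tokens each, only the first answered correctly.
    logprobs = torch.randn(4, 8, requires_grad=True)
    loss = grpo_style_loss(logprobs, torch.ones(4, 8), torch.tensor([1.0, 0.0, 0.0, 0.0]))
    loss.backward()  # in a real trainer the gradient flows into the policy
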
Xing Han Lu (@xhluca)'s Twitter Profile Photo

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories  

We are releasing the first benchmark to evaluate how well automatic evaluators, such as LLM judges, can assess web agent trajectories.

We find that rule-based evals underreport success rates, and
🇺🇦 Dzmitry Bahdanau (@dbahdanau)'s Twitter Profile Photo

I am excited to open-source PipelineRL - a scalable async RL implementation with in-flight weight updates. Why wait until your bored GPUs finish all sequences? Just update the weights and continue inference!

Code: github.com/ServiceNow/Pip…
Blog: huggingface.co/blog/ServiceNo…
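The in-flight update pattern itself is easy to sketch: generation workers keep decoding and simply pick up newer weights between chunks instead of idling until a batch boundary. A toy asyncio sketch (illustrative only, not PipelineRL's actual code):

    import asyncio
    import itertools

    class WeightStore:
        """Trainer publishes new weights; generators pick them up between chunks."""
        def __init__(self):
            self.version, self.weights = 0, {}

        def publish(self, weights):
            self.version += 1
            self.weights = weights

    async def trainer(store):
        for step in itertools.count(1):
            await asyncio.sleep(0.3)  # stand-in for an optimizer step
            store.publish({"step": step})

    async def generator(store, gen_id):
        seen = 0
        for _ in range(10):
            await asyncio.sleep(0.1)  # stand-in for decoding a chunk of tokens
            if store.version > seen:  # in-flight update: swap weights mid-sequence
                seen = store.version
                print(f"gen{gen_id}: picked up weights v{seen} without restarting")

    async def main():
        store = WeightStore()
        trainer_task = asyncio.create_task(trainer(store))
        await asyncio.gather(*(generator(store, i) for i in range(2)))
        trainer_task.cancel()

    asyncio.run(main())
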
Ziling Cheng (@ziling_cheng)'s Twitter Profile Photo

Do LLMs hallucinate randomly? Not quite. Our #ACL2025 (Main) paper shows that hallucinations under irrelevant contexts follow a systematic failure mode — revealing how LLMs generalize using abstract classes + context cues, albeit unreliably.

📎 Paper: arxiv.org/abs/2505.22630

1/n
Benno Krojer (@benno_krojer)'s Twitter Profile Photo

Excited to share the results of my internship research with AI at Meta, as part of a larger world modeling release!

What subtle shortcuts are VideoLLMs taking on spatio-temporal questions?

And how can we instead curate shortcut-robust examples at a large scale?

Details 👇🔬
Xing Han Lu (@xhluca)'s Twitter Profile Photo

"Build the web for agents, not agents for the web" This position paper argues that rather than forcing web agents to adapt to UIs designed for humans, we should develop a new interface optimized for web agents, which we call Agentic Web Interface (AWI).

"Build the web for agents, not agents for the web"

This position paper argues that rather than forcing web agents to adapt to UIs designed for humans, we should develop a new interface optimized for web agents, which we call Agentic Web Interface (AWI).
Verna Dankers (@vernadankers)'s Twitter Profile Photo

I miss Edinburgh and its wonderful people already!! Thanks to Tal Linzen and Edoardo Ponti for inspiring discussions during the viva! I'm now exchanging Arthur's Seat for Mont Royal to join Siva Reddy's wonderful lab at Mila - Institut québécois d'IA 🤩

cohere (@cohere)'s Twitter Profile Photo

Cohere is excited to announce our new office in Montreal, QC! We look forward to contributing to the local AI landscape, collaborating with new and existing partners in the city, and growing our Montreal-based team. cohere.com/blog/montreal-…