Leo Boytsov (@srchvrs)'s Twitter Profile
Leo Boytsov

@srchvrs

Sr. Research Scientist @AWS Labs (PhD @LTIatCMU) working on (un)natural language processing, speaking πtorch & C++. Opinions sampled from MY OWN 100T param LM.

ID: 87473622

Link: http://searchivarius.org/about | Joined: 04-11-2009 16:22:33

24.24K Tweets

8.8K Followers

1.1K Following

Leo Boytsov (@srchvrs)'s Twitter Profile Photo

"Whenever someone asks me if RL works, I say it doesn't, and 70% of the time I am right." alexirpan.com/2018/02/14/rl-…

Leo Boytsov (@srchvrs)'s Twitter Profile Photo

This is a rather blockbuster piece of news: the Hugging Face library is dropping support for both Jax and Tensorflow. linkedin.com/posts/lysandre…

Leo Boytsov (@srchvrs)'s Twitter Profile Photo

I find it quite upsetting that people use the terms LLM or PLM (pretrained LM) to denote every possible pre-trained Transformer model, including BERT, T5, etc. This reminds me of the good old days when DNN was often used to denote a multilayer feedforward network (or at least this was

ACLRollingReview (@reviewacl)'s Twitter Profile Photo

Dear ACL community, we are seeking emergency reviewers for the May cycle. Please indicate your availability (ASAP) if you can help review extra papers urgently (by the 24th of June AoE). Many thanks!

Leo Boytsov (@srchvrs)'s Twitter Profile Photo

This is something I have suspected for a while, though I do not think we have much supporting evidence yet. Anyway, it looks like the efficiency of generation plays an important role in users' perception of quality.