Svitlana Vakulenko 🇺🇦 (@svakulenk0)'s Twitter Profile
Svitlana Vakulenko 🇺🇦

@svakulenk0

Machine Learning Scientist in Amazon AGI - Web Information (my own opinions here) RT==endorsements #convsearch

ID: 955599008

Link: http://svakulenk0.github.io
Joined: 18-11-2012 14:34:45

5.5K Tweets

1.1K Followers

749 Following

Eliya Habba (@eliyahabba):

Care about LLM evaluation? 🤖🤔 We bring you 🕊️ DOVE, a massive (250M!) collection of LLM outputs on different prompts, domains, tokens, models... Join our community effort to expand it with YOUR model predictions & become a co-author!

Andriy Burkov (@burkov):

In this paper, the authors show that an LLM can learn to use a search engine using reinforcement learning, which is especially cool when, to give the right answer, the model needs to run multiple searches, one based on the result of another: arxiv.org/pdf/2503.09516

Their code
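
The chained-search behavior the paper describes reduces to a simple loop: the model either emits a search query or commits to an answer, and each new result is appended to the context that conditions the next step. Below is a minimal, hypothetical sketch of that loop; `llm()` and `search()` are illustrative placeholders, not the paper's actual interface.

```python
# Hypothetical sketch of an LLM running chained searches, as described in
# the paper above (arxiv.org/pdf/2503.09516). llm() and search() are
# illustrative placeholders, not the authors' actual API.

def search(query: str) -> str:
    """Placeholder: return top search-result snippets for a query."""
    raise NotImplementedError

def llm(prompt: str) -> str:
    """Placeholder: return the model's next action or final answer."""
    raise NotImplementedError

def answer_with_search(question: str, max_hops: int = 4) -> str:
    context = ""
    for _ in range(max_hops):
        step = llm(
            f"Question: {question}\n"
            f"Evidence so far:\n{context}\n"
            "Reply with either 'SEARCH: <query>' or 'ANSWER: <answer>'."
        )
        if step.startswith("ANSWER:"):
            return step[len("ANSWER:"):].strip()
        query = step[len("SEARCH:"):].strip()
        context += search(query) + "\n"  # the next query can build on this result
    # Out of search budget: force a best-effort answer from gathered evidence.
    return llm(f"Question: {question}\nEvidence:\n{context}\nGive the best answer.")
```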

Femke Plantinga (@femke_plantinga):

A great AI application starts with choosing the right embedding type.

Here are 6 embedding types and when you should use them (a toy sketch of the first two follows the list):

• Sparse embeddings: weaviate.io/developers/wea…
• Dense embeddings: weaviate.io/developers/wea…
• Quantized embeddings: weaviate.io/developers/wea…
• Binary
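
The list above is truncated in this archive, but the first two types are easy to contrast concretely. The sketch below is a hand-rolled illustration, not Weaviate's API: a sparse embedding stores only the non-zero (index, weight) pairs over a vocabulary, while a dense embedding fills every dimension (here faked with word hashing standing in for a trained encoder).

```python
# Toy illustration of sparse vs. dense embeddings; hand-rolled, not
# Weaviate's API. Real dense embeddings come from a trained encoder.
from collections import Counter

def sparse_embed(text: str, vocab: list[str]) -> dict[int, float]:
    """Sparse embedding: most dimensions are zero, so store only the
    non-zero (index, weight) pairs, e.g. raw term counts."""
    counts = Counter(text.lower().split())
    return {i: float(counts[w]) for i, w in enumerate(vocab) if counts[w]}

def dense_embed(text: str, dim: int = 8) -> list[float]:
    """Dense embedding: every dimension holds a value. Hashing words
    into buckets stands in for a trained encoder here."""
    vec = [0.0] * dim
    for word in text.lower().split():
        vec[hash(word) % dim] += 1.0
    return vec

vocab = ["search", "vector", "embedding", "index"]
print(sparse_embed("vector search with a vector index", vocab))
print(dense_embed("vector search with a vector index"))
```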

Victoria Slocum (@victorialslocum):

A solution for better search results:
Late interaction preserves contextual nuances that pooling (as in most dense vector retrieval models) destroys. How? 

There are three different ways retrieval models handle the "interaction" between your query and potential documents:

1️⃣
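
The enumeration is cut off in this archive, but the pooling-vs-late-interaction contrast itself is easy to show. The toy NumPy sketch below (in the style of ColBERT's MaxSim, with random vectors standing in for a real encoder) scores the same query-document pair both ways: pooling collapses each side to one vector before a single dot product, while late interaction keeps per-token vectors and lets each query token pick its best-matching document token.

```python
# Toy contrast of pooling vs. late interaction (MaxSim-style scoring).
# Random vectors stand in for a real token encoder.
import numpy as np

query_tokens = np.random.rand(4, 8)   # 4 query tokens, 8-dim each
doc_tokens = np.random.rand(20, 8)    # 20 document tokens

# Pooling (typical dense retrieval): collapse each side to one vector
# first, losing per-token context, then take a single dot product.
pooled_score = query_tokens.mean(0) @ doc_tokens.mean(0)

# Late interaction: keep all token vectors and compare them only at
# scoring time. Each query token picks its best-matching doc token
# (MaxSim), and the per-token maxima are summed.
sim = query_tokens @ doc_tokens.T     # (4, 20) token-level similarities
late_score = sim.max(axis=1).sum()

print(f"pooled: {pooled_score:.3f}  late interaction: {late_score:.3f}")
```

Taking the max over document tokens is what lets each query token keep its own contextual match instead of being averaged away.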

elvis (@omarsar0):

// Tracing LLM Outputs Back to Trillions of Training Tokens //

Presents OLMOTRACE, the first system that can trace LLM outputs verbatim back to their entire multi-trillion-token training sets in real time!
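
As a toy stand-in for the core matching idea (not the paper's method at trillion-token scale, which relies on precomputed indices), the brute-force sketch below finds verbatim spans of a model output inside a small corpus, longest spans first.

```python
# Toy version of the idea behind OLMOTRACE: find verbatim spans of a
# model's output inside its training corpus. The real system works over
# multi-trillion-token sets with precomputed indices; this brute-force
# sketch only illustrates the matching, not the engineering.

def verbatim_spans(output: str, corpus: list[str], min_words: int = 4):
    """Yield (span, document_index) for every output n-gram of at least
    min_words words that appears verbatim in some corpus document."""
    words = output.split()
    for n in range(len(words), min_words - 1, -1):  # longest spans first
        for start in range(len(words) - n + 1):
            span = " ".join(words[start:start + n])
            for doc_id, doc in enumerate(corpus):
                if span in doc:
                    yield span, doc_id

corpus = ["the quick brown fox jumps over the lazy dog", "hello world"]
for span, doc_id in verbatim_spans("a quick brown fox jumps high", corpus):
    print(doc_id, "->", span)
```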

Jeff Dean (@jeffdean):

Someone just reminded me of this lecture I gave in 2009 that described the evolution of Google Search from 1999 to 2009. People who are interested in how our search systems work might find this interesting. It touches on disk-based serving systems, in-memory indices,

Svitlana Vakulenko 🇺🇦 (@svakulenk0):

A gentle reminder: two weeks left to submit to SCAI’25. Don’t miss the opportunity to present your work in Montreal this summer! scai.info #IJCAI2025

Rohan Paul (@rohanpaul_ai):

Training on wrong answers outpaces training on correct ones.

10 times more learning emerges from plausible errors than from truths.

Large language models refine their accuracy slowly when they learn only from correct examples.

This paper introduces Likra, which trains one

Jacqueline He (@jcqln_h):

LMs often output answers that sound right but aren’t supported by input context. This is intrinsic hallucination: the generation of plausible, but unsupported content.

We propose Precise Information Control (PIC): a task requiring LMs to ground only on given verifiable claims.

Jiaxin Wen @ICLR2025 (@jiaxinwen22):

New Anthropic research: We elicit capabilities from pretrained models using no external supervision, often competitive or better than using human supervision.

Using this approach, we are able to train a Claude 3.5-based assistant that beats its human-supervised counterpart.

Pankaj Gupta (@pankaj):

1/24 I’m thrilled to share what my co-founder Gilad Mishne and I’ve been cooking up over the past year. Check out Yupp – a fun and easy way for anyone to discover, compare and get the best answers across the latest AIs, all for free! Yes, even the most powerful pro models.

Sebastian Raschka (@rasbt):

Feels good to be back coding! Just picked a fun one from my “someday” side project list and finally added a KV cache to the LLMs From Scratch repo: github.com/rasbt/LLMs-fro…
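
For readers who haven't seen one, here is a minimal NumPy sketch of the idea (hand-rolled single-head attention, not the code from the repo): cache the key/value projections of already-generated tokens so each decoding step only computes K and V for the one new token.

```python
# Minimal sketch of a KV cache for autoregressive decoding; hand-rolled
# attention in NumPy, not the actual code from the LLMs From Scratch repo.
import numpy as np

class KVCache:
    """Stores keys/values of already-processed tokens so each decoding
    step only computes K and V for the single new token."""
    def __init__(self):
        self.keys, self.values = [], []

    def append(self, k, v):
        self.keys.append(k)
        self.values.append(v)
        return np.stack(self.keys), np.stack(self.values)

def attend(q, K, V):
    """Single-head attention of one query over all cached keys/values."""
    scores = K @ q / np.sqrt(q.shape[-1])
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    return weights @ V

d = 8
cache = KVCache()
for step in range(5):                  # one new token per decoding step
    q = np.random.rand(d)              # projections of the new token only
    k, v = np.random.rand(d), np.random.rand(d)
    K, V = cache.append(k, v)          # past K/V are reused, not recomputed
    out = attend(q, K, V)
print("context length after 5 steps:", K.shape[0])
```

The payoff: each decoding step recomputes projections for one token instead of the whole prefix.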

Rohan Paul (@rohanpaul_ai):

AI just learned to fine-tune itself between questions.

MIT introduces SEAL, a framework enabling LLMs to self-edit and update their weights via reinforcement learning, all by itself.

LLMs consume whatever data they are given during training and then stay frozen after pretraining.
SEAL teaches

Thao Nguyen (@thao_nguyen26):

Web data, the “fossil fuel of AI”, is being exhausted. What’s next?🤔
We propose Recycling the Web to break the data wall of pretraining via grounded synthetic data. It is more effective than standard data filtering methods, even with multi-epoch repeats!

arxiv.org/abs/2506.04689