Alex Ratner (@ajratner) Twitter Tweets • TwiCopy

Albert Ge

6 months ago

Online data mixing reduces training costs for foundation models, but faces challenges: ⚠️ Human-defined domains miss semantic nuances ⚠️ Limited eval accessibility ⚠️ Poor scalability Introducing 🎵R&B: first regroup data, then dynamically reweight domains during training!

thumb_up_off_alt93

chat_bubble_outline2

repeat19

shareShare

Alex Ratner

@ajratner

5 months ago

Scale alone is not enough for AI data. Quality and complexity are equally critical. Excited to support all of these for LLM developers with Snorkel AI Data-as-a-Service, and to share our new leaderboard! — Our decade-plus of research and work in AI data has a simple point:

thumb_up_off_alt142

chat_bubble_outline15

repeat33

shareShare

Snorkel AI

@snorkelai

5 months ago

Foundation models are great at public knowledge, but fall short on domain-specific tasks. 📋 That’s why we’re working with the brightest in their fields to build high-quality training data that actually makes AI useful. Want to learn more? 👉 snorkel.ai/expert-communi…

thumb_up_off_alt4

chat_bubble_outline0

repeat1

shareShare

Alex Ratner

@ajratner

5 months ago

Our decade of work on AI data development has always been about *accelerating* the subject matter expert - not replacing them! Where automation is possible- saturation has been reached. The key to real AI delta is expert knowledge - which all comes down to the amazing experts!!

thumb_up_off_alt3

chat_bubble_outline0

repeat1

shareShare

Snorkel AI

@snorkelai

5 months ago

Huge thanks to Nasdaq for featuring our Series D! 👏 We’re using this momentum to solve the toughest data challenges in enterprise AI—from evaluation to expert curation. Building AI? Let’s talk data. 📈 #SnorkelAI #AIInfrastructure #LLMs #DataAsAService

Huge thanks to <a href="/Nasdaq/">Nasdaq</a> for featuring our Series D! 👏

We’re using this momentum to solve the toughest data challenges in enterprise AI—from evaluation to expert curation.

Building AI? Let’s talk data. 📈

#SnorkelAI #AIInfrastructure #LLMs #DataAsAService

thumb_up_off_alt7

chat_bubble_outline1

repeat2

shareShare

Eric Glyman

@eglyman

5 months ago

Today, Ramp reached a new valuation: $16 billion Let the robots chase your receipts and close your books, so you can use your brain and build things. That's the way AI was meant to be.

Today, <a href="/tryramp/">Ramp</a> reached a new valuation: $16 billion

Let the robots chase your receipts and close your books, so you can use your brain and build things.

That's the way AI was meant to be.

thumb_up_off_alt1,1K

chat_bubble_outline63

repeat97

shareShare

Braden Hancock

@bradenjhancock

5 months ago

A frequent evaluation mistake I see: assuming you need orders of magnitude more data than you actually do. What different evaluation set sizes are good for:

thumb_up_off_alt5

chat_bubble_outline1

repeat1

shareShare

Snorkel AI

@snorkelai

5 months ago

Three days out! 👏 We're going live with AI leaders from Accenture US, QBE, @Comcast, BNY & more. 🔹 Expert data + agentic AI 🔹 Live demos 🔹 Real-world use cases 🔹 Fresh research RSVP: snorkel.ai/events/develop… #AgenticAI #SnorkelAI

Three days out! 👏 We're going live with AI leaders from <a href="/Accenture_US/">Accenture US</a>, <a href="/QBE/">QBE</a>, @Comcast, <a href="/BNYglobal/">BNY</a> & more.

🔹 Expert data + agentic AI
🔹 Live demos
🔹 Real-world use cases
🔹 Fresh research

RSVP: snorkel.ai/events/develop…

#AgenticAI #SnorkelAI

thumb_up_off_alt2

chat_bubble_outline1

repeat1

shareShare

Snorkel AI

@snorkelai

5 months ago

Highlights from Henry Kiss Ehrenberg’s theCUBE appearance on the future of AI: 🟦 Data strategy is key 🟦 Expert data drives real advantage 🟦 Trust & compliance are critical 👉 Full convo here: youtube.com/watch?v=Qjt-d9…

thumb_up_off_alt4

chat_bubble_outline0

repeat1

shareShare

Jon Saad-Falcon

@jonsaadfalcon

5 months ago

How can we close the generation-verification gap when LLMs produce correct answers but fail to select them? 🧵 Introducing Weaver: a framework that combines multiple weak verifiers (reward models + LM judges) to achieve o3-mini-level accuracy with much cheaper non-reasoning

thumb_up_off_alt204

chat_bubble_outline11

repeat56

shareShare

Alex Ratner

@ajratner

5 months ago

Very exciting work on using weak supervision for RL- closing the “generation-verification gap”!! Once again- principled approaches to labeling/data development are the keys!

thumb_up_off_alt20

chat_bubble_outline1

repeat7

shareShare

Mayee Chen

@mayeechen

5 months ago

LLMs often generate correct answers but struggle to select them. Weaver tackles this by combining many weak verifiers (reward models, LM judges) into a stronger signal using statistical tools from Weak Supervision—matching o3-mini-level accuracy with much cheaper models! 📊

thumb_up_off_alt223

chat_bubble_outline15

repeat33

shareShare

Alex Ratner

@ajratner

5 months ago

Excited for this!!

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Azalia Mirhoseini

@azaliamirh

5 months ago

Introducing Weaver, a test time scaling method for verification! Weaver shrinks the generation-verification gap through a low-overhead weak-to-strong optimization of a mixture of verifiers (e.g., LM judges and reward models). The Weavered mixture can be distilled into a tiny

thumb_up_off_alt169

chat_bubble_outline2

repeat35

shareShare

Snorkel AI

@snorkelai

5 months ago

We just dropped a benchmark dataset on Hugging Face to test AI agents on real-world insurance underwriting tasks—built with CPCU experts. Most models still struggle. Here’s how to evaluate them right: 🧠 Dataset: huggingface.co/datasets/snork…

thumb_up_off_alt23

chat_bubble_outline0

repeat4

shareShare

Alex Ratner

@ajratner

5 months ago

Excited for this!

thumb_up_off_alt10

chat_bubble_outline0

repeat1

shareShare

Jieyu Zhang

@jieyuzhang20

4 months ago

Tokenization kickstarts every Transformer pipeline—shaping how models digest data. Our latest work introduces semantic, grounded video tokenization, leveraging objectness cues to boost efficiency and performance of video understanding models.

thumb_up_off_alt15

chat_bubble_outline0

repeat2

shareShare

Snorkel AI

@snorkelai

4 months ago

Working with Amazon Web Services to push what’s possible in financial services: from AI agents for underwriting to data-driven copilots. When the data’s right, the system works. #SnorkelAI #AgenticAI #GenAI #AIinFinance

Working with <a href="/AWS/">Amazon Web Services</a> to push what’s possible in financial services: from AI agents for underwriting to data-driven copilots.

When the data’s right, the system works.

#SnorkelAI #AgenticAI #GenAI #AIinFinance

thumb_up_off_alt1

chat_bubble_outline0

repeat1

shareShare