Sahil Chaudhary (@csahil28) 's Twitter Profile
Sahil Chaudhary

@csahil28

Building @glaiveAI

ID: 1266759895

calendar_today14-03-2013 10:38:01

442 Tweet

4,4K Followers

575 Following

Erik Dunteman (@erikdunteman) 's Twitter Profile Photo

This will sound so trivial yet it's every first-time founder's biggest obstacle: You don't need to be a certain way as a founder. There's no correct archetype. Just have fun.

Nous Research (@nousresearch) 's Twitter Profile Photo

Introducing ๐‡๐ž๐ซ๐ฆ๐ž๐ฌ ๐Ÿ‘: The latest version in our Hermes series, a generalist language model ๐š๐ฅ๐ข๐ ๐ง๐ž๐ ๐ญ๐จ ๐ฒ๐จ๐ฎ. nousresearch.com/hermes3/ Hermes 3 is available in 3 sizes, 8, 70, and 405B parameters. Hermes has improvements across the board, but with particular

Introducing ๐‡๐ž๐ซ๐ฆ๐ž๐ฌ ๐Ÿ‘: The latest version in our Hermes series, a generalist language model ๐š๐ฅ๐ข๐ ๐ง๐ž๐ ๐ญ๐จ ๐ฒ๐จ๐ฎ.

nousresearch.com/hermes3/

Hermes 3 is available in 3 sizes, 8, 70, and 405B parameters. Hermes has improvements across the board, but with particular
Matt Shumer (@mattshumer_) 's Twitter Profile Photo

I'm excited to announce Reflection 70B, the worldโ€™s top open-source model. Trained using Reflection-Tuning, a technique developed to enable LLMs to fix their own mistakes. 405B coming next week - we expect it to be the best model in the world. Built w/ Glaive AI. Read on โฌ‡๏ธ:

I'm excited to announce Reflection 70B, the worldโ€™s top open-source model.

Trained using Reflection-Tuning, a technique developed to enable LLMs to fix their own mistakes.

405B coming next week - we expect it to be the best model in the world.

Built w/ <a href="/GlaiveAI/">Glaive AI</a>.

Read on โฌ‡๏ธ:
Alex Volkov (Thursd/AI) (@altryne) 's Twitter Profile Photo

This from Matt Shumer and Sahil Chaudhary (Glaive AI ) is insane! A LLama 70B finetune that has reflection baked into it's weights, does CoT, Reflection and then spits out great answers, beats Sonnet on benchmarks!? This is the #breakingNews I wanted to share today on

Yuchen Jin (@yuchenj_uw) 's Twitter Profile Photo

We Hyperbolic now serve Reflection 70B by Matt Shumer in FP16! ๐Ÿค–๐Ÿ”ฅ > Use our API or playground to play w/ it > Itโ€™s free for 1 week โ€“ perfect for stress testing our infra! > Integrate with OpenRouter soon > Running on 4xH100s (and ready to scale with demand, because

We <a href="/hyperbolic_labs/">Hyperbolic</a> now serve Reflection 70B by <a href="/mattshumer_/">Matt Shumer</a> in FP16! ๐Ÿค–๐Ÿ”ฅ

&gt; Use our API or playground to play w/ it
&gt; Itโ€™s free for 1 week โ€“ perfect for stress testing our infra!
&gt; Integrate with <a href="/OpenRouterAI/">OpenRouter</a> soon
&gt; Running on 4xH100s (and ready to scale with demand, because
Sahil Chaudhary (@csahil28) 's Twitter Profile Photo

I want to address the confusion and valid criticisms that this has caused in the community. I am currently investigating what happened that led to this and will share a transparent summary as soon as possible. There are two areas Iโ€™d like to address, which I am investigating: -

Guillermo Rauch (@rauchg) 's Twitter Profile Photo

Some people are born American but in remote countries and under different nationalities. They donโ€™t have a passport yet but theyโ€™re inevitably drawn usually by entrepreneurship or discovery. America is a mindset and a spiritual experience not a country.

naklecha (@naklecha) 's Twitter Profile Photo

today, i'm excited to release a reinforcement learning guide that carefully explains the intuition and implementation details behind every single fundamental algorithm in the field. enjoy :) naklecha.com/reinforcement-โ€ฆ

today, i'm excited to release a reinforcement learning guide that carefully explains the intuition and implementation details behind every single fundamental algorithm in the field. enjoy :)

naklecha.com/reinforcement-โ€ฆ
Nous Research (@nousresearch) 's Twitter Profile Photo

Introducing DeepHermes-3 Preview, a new LLM that unifies reasoning and intuitive language model capabilities. huggingface.co/NousResearch/Dโ€ฆ DeepHermes 3 is built from the Hermes 3 datamix, with new reasoning data, creating a model that can toggle on and off long chains of thought for

Introducing DeepHermes-3 Preview, a new LLM that unifies reasoning and intuitive language model capabilities.

huggingface.co/NousResearch/Dโ€ฆ

DeepHermes 3 is built from the Hermes 3 datamix, with new reasoning data, creating a model that can toggle on and off long chains of thought for
Y Combinator (@ycombinator) 's Twitter Profile Photo

๐Ÿท@PigDev_ is an API to operate Windows Apps with AI, making it easy to automate legacy applications across healthcare, manufacturing, finance, and more. It's like Operator, for Windows. ycombinator.com/launches/Mfp-pโ€ฆ Congrats on the launch, Erik Dunteman!

Erik Dunteman (@erikdunteman) 's Twitter Profile Photo

After weeks of talking to users and iterating, I'm excited to launch three new things: - Pig Chat: drive your computer with a chat UI, like Operator - Agent API: the same batteries-included chat agent, via API - Open access - You can use it, today, at pig.dev

Glaive AI (@glaiveai) 's Twitter Profile Photo

Today, we are releasing a synthetic dataset containing 22M+ reasoning traces for general purpose prompts across various domains. We noticed a lack of large datasets containing reasoning traces for diverse non code/math topics like social and natural sciences, creative writing,

Today, we are releasing a synthetic dataset containing 22M+ reasoning traces for general purpose prompts across various domains. We noticed a lack of large datasets containing reasoning traces for diverse non code/math topics like social and natural sciences, creative writing,
Erik Dunteman (@erikdunteman) 's Twitter Profile Photo

Announcing: Muscle Mem ๐Ÿ’ช Muscle Mem is a cache system for AI agents, allowing them to learn and efficiently replay complex behaviors. This allows expensive LLM calls to be entirely removed from the hot path, during repetitive tasks. youtu.be/hToIl9PRyRk

Sharif Shameem (@sharifshameem) 's Twitter Profile Photo

Friendly reminder that itโ€™s very much possible to outperform frontier reasoning models like o3 on narrowly defined tasks for your product You donโ€™t have to be limited by o3 and Sonnet, you can make your product much better!

Friendly reminder that itโ€™s very much possible to outperform frontier reasoning models like o3 on narrowly defined tasks for your product

You donโ€™t have to be limited by o3 and Sonnet, you can make your product much better!
evan conrad (@evanjconrad) 's Twitter Profile Photo

it's so fun when a company is doing far better than external perception and everyone who works there has the shared secret of knowing they are going to crush it