Saumya Malik (@saumyamalik44)'s Twitter Profile
Saumya Malik

@saumyamalik44

predoc at @allen_ai | prev princeton cs '24

ID: 1839037231400665088

Joined: 25-09-2024 20:20:27

8 Tweets

98 Followers

26 Following

Ai2 (@allen_ai)'s Twitter Profile Photo

Meet Tülu 3 -- a set of state-of-the-art instruct models with fully open data, eval code, and training algorithms.

We invented new methods for fine-tuning language models with RL and built upon best practices in the community to scale synthetic instruction and preference data.
Hamish Ivison (@hamishivi)'s Twitter Profile Photo

li'l holiday project from the tulu team :)

Scaling up the Tulu recipe to 405B works pretty well! We mainly see this as confirmation that open-instruct scales to large-scale training -- more exciting and ambitious things to come!
Ai2 (@allen_ai)'s Twitter Profile Photo

Announcing OLMo 2 32B: the first fully open model to beat GPT 3.5 & GPT-4o mini on a suite of popular, multi-skill benchmarks.

Comparable to best open-weight models, but a fraction of training compute. When you have a good recipe, ✨ magical things happen when you scale it up!
Nathan Lambert (@natolambert)'s Twitter Profile Photo

A very exciting day for open-source AI! We're releasing our biggest open source model yet -- OLMo 2 32B -- and it beats the latest GPT 3.5, GPT 4o mini, and leading open weight models like Qwen and Mistral. As usual, all data, weights, code, etc. are available.

For a long time,