Teknium (e/λ) (@teknium1)'s Twitter Profile
Teknium (e/λ)

@teknium1

Cofounder and Head of Post Training @NousResearch, prev @StabilityAI
Github: github.com/teknium1
HuggingFace: huggingface.co/teknium

ID: 1365020011123773442

Link: http://github.com/sponsors/teknium1 | Joined: 25-02-2021 19:25:11

38.38K Tweets

45.45K Followers

3.3K Following

Teknium (e/λ) (@teknium1):

With classifiers, what's the rule of thumb on label distribution? Do you want, e.g., on a binary classifier, two equally diverse distributions of 0s and 1s (an overall 50/50 split), or do you want it closer to the real-world expected distribution of the binary labels (maybe…
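(For context, a common way to get both behaviors from the same data is class weighting rather than resampling. A minimal, hedged sketch with scikit-learn; the data here is synthetic and all names are illustrative:)

```python
# Sketch: same imbalanced data, trained on the natural prior vs. an
# effective 50/50 split via class weights (no resampling needed).
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.utils.class_weight import compute_class_weight

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 8))
y = (rng.random(1000) < 0.1).astype(int)  # ~10% positives: a "real world" prior

# Option A: train on the natural distribution (matches deployment).
clf_natural = LogisticRegression().fit(X, y)

# Option B: reweight so 0s and 1s contribute equally, i.e. an effective
# 50/50 split. Shorthand: LogisticRegression(class_weight="balanced").
w = compute_class_weight("balanced", classes=np.array([0, 1]), y=y)
clf_balanced = LogisticRegression(class_weight={0: w[0], 1: w[1]}).fit(X, y)
```

(A common compromise is to train with balanced weights for recall on the rare class, then tune the decision threshold on a validation set drawn from the real-world distribution.)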

Teknium (e/λ) (@teknium1):

If an LLM can use web search/browse during evals, and the evals are on the internet… can't they just search up the test set and find the answer? Lol

Teknium (e/λ) (@teknium1):

Imo Cursor should just make a plan that's $15 a month to get access to all the features, and beyond that you just pay API pricing for everything. It seems silly to prepay your API prices in advance when API pricing exists specifically because you pay per use, and that varies.

Teknium (e/λ) (@teknium1):

If you were to annotate and label a post-training/instruct/chat dataset, what annotations do you think would be valuable for filtering, ablating, and testing subsets made out of them? For example, a label of "refusal" on every sequence would be useful to filter with to reduce…
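(As a concrete illustration of that filtering/ablation workflow with HF `datasets`; the annotation fields here, "refusal" and "domain", are hypothetical labels attached during annotation, not fields of any real dataset:)

```python
from datasets import Dataset

rows = [
    {"text": "Sure, here's how you do it...", "refusal": False, "domain": "code"},
    {"text": "I can't help with that.",       "refusal": True,  "domain": "safety"},
    {"text": "Step 1: factor the quadratic.", "refusal": False, "domain": "math"},
]
ds = Dataset.from_list(rows)

# Filter: drop labeled refusals to reduce over-refusal after finetuning.
no_refusals = ds.filter(lambda ex: not ex["refusal"])

# Ablate/test: build one subset per label value to train on and compare.
by_domain = {d: ds.filter(lambda ex, d=d: ex["domain"] == d)
             for d in set(ds["domain"])}
```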

Teknium (e/λ) (@teknium1):

I see all the stuff Janus and crew do with Opus and GPT-4 and I still ain't gotten psychosis. I must be doing something wrong..

emozilla (@theemozilla):

MoE money, MoE problems:

it's straight up bonkers that there is not a single finetune of llama 4. 

zero. zilch. nada. everything on the hub is a reupload.

trust me, I've spent the past several weeks trying with torchtune, torchtitan, hf -- anything. it literally just doesn't…
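(The tweet cuts off, but for reference, here is the vanilla transformers recipe one would expect to work; a hedged sketch only. The model ID is a placeholder, the dataset is a stub, and per the tweet this exact path currently fails on Llama 4's MoE checkpoints.)

```python
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer, Trainer,
                          TrainingArguments)

model_id = "your/llama-4-checkpoint"  # placeholder, not a real repo ID
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

def tokenize(ex):
    out = tok(ex["text"], truncation=True, max_length=2048)
    out["labels"] = out["input_ids"].copy()  # causal-LM objective
    return out

ds = Dataset.from_list([{"text": "instruction + response pair goes here"}])
ds = ds.map(tokenize, remove_columns=["text"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="out", per_device_train_batch_size=1),
    train_dataset=ds,
)
trainer.train()
```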
Teknium (e/λ) (@teknium1):

How long of a generation can Gemini 2.5 make? I need it to reorganize 3,200 lines of text and I don't think it can write that much lol

Teknium (e/λ) (@teknium1):

Answer: about 500 lines, before Gemini Chat just deletes its whole response and says you stopped it. Theoretically it can produce much longer outputs than anything except OAI's Deep Research, but the interface ruins it. Maybe the API is better.
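(A possible workaround, assuming API access: chunk the 3,200 lines to stay under the observed ~500-line ceiling and reorganize piecewise. `call_model` below is a hypothetical stand-in for whatever Gemini client is used; only the chunking logic is concrete.)

```python
def chunk_lines(lines, max_lines=400):
    """Yield consecutive slices comfortably below the ~500-line ceiling."""
    for i in range(0, len(lines), max_lines):
        yield lines[i:i + max_lines]

def call_model(prompt: str) -> str:
    # Hypothetical: swap in a real Gemini API call here.
    raise NotImplementedError

def reorganize(text: str) -> str:
    out = []
    for chunk in chunk_lines(text.splitlines()):
        out.append(call_model("Reorganize these lines:\n" + "\n".join(chunk)))
    return "\n".join(out)
```

(Caveat: this only helps if the reorganization is local to each chunk; a global reshuffle would need two passes, e.g. ask for a plan over the whole text first, then rewrite chunk by chunk.)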