Ian McKenzie (@irobotmckenzie)'s Twitter Profile
Ian McKenzie

@irobotmckenzie

ID: 1486021972362338304

Joined: 25-01-2022 17:06:04

7 Tweets

274 Followers

56 Following

Ian McKenzie (@irobotmckenzie):

Looking forward to seeing what people find! I think we could uncover some interesting and important properties of large language models.

Ethan Perez (@ethanjperez):

Some people have asked why we’d expect larger language models to do worse on tasks (inverse scaling). We train LMs to imitate internet text, an objective that is often misaligned with human preferences; if the data has issues, LMs will mimic those issues (especially larger ones). Examples: 🧵
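
To make the objective concrete, here is a minimal sketch (illustrative, not from the thread) of the pure-imitation training signal described above: the model is scored only on predicting the next token of internet text, with nothing in the loss about truthfulness or human preferences.

    # Minimal sketch of the imitation (next-token prediction) objective.
    # The toy "model" and sizes are stand-ins for a real transformer LM.
    import torch
    import torch.nn.functional as F

    vocab_size, seq_len, d_model = 100, 8, 32

    embed = torch.nn.Embedding(vocab_size, d_model)
    head = torch.nn.Linear(d_model, vocab_size)

    tokens = torch.randint(0, vocab_size, (1, seq_len))  # a snippet of "internet text"
    logits = head(embed(tokens[:, :-1]))                 # predict each next token

    # Cross-entropy against the actual next tokens: pure imitation, so any
    # biases in the data are reproduced, with no term for human preferences.
    loss = F.cross_entropy(logits.reshape(-1, vocab_size), tokens[:, 1:].reshape(-1))
    print(loss.item())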

Ethan Perez (@ethanjperez):

Inverse Scaling Prize Update: We got 43 submissions in Round 1 and will award prizes to 4 tasks! These tasks were insightful, diverse, & show approximate inverse scaling on models from Anthropic, OpenAI, @MetaAI, and @DeepMind. Full details at irmckenzie.co.uk/round1, 🧵 on winners:

Ethan Perez (@ethanjperez):

We’re awarding prizes to 7/48 submissions to the Inverse Scaling Prize Round 2! Tasks show inverse scaling on Anthropic, OpenAI, AI at Meta, and @DeepMind models, often even after training with human feedback. Details at irmckenzie.co.uk/round2 and 🧵 on winners:

Ethan Perez (@ethanjperez):

New paper on the Inverse Scaling Prize! We detail 11 winning tasks & identify 4 causes of inverse scaling. We discuss scaling trends with PaLM/GPT-4, including when scaling trends reverse for better & worse, showing that scaling trends can be misleading: arxiv.org/abs/2306.09479 🧵
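
As a rough illustration of what a reversing scaling trend means, the hypothetical helper below classifies a task's trend from the accuracies of increasingly large models; the function name and categories are my own, not taken from the paper.

    # Hypothetical helper: classify a scaling trend from per-model accuracies
    # ordered from smallest to largest model. Purely illustrative.
    def scaling_trend(accuracies: list[float]) -> str:
        diffs = [b - a for a, b in zip(accuracies, accuracies[1:])]
        if all(d <= 0 for d in diffs):
            return "inverse scaling"      # accuracy falls as models grow
        if all(d >= 0 for d in diffs):
            return "standard scaling"     # accuracy rises as models grow
        return "non-monotonic"            # e.g. U-shaped: the trend reverses

    print(scaling_trend([0.71, 0.63, 0.55, 0.48]))  # -> inverse scaling
    print(scaling_trend([0.48, 0.41, 0.52, 0.66]))  # -> non-monotonic (U-shaped)

A trend like the second example is why extrapolating from small models can mislead: the curve looks like inverse scaling until the largest models reverse it.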

Ian McKenzie (@irobotmckenzie):

Excited for our adversarial robustness work to be out! Classifier-based defenses will likely only become more important as time goes on.
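
For intuition, here is a minimal sketch of what a classifier-based defense can look like; `harm_classifier`, `generate`, and the threshold are hypothetical stand-ins, not the method or API from the paper.

    # Sketch of a classifier-based defense: a separate classifier screens
    # the prompt and the model's response, blocking anything it flags.
    def harm_classifier(text: str) -> float:
        # Stand-in scorer; a real defense would use a trained classifier.
        return 1.0 if "attack" in text.lower() else 0.0

    def generate(prompt: str) -> str:
        return f"[model response to: {prompt}]"  # stand-in for the LM

    def guarded_generate(prompt: str, threshold: float = 0.5) -> str:
        if harm_classifier(prompt) >= threshold:
            return "Request refused by input classifier."
        response = generate(prompt)
        if harm_classifier(response) >= threshold:
            return "Response withheld by output classifier."
        return response

    print(guarded_generate("Explain inverse scaling."))
    print(guarded_generate("Describe an attack."))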