Shashwat Goel (@shashwatgoel7) 's Twitter Profile
Shashwat Goel

@shashwatgoel7

Scaling supervision for AI.

PhD student @ELLISInst_Tue @MPI_IS

Here for the aha and haha moments

ID: 1277988007304261632

linkhttp://shash42.github.io calendar_today30-06-2020 15:31:10

474 Tweet

567 Followers

652 Following

Shashwat Goel (@shashwatgoel7) 's Twitter Profile Photo

Wow, generative evals show K2 not Grok 4 is what you should be hyped about ๐Ÿ˜ฎ Come chat with me about how you can convert your MCQ dataset to Generative evals ICML Conference. Also presenting this work at the (World Models) Metrics for Evaluating Understanding Workshop on Friday

Ilze Amanda Auzina (@amandailze) 's Twitter Profile Photo

Excited to be heading to ICML this year to present two projects, both as spotlights! ๐ŸŽ‰ Big thanks to my collaborators โ€” come say hi if you're around! #ICML2025 #ML

Shashwat Goel (@shashwatgoel7) 's Twitter Profile Photo

Presenting today at #ICML2025. To learn how to measure language model similarity, and it's effects on LLM as a Judge and Weak to Strong distillation, join our poster session: Today 11 am -1:30 pm, East Exhibition Hall A-B E-2411 w/ Ameya P. @ ICML 2025 joschkastrueber Ilze Amanda Auzina

Dulhan Jayalath (@dulhanjay) 's Twitter Profile Photo

Come and find me today at #ICML2025 and let's talk about speech ๐Ÿ’ฌ decoding from the brain and scaling brain-computer interfaces ๐Ÿค–. 11 am -1:30 pm, West Exhibition Hall, Poster W-415

Dulhan Jayalath (@dulhanjay) 's Twitter Profile Photo

Have a peek at our early work on non-invasive brain-to-text ๐Ÿง ๐Ÿ’ฌ where we decode phrases and sentences directly from the brain. Poster up all day tomorrow ๐Ÿ‘€ at the GenBio Workshop @ ICML25 at #ICML2025. arxiv.org/abs/2505.13446

Have a peek at our early work on non-invasive brain-to-text ๐Ÿง ๐Ÿ’ฌ where we decode phrases and sentences directly from the brain.

Poster up all day tomorrow ๐Ÿ‘€ at the <a href="/genbio_workshop/">GenBio Workshop @ ICML25</a>  at #ICML2025. 

arxiv.org/abs/2505.13446
Shashwat Goel (@shashwatgoel7) 's Twitter Profile Photo

Just today presented the bronze medal market (at 86%, coincidentally) to people at an ICML poster session on forecasting evaluations... Crazy OpenAI just shot past all the intermediate medals to gold. Next research wave will be about figuring out the new technique...

Shashwat Goel (@shashwatgoel7) 's Twitter Profile Photo

One weird things about harder and harder benchmarks is that validating these questions becomes even harder. Efforts like this that still do it are extremely valuable