vishal (@vishal_learner) Twitter Tweets • TwiCopy

vishal

@vishal_learner


Machine Learning. fast.ai community member. Will post about sports occasionally. #FlyEaglesFly

ID: 1694453866786336768

Link: https://www.youtube.com/@vishal_learner

Date: 23-08-2023 20:57:58

2.2K Tweets

260 Followers

1.1K Following

Tim Babb

@tr_babb

2 months ago

Likes: 1.1K · Replies: 9 · Reposts: 180

sorry my verdict on Grok-4 is that it is not better than Opus for coding, and not better than o3 for reasoning. I don't think it has been trained on benchmarks, but I think its brain is deep fried into a problem-solution mindset that doesn't extend to real-world situations...

Likes: 1.1K · Replies: 120 · Reposts: 73

vishal

@vishal_learner

2 months ago

Just published a blog post where I highlight 10 ideas that stood out to me from the first lesson and first three chapters of the course reader from the AI evals course taught by Hamel Husain and Shreya Shankar. vishalbakshi.github.io/blog/posts/202…

Likes: 26 · Replies: 3 · Reposts: 1

Shreya Shankar

@sh_reya

2 months ago

Really like this set of standout ideas. We say a million things in the course reader and I love hearing what sticks / what's practical

Likes: 26 · Replies: 0 · Reposts: 1

vishal

@vishal_learner

2 months ago

Cool to see the PyTorch PR that made printing a ModuleList much more concise/cleaner github.com/pytorch/pytorc…

Likes: 1 · Replies: 0 · Reposts: 0

Vlado Boza

@bozavlado

2 months ago

I made a simple tutorial on how to fine-tune LLMs using (almost) the same memory as needed for inference.

Likes: 583 · Replies: 5 · Reposts: 47

Omar Khattab

@lateinteraction

2 months ago

Nice. Late interaction on the document side, at the granularity of chunks. Just add it on the query side and do MaxSim and voila!

Likes: 77 · Replies: 2 · Reposts: 5
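
For readers unfamiliar with the MaxSim step mentioned in the tweet above, here is a minimal sketch of late-interaction scoring: for each query vector, take the best dot product against any document vector, then sum those maxima. The function names and the toy 2-d vectors are hypothetical and only for illustration; real systems use learned embeddings and batched tensor ops.

```python
def dot(u, v):
    """Plain dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def maxsim(query_vecs, doc_vecs):
    """Late-interaction score: sum over query vectors of the
    maximum similarity to any document vector."""
    return sum(max(dot(q, d) for d in doc_vecs) for q in query_vecs)


# Toy example (hypothetical vectors): two query vectors, two document vectors.
query = [[1, 0], [0, 1]]
document = [[1, 0], [0.5, 0.5]]
score = maxsim(query, document)
print(score)
```

Each query vector is matched independently, so a document can score well by covering different query vectors with different chunks or tokens.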

vishal

@vishal_learner

2 months ago

The PyTorch PR that changes "KB" to "KiB" in `torch.cuda.memory_summary()` because "we're talking powers of 2 not 10" github.com/pytorch/pytorc…

Likes: 1 · Replies: 0 · Reposts: 0
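
A small sketch of the distinction the PR title points at: "KiB" is the binary unit (1024 bytes, a power of 2), while "KB" is the decimal one (1000 bytes). The helper names below are hypothetical and are not part of PyTorch.

```python
KIB = 2 ** 10   # 1024 bytes, binary prefix
KB = 10 ** 3    # 1000 bytes, decimal prefix


def to_kib(n_bytes):
    """Convert a byte count to kibibytes (powers of 2)."""
    return n_bytes / KIB


def to_kb(n_bytes):
    """Convert a byte count to kilobytes (powers of 10)."""
    return n_bytes / KB


n = 2048
print(f"{n} bytes = {to_kib(n)} KiB = {to_kb(n)} KB")
```

The two units differ by about 2.4% at this scale, and the gap widens with each larger prefix (MiB vs MB, GiB vs GB).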

vishal

@vishal_learner

2 months ago

Been thinking about this, especially during the AI evals course.

Likes: 2 · Replies: 0 · Reposts: 0

Antoine Chaffin

@antoine_chaffin

2 months ago

Seeing ModernBERT and Ettin models being useful is heartwarming

Likes: 27 · Replies: 0 · Reposts: 3

vishal

@vishal_learner

2 months ago

me trying to understand a PyTorch PR but there's no .py files

Likes: 4 · Replies: 0 · Reposts: 0

vishal

@vishal_learner

2 months ago

TIL the term "footgun"

Likes: 3 · Replies: 1 · Reposts: 0

vishal

@vishal_learner

2 months ago

TIL the term "releng"

Likes: 2 · Replies: 0 · Reposts: 0

Benjamin Clavié

@bclavie

2 months ago

Summoning the wisdom of the crowd once again: At the moment, what’s the most GRPOable small (all definitions of small: 7B, sub-4B, sub-2B) model? Is it still the case that Qwen is always ready to learn while the others are more hit and miss?

Likes: 60 · Replies: 5 · Reposts: 1

vishal

@vishal_learner

2 months ago

1 minute of Sascha napping (he loves the fan)

Likes: 1 · Replies: 0 · Reposts: 0

Hamel Husain

@hamelhusain

2 months ago

We’ve extended enrollment in our **last** live cohort on AI Evals until the end of this week! Here’s the syllabus (2 lessons per week):
Week 1: Fundamentals & Lifecycle LLM Application Evaluation, Systematic Error Analysis
Week 2: Implementing Effective Evaluations,

Likes: 26 · Replies: 1 · Reposts: 4

vishal

@vishal_learner

2 months ago

how much do I have to pay per month to get back emoji reactions during discord voice calls?

Likes: 1 · Replies: 0 · Reposts: 0

vishal

@vishal_learner

2 months ago

TIL about super().__getstate__() via this PyTorch PR: github.com/pytorch/pytorc…

Likes: 2 · Replies: 0 · Reposts: 0
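
As a rough illustration of the `super().__getstate__()` pattern from the tweet above (not the code from the linked PR, which is not shown here): since Python 3.11, `object` provides a default `__getstate__()`, so a subclass can call it and then adjust the result. The `Counter` class and its `cache` field are hypothetical, and the version check is only there so the sketch also runs on older interpreters.

```python
import copy
import sys


class Counter:
    """Toy class (hypothetical) with one field that should not be serialized."""

    def __init__(self):
        self.count = 0
        self.cache = {"warm": True}  # transient data we want to drop

    def __getstate__(self):
        if sys.version_info >= (3, 11):
            # object.__getstate__() is the default added in Python 3.11.
            # Copy the result so the live instance __dict__ is not mutated.
            state = dict(super().__getstate__())
        else:
            # Older interpreters have no default to call; copy __dict__ directly.
            state = dict(self.__dict__)
        state.pop("cache", None)
        return state

    def __setstate__(self, state):
        self.__dict__.update(state)
        self.cache = {}  # rebuild the transient field empty


original = Counter()
original.count = 5
clone = copy.copy(original)  # copy and pickle both go through these hooks
print(clone.count, clone.cache)
```

Calling the parent implementation instead of reading `self.__dict__` directly means the subclass keeps working if a base class later customizes its own state.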
