vishal (@vishal_learner) 's Twitter Profile
vishal

@vishal_learner

Machine Learning. fast.ai community member. Will post about sports occasionally. #FlyEaglesFly

ID: 1694453866786336768

linkhttps://www.youtube.com/@vishal_learner calendar_today23-08-2023 20:57:58

2,2K Tweet

260 Followers

1,1K Following

Taelin (@victortaelin) 's Twitter Profile Photo

sorry my verdict on Grok-4 is that it is not better than Opus for coding, and not better for o3 for reasoning. I don't think it has been trained on benchmarks, but I think its brain is deep friend into a problem-solution mindset that doesn't extend to real-world situations...

vishal (@vishal_learner) 's Twitter Profile Photo

Just published a blog post where I highlight 10 ideas that stood out to me from the first lesson and first three chapters of the course reader from the AI evals course taught by Hamel Husain and Shreya Shankar. vishalbakshi.github.io/blog/posts/202…

Just published a blog post where I highlight 10 ideas that stood out to me from the first lesson and first three chapters of the course reader from the AI evals course taught by <a href="/HamelHusain/">Hamel Husain</a>  and <a href="/sh_reya/">Shreya Shankar</a>. 

vishalbakshi.github.io/blog/posts/202…
Shreya Shankar (@sh_reya) 's Twitter Profile Photo

Really like this set of standout ideas. We say a million things in the course reader and I love hearing what sticks / what's practical

Omar Khattab (@lateinteraction) 's Twitter Profile Photo

Nice. Late interaction on the document side, at the granularity of chunks. Just add it on the query side and do MaxSim and voila!

vishal (@vishal_learner) 's Twitter Profile Photo

The PyTorch PR that changes "KB" to "KiB" in `torch.cuda.memory_summary()` because "we're talking powers of 2 not 10" github.com/pytorch/pytorc…

The PyTorch PR that changes "KB" to "KiB" in `torch.cuda.memory_summary()` because "we're talking powers of 2 not 10"

github.com/pytorch/pytorc…
Benjamin Clavié (@bclavie) 's Twitter Profile Photo

Summoning the wisdom of the crowd once again: At the moment, what’s the most GRPOable small (all definitions of small: 7B, sub-4B, sub-2B) model? Is it still the case that Qwen is always ready to learn while the others are more hit and miss?

Hamel Husain (@hamelhusain) 's Twitter Profile Photo

We’ve extended enrollment in our **last** live cohort on AI Evals until the end of this week! Here’s the syllabus (2 lessons per week): Week 1: Fundamentals & Lifecycle LLM Application Evaluation, Systematic Error Analysis Week 2: Implementing Effective Evaluations,

We’ve extended enrollment in our **last** live cohort on AI Evals until the end of this week!

Here’s the syllabus (2 lessons per week):

Week 1: Fundamentals &amp; Lifecycle LLM Application Evaluation, Systematic Error Analysis

Week 2: Implementing Effective Evaluations,