Michael Bommarito (@mjbommar)'s Twitter Profile
Michael Bommarito

@mjbommar

wandering

ID: 14132154

Joined: 12-03-2008 13:45:11

3.3K Tweets

965 Followers

709 Following

Michael Bommarito (@mjbommar)

genai helps to lay bare the irrational state of higher education. 6 figure price tag and 4+ year opportunity cost to prepare for yesterday's knowledge work career. but ironically, it seems that much of genai's demand comes from "make-work" assignments in k-12 and higher ed...

Jack Merullo (@jack_merullo_)

Could we tell if gpt-oss was memorizing its training data? I.e., points where it’s reasoning vs reciting? We took a quick look at the curvature of the loss landscape of the 20B model to understand memorization and what’s happening internally during reasoning

Michael Bommarito (@mjbommar)

gave openai codex cli another try today. am i missing something? how could they possibly be shipping this product without markdown support? is this what happens when your employee turnover metrics start to look like a college town mcdonalds?

Michael Bommarito (@mjbommar)

so thankful that the last time i submitted a kernel patch, i only got a "someone already fixed" instead of "you make the world a worse place" - never change, lkml

Michael Bommarito (@mjbommar)

weekend project done. easily pushes 10-20MB of text/second on single CPU thread. i never want to see any of you ship a product with shitty sentence or paragraph segmentation again

Michael Bommarito (@mjbommar)

my kids have broken me. i just read "dado" like "daddo" because they won't stop saying things like "doggo" or "canadian smoggo"

Daniel Kang (@daniel_d_kang)

The prevailing wisdom is that compute is the most important factor for frontier AI training. We think this is wrong: data is the most costly and important component of AI training. We collected estimates of revenue for major data labeling companies and compared them with the

Michael Bommarito (@mjbommar)

reminder - the original mistral release was a continued pretrain of llama, less than 3 months after the founders left meta. straight from arthur's x account:

Michael Bommarito (@mjbommar)

set up a personal site on my-name domain for the first time in ~15 years. within a week, chatgpt is already indexing it - as evidenced by at least one referral link. genuinely unsure how they are getting an index updated this quickly. again - not a newly-registered domain,

Hamidah Oderinwale (@didaoh)

1/ With Benjamin Laufer and Jon Kleinberg, we constructed the largest dataset of its kind to date: 1.86M Hugging Face models. In a new paper, we mapped how the open-source AI ecosystem evolves by tracing fine-tunes, merges, and more. Here's what we found 🧵
