akbir. (@akbirkhan) 's Twitter Profile
akbir.

@akbirkhan

ID: 310950572

linkhttp://akbir.dev calendar_today04-06-2011 16:46:28

5,5K Tweet

2,2K Followers

863 Following

Dean W. Ball (@deanwball) 's Twitter Profile Photo

sonnet 4.5 seems to show major improvements in both alignment and performance in a matter of months. kudos to anthropic for demonstrating that performance and safety are not in fact mutually exclusive. cannot wait to play around with the model.

Sara Price (@sprice354_) 's Twitter Profile Photo

๐ŸŽ‰๐ŸŽ‰ Today we launched Claude Sonnet 4.5, which is not only highly capable but also a major improvement on safety and alignment x.com/claudeai/statuโ€ฆ

๐ŸŽ‰๐ŸŽ‰ Today we launched Claude Sonnet 4.5, which is not only highly capable but also a major improvement on safety and alignment x.com/claudeai/statuโ€ฆ
latent moss (@latentmoss) 's Twitter Profile Photo

Sonnet 4.5 is really good, I really like it. I didn't expect such a big jump in (what I assume to still be) the current Sonnet model size class. It feels genuinely better, not benchmaxxed. And I don't mean just coding, also in roleplays it is a fresh and interesting model.

Paul Yacoubian (@paulyacoubian) 's Twitter Profile Photo

Just found out if you try to delete your Sora app account you will lose your chatgpt account and be banned forever from signing up again.

Just found out if you try to delete your Sora app account you will lose your chatgpt account and be banned forever from signing up again.
Peter Yang (@petergyang) 's Twitter Profile Photo

Sonnet 4.5 is the least sycophantic AI I've used so far. It'll actually challenge you and share its objective opinions which makes it a great thought partner. Anthropic cooked.

Elena (@virtualelena) 's Twitter Profile Photo

very strong agree here. have to admit I was a bit skeptical before walking into (or more accurately, my skepticism brewed as I stood in line for, and then walked into) the pop-up yesterday. how much brand equity can you really get out of free coffee, a hat, and a free copy of a

very strong agree here.

have to admit I was a bit skeptical before walking into (or more accurately, my skepticism brewed as I stood in line for, and then walked into) the pop-up yesterday. how much brand equity can you really get out of free coffee, a hat, and a free copy of a
Shriya Anand (@shriya_anand_) 's Twitter Profile Photo

spent 30 mins with claude this AM after spending the last ~6 months predominantly with chat. this campaign is so powerful because sonnet 4.5 said things like this to me - which really did make me stop and put my thinking cap on

spent 30 mins with claude this AM after spending the last ~6 months predominantly with chat. 

this campaign is so powerful because sonnet 4.5 said things like this to me - which really did make me stop and put my thinking cap on
Yacine Mahdid (@yacinelearning) 's Twitter Profile Photo

dwarkesh is experiencing first hand the PhD candidate experience of arguing with your PI on the theoretical implication of your thesis

William Merrill (@lambdaviking) 's Twitter Profile Photo

My thesis, ๐˜ˆ ๐˜ต๐˜ฉ๐˜ฆ๐˜ฐ๐˜ณ๐˜บ ๐˜ฐ๐˜ง ๐˜ต๐˜ฉ๐˜ฆ ๐˜ค๐˜ฐ๐˜ฎ๐˜ฑ๐˜ถ๐˜ต๐˜ข๐˜ต๐˜ช๐˜ฐ๐˜ฏ๐˜ข๐˜ญ ๐˜ฑ๐˜ฐ๐˜ธ๐˜ฆ๐˜ณ ๐˜ข๐˜ฏ๐˜ฅ ๐˜ญ๐˜ช๐˜ฎ๐˜ช๐˜ต๐˜ข๐˜ต๐˜ช๐˜ฐ๐˜ฏ๐˜ด ๐˜ฐ๐˜ง ๐˜ญ๐˜ข๐˜ฏ๐˜จ๐˜ถ๐˜ข๐˜จ๐˜ฆ ๐˜ฎ๐˜ฐ๐˜ฅ๐˜ฆ๐˜ญ๐˜ช๐˜ฏ๐˜จ ๐˜ข๐˜ณ๐˜ค๐˜ฉ๐˜ช๐˜ต๐˜ฆ๐˜ค๐˜ต๐˜ถ๐˜ณ๐˜ฆ๐˜ด, is now online:

My thesis, ๐˜ˆ ๐˜ต๐˜ฉ๐˜ฆ๐˜ฐ๐˜ณ๐˜บ ๐˜ฐ๐˜ง ๐˜ต๐˜ฉ๐˜ฆ ๐˜ค๐˜ฐ๐˜ฎ๐˜ฑ๐˜ถ๐˜ต๐˜ข๐˜ต๐˜ช๐˜ฐ๐˜ฏ๐˜ข๐˜ญ ๐˜ฑ๐˜ฐ๐˜ธ๐˜ฆ๐˜ณ ๐˜ข๐˜ฏ๐˜ฅ ๐˜ญ๐˜ช๐˜ฎ๐˜ช๐˜ต๐˜ข๐˜ต๐˜ช๐˜ฐ๐˜ฏ๐˜ด ๐˜ฐ๐˜ง ๐˜ญ๐˜ข๐˜ฏ๐˜จ๐˜ถ๐˜ข๐˜จ๐˜ฆ ๐˜ฎ๐˜ฐ๐˜ฅ๐˜ฆ๐˜ญ๐˜ช๐˜ฏ๐˜จ ๐˜ข๐˜ณ๐˜ค๐˜ฉ๐˜ช๐˜ต๐˜ฆ๐˜ค๐˜ต๐˜ถ๐˜ณ๐˜ฆ๐˜ด, is now online:
Ben Ellis (@benjamin_ellis3) 's Twitter Profile Photo

Excited to share that Reflection has raised a new round to build open-weight LLMs! Joining Reflection post PhD has been a deeply rewarding experience โ€“ itโ€™s a privilege to work with this team on something ambitious, interesting and that I believe in. Weโ€™re hiring โ€“ DM me.

Nathan Calvin (@_nathancalvin) 's Twitter Profile Photo

One Tuesday night, as my wife and I sat down for dinner, a sheriffโ€™s deputy knocked on the door to serve me a subpoena from OpenAI. I held back on talking about it because I didn't want to distract from SB 53, but Newsom just signed the bill so... here's what happened: ๐Ÿงต

One Tuesday night, as my wife and I sat down for dinner, a sheriffโ€™s deputy knocked on the door to serve me a subpoena from OpenAI.

I held back on talking about it because I didn't want to distract from SB 53, but Newsom just signed the bill so... here's what happened:
๐Ÿงต
Eric W. Tramel (@fujikanaeda) 's Twitter Profile Photo

- pretrain on math and reasoning dialog - mid train on math and reasoning traces - release as base model LLM RL Researchers: WE HAVE DISCOVERED ALIEN INTELLIGENCE BEYOND OUR COMPREHENSION WITH RLVR!

Sam Bowman (@sleepinyourhat) 's Twitter Profile Photo

๐Ÿงต Haiku 4.5 ๐Ÿงต Looking at the alignment evidence, Haiku is similar to Sonnet: Very safe, though often eval-aware. I think the most interesting alignment content in the system card is about reasoning faithfulnessโ€ฆ

๐Ÿงต Haiku 4.5  ๐Ÿงต

Looking at the alignment evidence, Haiku is similar to Sonnet: Very safe, though often eval-aware. 

I think the most interesting alignment content in the system card is about reasoning faithfulnessโ€ฆ
Mark Chen (@markchen90) 's Twitter Profile Photo

Excited to start OpenAI for Physics w/ Alex Lupsasca Kevin Weil ๐Ÿ‡บ๐Ÿ‡ธ Aleksander Madry and Jakub Pachocki! I sat with Alex Lupsasca when GPT-5 reproduced his latest research paper, and we both felt parallels to watching AlphaGo play move 37. It's nearly impossible to be a world class chess player

John Schulman (@johnschulman2) 's Twitter Profile Photo

Fine-tuning APIs are becoming more powerful and widespread, but they're harder to safeguard against misuse than fixed-weight sampling APIs. Excited to share a new paper: Detecting Adversarial Fine-tuning with Auditing Agents (arxiv.org/abs/2510.16255). Auditing agents search