Rohan Paul (@rohanpaul_ai) 's Twitter Profile
Rohan Paul

@rohanpaul_ai

💼 Engineer.

📚 I write daily on actionable AI developments.

🗞️ Subscribe and instantly get a 1300+ page Python book → rohan-paul.com

ID: 2588345408

Link: http://www.rohan-paul.com

Joined: 25-06-2014 22:38:54

36,36K Tweet

63,63K Followers

780 Following

🇨🇳 INCREDIBLE. China just released a 1T-parameter top open-source model for coding and agentic tool work.

Kimi K2 from Moonshot AI

Insane numbers on benchmarks. 

On LiveCodeBench the model hits 53.7 Pass@1, beating DeepSeek‑V3 by almost 7 points and clearing Qwen‑235B by more than

Most self‑supervised vision encoders treat each frame as a separate puzzle and miss the flow.

Token Bottleneck squeezes the whole frame into 1 learnable token, so future frames can be rebuilt from that single memory plus a few visible patches, making temporal reasoning part of

Low rank adapters like LoRA attach 2 matrices, yet their unequal scale can stall learning.

The paper introduces SingLoRA, a 1-matrix update that erases that clash and halves parameters.

It swaps BA for A times A transpose, so everything shares 1 scale.

That cut halves
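
The BA versus A times A transpose swap described above can be sketched in a few lines of NumPy. This is a minimal illustration, not the paper's implementation: it assumes a square d x d layer, and the names `lora_update`, `singlora_update`, `d`, and `r` are hypothetical. Any scaling, scheduling, or non-square handling the paper may use is left out.

```python
import numpy as np

def lora_update(B, A):
    """Standard LoRA-style update: product of two separate matrices."""
    return B @ A

def singlora_update(A):
    """Single-matrix update as described in the tweet: A times A transpose."""
    return A @ A.T

# Hypothetical sizes for illustration: square layer of width d, rank r.
d, r = 8, 2
rng = np.random.default_rng(0)
W0 = rng.standard_normal((d, d))           # frozen pretrained weight

# Two-matrix adapter: B is (d, r), A is (r, d).
B = np.zeros((d, r))
A_lora = rng.standard_normal((r, d))
W_lora = W0 + lora_update(B, A_lora)

# One-matrix adapter: a single (d, r) matrix.
A_single = 0.01 * rng.standard_normal((d, r))
delta = singlora_update(A_single)
W_single = W0 + delta

lora_params = B.size + A_lora.size          # 2 * d * r
single_params = A_single.size               # d * r
print(lora_params, single_params)
```

One property worth noting: A times A transpose is always symmetric, so this form of update is less general than BA for a fixed rank. The sketch only shows the parameter count and the shape of the update.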

Open‑source language models freeze at their last training date, so they miss fresh science questions.

This paper introduces X‑Master, a wrapper that lets any such model fetch web data, run code, and judge its own answers.

The trick is simple.

The model can drop a Python

🧠 New research finds the protein (cypin) that strengthens brain cell connections for memory. Higher cypin levels enhance synaptic plasticity, the brain’s ability to adapt and strengthen connections over time. Steadier signals mean sharper learning and slower memory loss. So

📝 We are so living in the future already. This is the 1st open conference where AI serves as both primary author and reviewer of research papers. Papers naming AI as primary author are due September 5, 2025.