Benjamin Marie (@bnjmn_marie) Twitter Tweets • TwiCopy

Benjamin Marie

@bnjmn_marie

+ Follow

Independent AI researcher (LLM, NLP).
My blog, The Kaitchup - AI on a Budget: kaitchup.substack.com

ID: 1136844645491589120

linkhttps://kaitchup.substack.com/ calendar_today07-06-2019 03:57:39

643 Tweet

1,1K Followers

193 Following

Gate.io

@gate_io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

🚀 Tower+: our latest model in the Tower family — sets a new standard for open-weight multilingual models! We show how to go beyond sentence-level translation, striking a balance between translation quality and general multilingual capabilities. 1/5 arxiv.org/pdf/2506.17080

thumb_up_off_alt24

chat_bubble_outline1

repeat8

shareShare

Unsloth AI

@unslothai

a month ago

We made a Guide on mastering LoRA Hyperparameters, so you can learn to fine-tune LLMs correctly! Learn to: • Train smarter models with fewer hallucinations • Choose optimal: learning rates, epochs, LoRA rank, alpha • Avoid overfitting & underfitting 🔗docs.unsloth.ai/get-started/fi…

thumb_up_off_alt681

chat_bubble_outline12

repeat129

shareShare

Benjamin Marie

@bnjmn_marie

a month ago

FP4 support (NVFP4) in LLM compressor (with vLLM)!

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Benjamin Marie

@bnjmn_marie

a month ago

Gemma 3n support in vLLM: github.com/vllm-project/v… Only text version for now. Install vLLM from source to use it.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Benjamin Marie

@bnjmn_marie

a month ago

Excellent article by NVIDIA introducing NVFP4 developer.nvidia.com/blog/introduci…

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Benjamin Marie

@bnjmn_marie

a month ago

Got vLLM to work with Gemma 3n (text-only) but I had to compile from source. Doesn’t seem to work with VLLM_USE_PRECOMPILED=1. Also, don’t set VLLM_USE_V1=0. The shared KV is only supported by the V1 engine.

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Benjamin Marie

@bnjmn_marie

a month ago

I want a Gemma 3n E70B

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Valentina Pyatkin

@valentina__py

a month ago

💡Beyond math/code, instruction following with verifiable constraints is suitable to be learned with RLVR. But the set of constraints and verifier functions is limited and most models overfit on IFEval. We introduce IFBench to measure model generalization to unseen constraints.

thumb_up_off_alt347

chat_bubble_outline5

repeat89

shareShare

Maxime Labonne

@maximelabonne

20 days ago

Liquid AI open-sources a new generation of edge LLMs! 🥳 I'm so happy to contribute to the open-source community with this release on Hugging Face! LFM2 is a new architecture that combines best-in-class inference speed and quality into 350M, 700M, and 1.2B models.

Liquid AI open-sources a new generation of edge LLMs! 🥳

I'm so happy to contribute to the open-source community with this release on <a href="/huggingface/">Hugging Face</a>!

LFM2 is a new architecture that combines best-in-class inference speed and quality into 350M, 700M, and 1.2B models.

thumb_up_off_alt693

chat_bubble_outline32

repeat106

shareShare

Benjamin Marie

Gate.io

Ricardo Rei

Unsloth AI

Benjamin Marie

Benjamin Marie

Benjamin Marie

Benjamin Marie

Benjamin Marie

Valentina Pyatkin

Maxime Labonne