Benjamin Marie (@bnjmn_marie) 's Twitter Profile
Benjamin Marie

@bnjmn_marie

Independent AI researcher (LLM, NLP).
My blog, The Kaitchup - AI on a Budget: kaitchup.substack.com

ID: 1136844645491589120

linkhttps://kaitchup.substack.com/ calendar_today07-06-2019 03:57:39

643 Tweet

1,1K Followers

193 Following

Ricardo Rei (@ricardorei7) 's Twitter Profile Photo

🚀 Tower+: our latest model in the Tower family — sets a new standard for open-weight multilingual models! We show how to go beyond sentence-level translation, striking a balance between translation quality and general multilingual capabilities. 1/5 arxiv.org/pdf/2506.17080

🚀 Tower+: our latest model in the Tower family — sets a new standard for open-weight multilingual models!
We show how to go beyond sentence-level translation, striking a balance between translation quality and general multilingual capabilities.
1/5

arxiv.org/pdf/2506.17080
Unsloth AI (@unslothai) 's Twitter Profile Photo

We made a Guide on mastering LoRA Hyperparameters, so you can learn to fine-tune LLMs correctly! Learn to: • Train smarter models with fewer hallucinations • Choose optimal: learning rates, epochs, LoRA rank, alpha • Avoid overfitting & underfitting 🔗docs.unsloth.ai/get-started/fi…

We made a Guide on mastering LoRA Hyperparameters, so you can learn to fine-tune LLMs correctly!

Learn to:
• Train smarter models with fewer hallucinations
• Choose optimal: learning rates, epochs, LoRA rank, alpha
• Avoid overfitting & underfitting

🔗docs.unsloth.ai/get-started/fi…
Benjamin Marie (@bnjmn_marie) 's Twitter Profile Photo

Got vLLM to work with Gemma 3n (text-only) but I had to compile from source. Doesn’t seem to work with VLLM_USE_PRECOMPILED=1. Also, don’t set VLLM_USE_V1=0. The shared KV is only supported by the V1 engine.

Valentina Pyatkin (@valentina__py) 's Twitter Profile Photo

đź’ˇBeyond math/code, instruction following with verifiable constraints is suitable to be learned with RLVR. But the set of constraints and verifier functions is limited and most models overfit on IFEval. We introduce IFBench to measure model generalization to unseen constraints.

đź’ˇBeyond math/code, instruction following with verifiable constraints is suitable to be learned with RLVR.
But the set of constraints and verifier functions is limited and most models overfit on IFEval.
We introduce IFBench to measure model generalization to unseen constraints.
Maxime Labonne (@maximelabonne) 's Twitter Profile Photo

Liquid AI open-sources a new generation of edge LLMs! 🥳 I'm so happy to contribute to the open-source community with this release on Hugging Face! LFM2 is a new architecture that combines best-in-class inference speed and quality into 350M, 700M, and 1.2B models.

Liquid AI open-sources a new generation of edge LLMs! 🥳

I'm so happy to contribute to the open-source community with this release on <a href="/huggingface/">Hugging Face</a>! 

LFM2 is a new architecture that combines best-in-class inference speed and quality into 350M, 700M, and 1.2B models.