Eldar Kurtic (@_eldarkurtic) Twitter Tweets • TwiCopy

Nathan

7 months ago

Major props to the contributors who made this release happen 🙌 @JoelNiklaus Lewis Tunstall Alina Lozovskaya Clémentine Fourrier 🍊 Alvin HERIUN Eldar Kurtić María Grandury jnanliu Pavel Iakubovskii Check out the release & try it out: 🔗 github.com/huggingface/li…

thumb_up_off_alt13

chat_bubble_outline0

repeat2

shareShare

clem 🤗

@clementdelangue

6 months ago

Love this approach by Red Hat AI. We need more trust & validation in AI and this can help! huggingface.co/RedHatAI

Love this approach by <a href="/RedHat_AI/">Red Hat AI</a>. We need more trust & validation in AI and this can help! huggingface.co/RedHatAI

thumb_up_off_alt80

chat_bubble_outline9

repeat12

shareShare

Today at 15:00 CEST, I’ll give a talk at OpenSource@Siemens on efficient inference with LLMs. 📺 The talk will be live-streamed at opensource.siemens.com, followed by a live Q&A. Feel free to tune in and bring your questions! It’s a tutorial-style session covering the basics

thumb_up_off_alt11

chat_bubble_outline0

repeat1

shareShare

Eldar Kurtic

@_eldarkurtic

6 months ago

Want to quickly get a feeling for how fast an LLM runs under different workloads (and in different engines)? Look no further, Charles 🎉 Frye and Modal built a really cool app for it. Pro tip: don't skip the "Executive Summary" and "How to Benchmark", well worth the read!

thumb_up_off_alt8

chat_bubble_outline0

repeat1

shareShare

Andrej Jovanović

@itsmaddox_j

5 months ago

Join me to hear about decentralised training, why it works and what opportunities it can unlock 🚀. Many thanks to harsha for the invitation!

thumb_up_off_alt23

chat_bubble_outline2

repeat8

shareShare

Eldar Kurtic

@_eldarkurtic

5 months ago

LLM-Compressor now integrated with Axolotl!

thumb_up_off_alt11

chat_bubble_outline0

repeat3

shareShare

Casper Hansen

@casper_hansen_

5 months ago

Red Hat team absolutely smashing it!! Integration with axolotl is huge for training

thumb_up_off_alt9

chat_bubble_outline2

repeat2

shareShare

Eldar Kurtic

@_eldarkurtic

5 months ago

The recording of Erwan Gallen's and my PyTorch Day France 2025 and GOSIM Foundation talk, "Scaling LLM Inference with vLLM," is now available on PyTorch’s YouTube channel. youtube.com/watch?v=XYh6Xf…

thumb_up_off_alt23

chat_bubble_outline1

repeat3

shareShare

Eldar Kurtic

@_eldarkurtic

5 months ago

Want to learn more about GuideLLM, the tool used by Charles 🎉 Frye and Modal' LLM Engine Advisor to easily benchmark LLM inference stack? Join the next vLLM office hours with Saša , Michael Goin , Jenny Yi, and Mark Kurtz . More details in the thread below 👇

thumb_up_off_alt12

chat_bubble_outline0

repeat3

shareShare

Eldar Kurtic

@_eldarkurtic

4 months ago

The Hugging Face folks deserve far more credit for being a pillar of open-source and still managing to push out SOTA results across the board, along with a full write-up of the entire model’s lifecycle.

thumb_up_off_alt86

chat_bubble_outline2

repeat12

shareShare

Eldar Kurtic

@_eldarkurtic

4 months ago

FP4 models and inference kernels ready for Blackwell GPUs! GPTQ and Hadamard for accuracy, and fused Hadamard for runtime. Check out more details about our work in the thread below 👇

thumb_up_off_alt12

chat_bubble_outline0

repeat3

shareShare

Red Hat AI

@redhat_ai

4 months ago

.vLLM office hours return next week! Alongside project updates from Michael Goin, vLLM committers and HPC experts Robert Shaw + Tyler Michael Smith will share how to scale MoE models with llm-d and lessons from real world multi-node deployments. Register: red.ht/office-hours

.<a href="/vllm_project/">vLLM</a> office hours return next week!

Alongside project updates from <a href="/mgoin_/">Michael Goin</a>, vLLM committers and HPC experts <a href="/robertshaw21/">Robert Shaw</a> + <a href="/tms_jr/">Tyler Michael Smith</a> will share how to scale MoE models with llm-d and lessons from real world multi-node deployments.

Register: red.ht/office-hours

thumb_up_off_alt10

chat_bubble_outline0

repeat4

shareShare