Bertrand Charpentier (@bertrand_charp)'s Twitter Profile
Bertrand Charpentier

@bertrand_charp

Founder, President & Chief Scientist @PrunaAI | Prev. @Twitter research, Ph.D. in ML @TU_Muenchen

@bertrand-sharp.bsky.social
@[email protected]

ID: 1080138653807058945

Link: https://sharpenb.github.io/ · Joined: 01-01-2019 16:28:17

281 Tweets

490 Followers

132 Following

David Berenstein (@davidberenstei)'s Twitter Profile Photo

💥 SMASH and run models 5x faster, 5x cheaper. Pruna AI is the AI Optimization Engine for ML teams seeking to simplify scalable inference. Make sure to ⭐️ their GitHub: buff.ly/4tE5Ahy
TechCrunch: buff.ly/64jQYu3
Smashed Models on HF: buff.ly/iV3XDeU

Bertrand Charpentier (@bertrand_charp)'s Twitter Profile Photo

We open-sourced the pruna package! 🌍🚀
- It supports various compression methods, such as pruning, quantization, distillation, and caching, that can be combined!
- It enables easy evaluation of the efficiency and quality of compressed models!
Now, developers around the world can…
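To give a feel for one of the compression methods listed above, here is a toy, plain-Python sketch of symmetric 8-bit quantization. This is not the pruna API; the function names and values are invented purely for illustration.

```python
# Toy illustration of symmetric 8-bit quantization: store weights as
# int8 levels plus one shared float scale, then reconstruct on use.
# NOT the pruna API - a conceptual sketch only.

def quantize_int8(values):
    """Map floats to int8 levels in [-127, 127] with a shared scale."""
    scale = max(abs(v) for v in values) / 127 or 1.0
    q = [round(v / scale) for v in values]
    return q, scale

def dequantize_int8(q, scale):
    """Recover approximate floats from the int8 levels."""
    return [x * scale for x in q]

weights = [0.42, -1.37, 0.05, 0.99, -0.64]   # made-up example weights
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)

# Each int8 level spans `scale`, so round-trip error stays <= scale / 2.
max_err = max(abs(w - r) for w, r in zip(weights, restored))
assert max_err <= scale / 2
assert all(-127 <= x <= 127 for x in q)
```

The same trade-off drives real quantizers: 4x less memory per weight (int8 vs. float32) in exchange for a bounded reconstruction error.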

Pruna AI (@prunaai)'s Twitter Profile Photo

💦 Everything’s fine… until it isn’t. 🔥
Generative AI runs on code and concrete, metals, heat, and vapor.
Let’s talk about the real infrastructure behind your prompts. 🏗⚙️🔥💨
🗓 April 15, 5 PM UTC
🔗 linkedin.com/events/7309948…
Pruna AI (@prunaai)'s Twitter Profile Photo

Say hey to David Berenstein (x.com/davidberenstei), Pruna’s new DevRel alchemist! 🧪🤖
From Hugging Face & Argilla to homemade pasta 🍝, David’s all about precision, flavor & flow!
Now he’s cooking with us at Pruna to make models…
Bertrand Charpentier (@bertrand_charp)'s Twitter Profile Photo

💥 The Flux-Juiced endpoint pushes the limits of image-generation performance for both efficiency and quality! 💥

- Try Flux-Juiced on Replicate: lnkd.in/eBYCnhwg
- Read the full benchmarking blog on Hugging Face: lnkd.in/epRWuidn
Bertrand Charpentier (@bertrand_charp)'s Twitter Profile Photo

How to learn about efficient AI? Happy to announce the Awesome AI Efficiency repo, which gathers a curated list of 100+ materials for understanding the challenges and solutions in making AI faster, smaller, cheaper, and greener. 🚀 It is…

Pruna AI (@prunaai)'s Twitter Profile Photo

🔥 Community: “Image editing is too damn slow!” Don’t worry, we accepted the challenge of making HiDream-e1 faster, and it now runs in 8.7 s on an H100! :)

☕️ Grab a coffee, sit back, and relax, as the model has already been run over 20,000 times on Replicate!

👇 Model URL in the…
Bertrand Charpentier (@bertrand_charp)'s Twitter Profile Photo

We made Flux-Kontext-dev from Black Forest Labs 5x faster in under 4 hours and deployed it on Replicate. Hope you enjoy it! Details are in our blog: pruna.ai/blog/flux-kont… :)

Pruna AI (@prunaai)'s Twitter Profile Photo

🧑‍🏫 AI Efficiency Fundamentals - Week 1: Large Language Architectures

Do you know the difference between Autoregressive, Diffusion, and State-Space LLMs? Even if you do, these slides are great for you.

At Pruna, we want to educate about efficient AI, so our lead researcher and…
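The "autoregressive" pattern those slides contrast with diffusion and state-space models can be sketched in a few lines: each token is predicted from the prefix and appended, so generation is inherently sequential. The snippet below is a toy illustration with a made-up bigram lookup table standing in for a real model.

```python
# Toy autoregressive decoding loop. The "model" is a hypothetical
# bigram table, not an actual LLM - it only illustrates the
# token-by-token dependency that makes autoregressive inference slow.

def next_token(prefix):
    # Stand-in for a model forward pass: look up the last token.
    bigrams = {"the": "cat", "cat": "sat", "sat": "down"}
    return bigrams.get(prefix[-1], "<eos>")

def generate(prompt, max_new_tokens=5):
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        tok = next_token(tokens)   # depends on everything generated so far
        if tok == "<eos>":
            break
        tokens.append(tok)
    return tokens

print(generate(["the"]))  # ['the', 'cat', 'sat', 'down']
```

Diffusion and state-space LLMs attack exactly this loop: the former denoises many positions in parallel, the latter compresses the prefix into a fixed-size state.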
Bertrand Charpentier (@bertrand_charp)'s Twitter Profile Photo

How can AI endpoints emit less CO2? 🌱 One solution is to use endpoints that serve compressed models. This is particularly important when endpoints run at scale.
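A back-of-envelope way to reason about this (every number below is a made-up assumption, not a measurement): energy, and hence CO2, per request scales roughly with GPU-seconds, so a compressed model that answers in a fifth of the time cuts emissions per request by about 5x, and the savings grow linearly with traffic.

```python
# Hypothetical estimate: energy ~ GPU power x GPU-seconds.
# ALL constants below are illustrative assumptions, not measurements.

GPU_POWER_KW = 0.7      # assumed H100-class draw under load
SECONDS_BASE = 10.0     # assumed latency of the original model
SECONDS_FAST = 2.0      # assumed latency after compression (5x faster)
CO2_PER_KWH = 0.4       # assumed grid intensity, kg CO2 per kWh
REQUESTS = 1_000_000    # scale matters: savings grow linearly

def co2_kg(seconds_per_request, requests):
    """kg of CO2 for serving `requests` at the given per-request latency."""
    kwh = GPU_POWER_KW * seconds_per_request / 3600 * requests
    return kwh * CO2_PER_KWH

saved = co2_kg(SECONDS_BASE, REQUESTS) - co2_kg(SECONDS_FAST, REQUESTS)
print(f"Estimated CO2 saved: {saved:.0f} kg")  # ~622 kg under these assumptions
```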

Pruna AI (@prunaai)'s Twitter Profile Photo

🚀 Pruna x @gokoyeb Partnership Update!

🔥 Early adopters are reporting great results from our lightning-fast inference platform:

Performance breakthrough:
• ⚡️ Much faster models
• 💰 Cost reduction
• 🎯 Minimal quality degradation

Let’s talk…
Bertrand Charpentier (@bertrand_charp)'s Twitter Profile Photo

From Wan video to Wan image: we built the fastest endpoint for generating 2K images!
- Accessible on Replicate: lnkd.in/eqsBR2Kx
- Check details, examples, and benchmarks in our blog: lnkd.in/eXcAbqjM
- Use Pruna AI to compress more AI models:…

Pruna AI (@prunaai)'s Twitter Profile Photo

🔥 Deploy custom AI models with Pruna optimization speed + the @lightningai LitServe serving engine! Lightning-fast AI deployments!

What makes this awesome:
• ⚡️ FastAPI-powered serving
• 🎯 Built-in batching
• ⚙️ Define and serve any model (vision, audio, text)
• 🚀 Easy…