zach ๐Ÿ”๐ŸŽถ (@zqevans) 's Twitter Profile
zach ๐Ÿ”๐ŸŽถ

@zqevans

Dance like nobodyโ€™s watching. Let them watch. Let them love it. | Director at @harmonai_org | ๐ŸŽต: soundcloud.com/fauno-music

ID: 15242515

linkhttps://twitter.com/zqevans calendar_today26-06-2008 10:03:36

3,3K Tweet

1,1K Followers

566 Following

RoyalCities (@royalcities) 's Twitter Profile Photo

Iโ€™ve officially released the first finetuned Stable Audio Open sample generator on HF. Bringing with this release Iโ€™ve also updated the Gradio to take full advantage of the model. So lets dive in, see what Iโ€™ve done and figure out where we may be headed. [Long Thread๐Ÿงต]

Iโ€™ve officially released the first finetuned Stable Audio Open sample generator on HF.

Bringing with this release Iโ€™ve also updated the Gradio to take full advantage of the model.

So lets dive in, see what Iโ€™ve done and figure out where we may be headed.

[Long Thread๐Ÿงต]
Jordi Pons (@jordiponsdotme) 's Twitter Profile Photo

ICML in Vienna is coming to a close! ๐Ÿ‡ฆ๐Ÿ‡น Here are the top-10 general (and audio) trends from ICML 2024. A thread ๐Ÿงต 1. Open vs. Closed AI: The debate was very present, notable in Soumith Chintala's keynote or by the release of Llama 3.1 (among others). icml.cc/virtual/2024/pโ€ฆ

ICML in Vienna is coming to a close! ๐Ÿ‡ฆ๐Ÿ‡น Here are the top-10 general (and audio) trends from ICML 2024.

A thread ๐Ÿงต

1. Open vs. Closed AI:
The debate was very present, notable in <a href="/soumithchintala/">Soumith Chintala</a>'s keynote or by the release of Llama 3.1 (among others).

icml.cc/virtual/2024/pโ€ฆ
Yoach (@yoachlacombe) 's Twitter Profile Photo

๐ŸŽต Stable Audio Open ๐ŸŽต just landed into diffusers, be ready to get: -> a whole lot of fun ๐ŸŽน๐ŸŽบ๐ŸŽท๐Ÿฅ๐ŸŽผ ๐ŸŽธ -> easy installation: `pip install diffusers` -> easy usage: 5 lines

zach ๐Ÿ”๐ŸŽถ (@zqevans) 's Twitter Profile Photo

I've been trying this out today, it's incredible. This is the Flash Attention moment for contrastive model training. Super easy to use the code, it handles all the distributed stuff for you. This should unlock a lot of new research! Great work!

dadabots (@dadabots) 's Twitter Profile Photo

Prompt Jockey (n) - it's DJing but harder. With DJing you're playing usually other peoples' tracks. But with this, the tracks don't even exist yet. You're prompting them live with a neural network. lyra bubbles~ โ™ชโ€ zach ๐Ÿ”๐ŸŽถ encanti Mr. Bill

RoyalCities (@royalcities) 's Twitter Profile Photo

๐Ÿ“ขATTN Producers & Musicians๐Ÿ“ข Today is the release of the most capable FREE open source sample generator tailored for EDM production. Its the largest model yet with HIGH musicality & VERY robust AI Style transfer. ft. a link to get started with it RIGHT NOW๐Ÿ˜Ž Lets dive in! ๐Ÿ‘‡

zach ๐Ÿ”๐ŸŽถ (@zqevans) 's Twitter Profile Photo

Do you want to do cutting edge research on generative music production tools? Do you want to publish papers and also release open weights and code? Apply to be a research intern on our team!

Stability AI (@stabilityai) 's Twitter Profile Photo

What if you could turn everyday sounds into songs? Our Audio Researcher CJ Carr shows you how with Stable Audio. You can now access Stable Audio via the Stability AI API โ€” or, as always, at StableAudio.com. Learn more: bit.ly/4iI0V4q

zach ๐Ÿ”๐ŸŽถ (@zqevans) 's Twitter Profile Photo

Super excited to launch Stable Audio Open Small today! It was great working with Arm on this model to make sure it runs efficiently on Arm CPUs. I'm also now an Arm Ambassador! I'm looking forward to helping the Arm developer community integrate this new model.

Zachary Novack @ICLR2025 ๐Ÿ‡ธ๐Ÿ‡ฌ (@zacknovack) 's Twitter Profile Photo

Releasing Stable Audio Open Small! 75ms GPU latency! 7s *mobile* CPU latency! How? w/Adversarial Relativistic Contrastive (ARC) Post-Training! ๐Ÿ“˜:arxiv.org/abs/2505.08175 ๐Ÿฅ:arc-text2audio.github.io/web/ ๐Ÿค—:huggingface.co/stabilityai/stโ€ฆ Hereโ€™s how we made the fastest TTA out there๐Ÿงต

lyra bubbles~ โ€ (@_lyraaaa_) 's Twitter Profile Photo

got stable audio open small training in <12gb VRAM at batch size 8 & default sample size everyone with 16-24gb cards who wanted to locally tune SAO 1.0 but couldn't (27.6gb vram) should be very happy now

Nate Raw (@_nateraw) 's Twitter Profile Photo

Landed a feature in stable audio tools that should make it easier to fine-tune your own custom text to music models - especially if you're GPU poor. Pre-encoding latents ahead of time reduces GPU memory + helps keep your GPU hot๐Ÿ”ฅ Documentation here: github.com/Stability-AI/sโ€ฆ