
Zachary Novack @ICLR2025 ๐ธ๐ฌ
@zacknovack
efficient + controllable music generation | phd-ing @ucsd_cse | research intern @stabilityai | prev @adoberesearch @acmi_lab | teaching drums @pulsepercussion
ID: 1534894045805281280
http://zacharynovack.github.io 09-06-2022 13:44:14
228 Tweet
572 Followers
471 Following




We (w Zachary Novack Jaechul Roh et al.) are working on #memorization in #audio models & are conducting a human study on generated #music similarity. Please help us out by taking our short listening test (available in English, Mandarin & Cantonese). You can do more than one! Link โฌ๏ธ



Presenting RUListening! we edit Music-QA benchmarks to *actually* assess audio perception, using text-only LLMs to generate unimodally-hard distractors. Been super excited about this one (led by the beast Yongyi Zang), check out the full thread below! And at ISMIR 2025!๐ฐ๐ท




stable audio open small is great for stacking multiple generations Zachary Novack lyra bubbles~ โชโ the ux speriments continue. changing instrument gen during playback can be pretty jarring tho but methinks style-transfer endpoint may come in handy finetunes might make this glorious fun

Itโs been a thrilling journey buildingโฏFLAM! ๐ Super proud of what we achieved openโvocabulary audio event detection using calibrated frameโwise modeling. FLAM will be presented at ICML 2025, come check it out! ๐ Paper: arxiv.org/abs/2505.05393 ๐ง Demo: flam-model.github.io

I always like those paper/author visualizations for other conferences, so I ~vibe coded~ up an interactive one for #ISMIR2025 ISMIR Conference ! Go check it out at: zacharynovack.github.io/ismir2025.html Will hopefully add paper links and other metadata in the coming weeks :)




Stable Audio Open Small is accepted at #WASPAA2025 IEEE WASPAA 2025 ! Can't wait to share the latest in blazingly fast, on-device text-to-audio in Lake Tahoe ๐๏ธ

We're organizing the AI for Music workshop at NeurIPS Conference in San Diego! We'll be accepting both papers + demos w/an initial deadline of August 22, well timed for early visibility on your ICASSP/ICLR drafts ๐ Check out the website for more: aiformusicworkshop.github.io

made a Hugging Face space for custom sample generation using stable-audio-open-small. already had an api in my backend, so figured i should make a @gradio app for the looping stuff. combine drums+instruments then transform w/melodyflow link ๐
