Michel Olvera (@michelolzam) 's Twitter Profile
Michel Olvera

@michelolzam

PhD in Computer Science @Inria_Nancy. Researcher in audio at @telecomparis.

ID: 1054248180353785858

Website: http://molveraz.com · Joined: 22-10-2018 05:48:47

386 Tweets

207 Followers

1.1K Following

Vivek Verma (@vcubingx) 's Twitter Profile Photo

IT'S FINALLY OUT!

After months of work, I'm happy to announce that the first video in a series on visualizing deep learning is out. It's on visualizing the structure of a neural network.

RTs appreciated :)

youtu.be/UOvPeC8WOt8
FaRo (@faroit) 's Twitter Profile Photo

Happy to see that DeMask, built with Asteroid, won first place in the PyTorch summer #hackathon! The model lets you enhance speech that sounds muffled when wearing a face mask. Demo: youtu.be/QLf10Uqu8Yk 👋 to our team: Manuel Pariente, Michel Olvera, @_jonashaag and Samuel Cornell

Manuel Pariente (@mnlpariente) 's Twitter Profile Photo

Woohoo 🤩 We won the @Pytorch #hackathon with DeMask - built with Asteroid! Truly surprised and humbled, but also very proud of our team (Samuele Cornell, FaRo, Michel Olvera and @_jonashaag). Project page: devpost.com/software/aster… Demo: youtube.com/watch?v=QLf10U… Thanks !
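
For anyone who wants to try DeMask, here is a minimal sketch of loading a pretrained Asteroid model and enhancing a clip. The checkpoint name and the input/output shapes are assumptions, not taken from the project page, so check the published model before relying on them.

```python
# Minimal sketch of DeMask-style speech enhancement with Asteroid.
# The Hugging Face model ID below is an assumption; see the project page
# for the released checkpoint and its expected sample rate.
import soundfile as sf
import torch
from asteroid.models import BaseModel

model = BaseModel.from_pretrained(
    "popcornell/DeMask_Surgical_mask_speech_enhancement_v1"  # assumed ID
)

# Load a muffled mono recording; it should match the model's sample rate.
mix, sr = sf.read("muffled_speech.wav", dtype="float32")

with torch.no_grad():
    enhanced = model(torch.from_numpy(mix).unsqueeze(0))  # (batch, ..., time)

sf.write("enhanced_speech.wav", enhanced.squeeze().cpu().numpy(), sr)
```
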

Eduardo Fonseca (@edfonseca_) 's Twitter Profile Photo

🔊Happy to announce FSD50K: the new open dataset of human-labeled sound events! Over 51k Freesound audio clips, totalling over 100h of audio manually labeled using 200 classes drawn from the AudioSet Ontology.

Paper: arxiv.org/pdf/2010.00475…
Dataset: doi.org/10.5281/zenodo…
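
A small sketch of browsing the FSD50K ground truth after downloading it from Zenodo. The file layout and column names (fname, labels, mids) are assumptions based on the dataset description; adjust paths to your local copy.

```python
# Inspect the FSD50K development-split labels with pandas.
import pandas as pd

dev = pd.read_csv("FSD50K.ground_truth/dev.csv")

# Each clip can carry several comma-separated labels from the AudioSet Ontology.
dev["labels"] = dev["labels"].str.split(",")

print(len(dev), "development clips")
print(dev["labels"].explode().value_counts().head(10))  # most frequent classes
```
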
National Geographic (@natgeo) 's Twitter Profile Photo

Día de los Muertos, or #DayoftheDead, unfolds in an explosion of color and joy, demonstrating love and respect for lost loved ones. on.natgeo.com/3kPrIh9

@danstowell@mastodon.social (@mclduk) 's Twitter Profile Photo

New preprint: "Computational bioacoustics with deep learning: a review and roadmap" arxiv.org/abs/2112.06725 #bioacoustics #machinelearning

Daniela Robles @daniela-oaks.bsky.social (@daniela_oaks) 's Twitter Profile Photo

Hi guys / Hola amigos / Bom dia, Are you a researcher in a low- and middle-income country working on #melanoma (any aspect of it)? Please get in touch, I would like to know more about what you're doing and your current challenges 😊 Please RT!!!

arXiv Sound (@arxivsound) 's Twitter Profile Photo

"ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds," Sreyan Ghosh, Sonal Kumar, Chandra Kiran Reddy Evuru, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha, ift.tt/OgxJruG
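
For context, ReCLAP targets the standard zero-shot setup in which an audio embedding is matched against text prompts. The sketch below shows that baseline with a CLAP-style model from Hugging Face transformers; the checkpoint and the prompt wording are assumptions for illustration, not the paper's exact setup.

```python
# Zero-shot audio classification with a CLAP-style model: score an audio clip
# against descriptive text prompts instead of bare class names.
import numpy as np
import torch
from transformers import ClapModel, ClapProcessor

model = ClapModel.from_pretrained("laion/clap-htsat-unfused")
processor = ClapProcessor.from_pretrained("laion/clap-htsat-unfused")

# Descriptive prompts (the kind of "sound descriptions" ReCLAP argues for).
prompts = [
    "the repeated barking of a dog",
    "a wailing siren passing by on the street",
    "rain falling steadily on a hard surface",
]

audio = np.random.randn(48_000 * 5).astype(np.float32)  # placeholder 5 s clip at 48 kHz

inputs = processor(text=prompts, audios=audio, sampling_rate=48_000,
                   return_tensors="pt", padding=True)
with torch.no_grad():
    logits = model(**inputs).logits_per_audio  # (n_audio, n_prompts)

probs = logits.softmax(dim=-1)
print("predicted:", prompts[probs.argmax().item()])
```
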

Kristina Ulicna, PhD 👩‍💻 (@kristinaulicna) 's Twitter Profile Photo

Important reminder 🚨: with the surge of "all 17 papers from my lab got accepted at NeurIPS 🤩" posts today, just remember that it's perfectly fine if your paper got rejected 🙅‍♀️ / if you haven't even submitted 🤷‍♀️. You can do awesome research even without flashy publications! 📝

Hugo (@mldhug) 's Twitter Profile Photo

You want to give audio abilities to your VLM without compromising its vision performance? You want to align your audio encoder with a pretrained image encoder without suffering from the modality gap? Check out our #NeurIPS2024 paper with Michel Olvera, Stéphane Lathuilière, and Slim Essid

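
The general idea behind this line of work is to train a light projection so that audio embeddings land in the embedding space of a frozen image encoder, using paired audio-image data and a contrastive objective. The sketch below illustrates that generic recipe only; the encoders, projection, and loss details of the paper may differ.

```python
# Generic contrastive (InfoNCE-style) alignment of an audio encoder
# to a frozen image encoder. Toy tensors stand in for real encoder outputs.
import torch
import torch.nn as nn
import torch.nn.functional as F

class AudioProjection(nn.Module):
    """Maps audio-encoder features into the image-encoder embedding space."""
    def __init__(self, audio_dim: int, image_dim: int):
        super().__init__()
        self.proj = nn.Sequential(nn.Linear(audio_dim, image_dim), nn.GELU(),
                                  nn.Linear(image_dim, image_dim))

    def forward(self, audio_feats: torch.Tensor) -> torch.Tensor:
        return F.normalize(self.proj(audio_feats), dim=-1)

def contrastive_loss(audio_emb, image_emb, temperature=0.07):
    # Paired batch: the i-th audio clip corresponds to the i-th image.
    logits = audio_emb @ image_emb.t() / temperature
    targets = torch.arange(logits.size(0), device=logits.device)
    return (F.cross_entropy(logits, targets) +
            F.cross_entropy(logits.t(), targets)) / 2

audio_feats = torch.randn(8, 768)                      # trainable audio encoder output
image_emb = F.normalize(torch.randn(8, 512), dim=-1)   # frozen image encoder output

proj = AudioProjection(768, 512)
loss = contrastive_loss(proj(audio_feats), image_emb)
loss.backward()  # only the projection (and audio encoder) would be updated
```
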
Michel Olvera (@michelolzam) 's Twitter Profile Photo

Great talk today by Haohe Liu at the ADASP group on Latent Diffusion Models (LDMs) as versatile audio decoders! He walked us through diffusion basics, AudioLDM for text-to-audio, audio quality enhancement, and neural codecs!

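
To get a feel for text-to-audio with latent diffusion, here is a hedged example using the AudioLDM pipeline in diffusers; the checkpoint name, the 16 kHz output rate, and the GPU assumption come from the public release notes rather than the talk itself.

```python
# Text-to-audio generation with AudioLDM via diffusers (assumes a CUDA GPU).
import torch
import scipy.io.wavfile
from diffusers import AudioLDMPipeline

pipe = AudioLDMPipeline.from_pretrained("cvssp/audioldm-s-full-v2",
                                        torch_dtype=torch.float16).to("cuda")

prompt = "birds singing in a quiet forest with a distant stream"
audio = pipe(prompt, num_inference_steps=25, audio_length_in_s=5.0).audios[0]

# AudioLDM generates 16 kHz audio (assumed output rate).
scipy.io.wavfile.write("generated.wav", rate=16000, data=audio)
```
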
DCASE Workshop (@dcase_workshop) 's Twitter Profile Photo

Thank you for participating in #DCASE2024 workshop. Your participation made it a meaningful experience, and we look forward to staying connected.

Magdalena Fuentes (@mfu3ntes) 's Twitter Profile Photo

🔊🎶 MIR folks, mirdata 0.3.9 is out‼️ ISMIR Conference #ISMIR2024

6 new loaders, for a total of 58 dataset loaders! 

This time, we made mirdata MUCH lighter!
Install Time: 50.6s ⏩ slashed by 31% to 34.6s
Size: 95MB ⏬ shrunk by 98% to 1.9MB!!

Datasets: mirdata.readthedocs.io/en/stable/sour…
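
A quick sketch of the mirdata loader API; the dataset name is only an example, and the full list of loaders is behind the docs link above.

```python
# Initialize a mirdata loader, fetch the data, and inspect one track.
import mirdata

print(len(mirdata.list_datasets()), "available dataset loaders")

orchset = mirdata.initialize("orchset")  # example dataset
orchset.download()   # fetches audio and annotations
orchset.validate()   # checks local files against published checksums

tracks = orchset.load_tracks()
example = next(iter(tracks.values()))
print(example)  # track metadata, audio paths, and annotations
```
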
Wieland Brendel (@wielandbr) 's Twitter Profile Photo

🎉 New Pre-print! 🎉 Do CLIP models truly generalize to new, out-of-domain (OOD) images, or are they only doing well because they’ve been exposed to these domains in training? Our latest study reveals that CLIP’s ability to “generalize OOD” may be more limited than previously

Hugo (@mldhug) 's Twitter Profile Photo

If you want to learn more about audio-visual alignment and how to use it to give audio abilities to your VLM, stop by our NeurIPS Conference poster #3602 (East exhibit hall A-C) tomorrow at 11am!