Yossi Keshet (@jkeshet) 's Twitter Profile
Yossi Keshet

@jkeshet

Professor @ECE_Technion @TechnionLive; Chief Scientist @_aiola #speech #language #ai

ID: 84675132

linkhttps://keshet.net.technion.ac.il/ calendar_today23-10-2009 20:02:45

40 Tweet

196 Followers

432 Following

Yossi Keshet (@jkeshet) 's Twitter Profile Photo

Can we build a system that gives almost perfect auditory and visual feedback to learners of a new language? Read my medium post on a new algorithm to generate synthetic feedback of proper pronunciation from the wrong one in the speaker’s own voice. link.medium.com/ALdBTFfxisb

Amazon Science (@amazonscience) 's Twitter Profile Photo

Twenty years ago, Yossi Keshet, Amazon Scholar and Technion Israel associate professor, was working on the problem of automatic speech recognition—but he says it still isn't a solved problem. Find out why, where he sees gaps, and what he’s eager to explore in this research field. #ASR

Yossi Keshet (@jkeshet) 's Twitter Profile Photo

Ever wondered why Kim Kardashian sounds so cool and how it influences the accuracy of automatic speech recognizers? See the blog post of Bronya Roni Chernyak describing our joint work with Talia Ben Simon & Yael Segal, along with Eleanor Chodroff, @JeremySteffman & Jennifer Cole!

Felix Kreuk (@felixkreuk) 's Twitter Profile Photo

We present “AudioGen: Textually Guided Audio Generation”! AudioGen is an autoregressive transformer LM that synthesizes general audio conditioned on text (Text-to-Audio). 📖 Paper: tinyurl.com/audiogen-text2… 🎵 Samples: tinyurl.com/audiogen-text2… 💻 Code & models - soon! (1/n)

Yossi Keshet (@jkeshet) 's Twitter Profile Photo

Do you want to speed up or slow down the speech while listening to podcasts or YouTube? Now you can do that with exceptional quality. Read Eyal Cohen's blog presenting our work on generating speech with outstanding quality. medium.com/@eyalcohen308/…

NorthwesternLinguist (@linguisticsnu) 's Twitter Profile Photo

"Using automatic acoustic analysis to reveal disruptions to speech articulation in individuals at risk for psychosis" K. Hitczenko Y. Segal Yossi Keshet Mittal ADAPT Lab @MattGoldrick Poster session 4aSC #ASA184 5/7

Felix Kreuk (@felixkreuk) 's Twitter Profile Photo

We present MusicGen: A simple and controllable music generation model. MusicGen can be prompted by both text and melody. We release code (MIT) and models (CC-BY NC) for open research, reproducibility, and for the music community: github.com/facebookresear…

arXiv Sound (@arxivsound) 's Twitter Profile Photo

``DiffAR: Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation,'' Roi Benita, Michael Elad, Joseph Keshet, ift.tt/1UPCrnc

Yossi Keshet (@jkeshet) 's Twitter Profile Photo

Check out our latest speech synthesis work that can produce Vocal Fry - a voice register used to socially express avoidance, but also popular among celebrities & upwardly mobile women. With @RoiBenita and Michael Elad at #ICLR2024 arxiv.org/abs/2310.01381

Andrey Cheptsov (@andrey_cheptsov) 's Twitter Profile Photo

A major open-source release! aiOla drops Whisper-Medusa, a model that is 50% faster than OpenAI’s Whisper. The model is based on the new "multi-head" attention architecture. Paper: paperswithcode.com/method/multi-h… GitHub: github.com/aiola-lab/whis… HuggingFace: huggingface.co/aiola/whisper-…

Yossi Keshet (@jkeshet) 's Twitter Profile Photo

My aiOla team has just released Whisper-Medusa 50% faster than OpenAI's Whisper without sacrificing accuracy. It predicts up to 10 tokens simultaneously. github.com/aiola-lab/whis… #SpeechRecognition yaelsegal Aviv Shamsian Aviv Navon @gilhetz

ECE_Technion (@ece_technion) 's Twitter Profile Photo

Exciting news! Prof. Yossi Keshet from @TechnionECE joins forces with Prof. Bhiksha Raj as Chief Scientist at aiOla, pioneering next-gen AI speech tech! Proud of our alumnus Alon Peleg, serving as aiOla's COO! 🎓 Full story: radicaldatascience.wordpress.com/2024/10/21/two…

Exciting news! Prof. Yossi Keshet from @TechnionECE joins forces with Prof. Bhiksha Raj as Chief Scientist at aiOla, pioneering next-gen AI speech tech!

Proud of our alumnus Alon Peleg, serving as aiOla's COO! 🎓

Full story: radicaldatascience.wordpress.com/2024/10/21/two…
aiOla (@_aiola) 's Twitter Profile Photo

Big News in Ethical AI! aiOla’s new open-source model automatically identifies, tags, and masks sensitive information—names, phone numbers, addresses—all in one seamless step during audio transcription. A true leap forward in privacy-first AI. huggingface.co/spaces/aiola/w…

Yossi Keshet (@jkeshet) 's Twitter Profile Photo

We didn’t build Jargonic just to top a leaderboard—we built it to thrive in the real world: noisy, unpredictable, and full of domain-specific jargon. That’s what makes this milestone so meaningful. #ai #speech

IEEE Speech and Language Processing (@ieeesltc) 's Twitter Profile Photo

📢🌟🌟Call for ICASSP 2026 Speech and Language Processing Reviewer Nominations Please submit new speech and language processing reviewer nominations for ICASSP 2026 using the form below. docs.google.com/forms/d/1wtydY…

aiOla (@_aiola) 's Twitter Profile Photo

We are proud to have the best voice and speech AI lab in the world! Keep up the good work, team! 🚀 x.com/_akhaliq/statu…