Ruchao Fan (@ruchaofan) 's Twitter Profile
Ruchao Fan

@ruchaofan

Speech Scientist @ Microsoft
Ph.D. @ UCLA
Speech Processing

ID: 1181292626768609280

linkhttps://diamondfan.github.io/ calendar_today07-10-2019 19:38:20

19 Tweet

42 Followers

122 Following

Ruchao Fan (@ruchaofan) 's Twitter Profile Photo

This my my first tweet. I am so happy that my three papers are accepted to ICASSP2021. This is also my first pub in icassp. Thank all the reviewers for their dedicated work.

Awni Hannun (@awnihannun) 's Twitter Profile Photo

I'm stoked to see all the progress in large ASR datasets. Several 10k hour+ datasets have recently (or will soon be) released. GigaSpeech (English): github.com/SpeechColab/Gi… WeNetSpeech (Mandarin): wenet-e2e.github.io/WenetSpeech/ People's Speech (multi-lingual): mlcommons.org/en/peoples-spe…

Ruchao Fan (@ruchaofan) 's Twitter Profile Photo

The research topic during my Ph.D. at UCLA is children's speech recognition. As this journey ended last week, I, over the weekend, summarized some resources about child speech (github.com/Diamondfan/Chi…). I hope the repo can help students who join the field.

Weizhu Chen (@weizhuchen) 's Twitter Profile Photo

We released Phi-4-mini (3.8B base in LLM), a new SLM excelling in language, vision, and audio through a mixture-of-LoRA, uniting three modalities in one model. I am so impressed with its new audio capability. I hope you can play with it and share with us your feedback. We also

We released Phi-4-mini (3.8B base in LLM), a new SLM excelling in language, vision, and audio through a mixture-of-LoRA, uniting three modalities in one model. I am so impressed with its new audio capability. I hope you can play with it and share with us your feedback. We also