Oreen Yousuf (@oreenyousuf) 's Twitter Profile
Oreen Yousuf

@oreenyousuf

NLP PhD student @uppsalauni doing Handwritten Text Recognition for Ajami manuscripts | 🌍scripts @Unicode

ID: 1363849612302426112

calendar_today22-02-2021 13:54:27

282 Tweet

307 Followers

579 Following

Oreva Ahia (@orevaahia) 's Twitter Profile Photo

I am excited to be presenting MAGNET 🧲at NeurIPS 2024 next week. Subword tokenizers have been shown to overly segment text in non-Latin script languages. Our work presents an approach to train tokenizer-free multilingual LMs via efficient byte-level modeling. 1/n

I am excited to be presenting MAGNET 🧲at NeurIPS 2024 next week. Subword tokenizers have been shown to overly segment text in non-Latin script languages. 
Our work presents an approach to train tokenizer-free multilingual LMs via efficient byte-level modeling. 
1/n
Turi👨‍💻 (@qubeegen) 's Twitter Profile Photo

Introducing Sagalee: an Open Source ASR Dataset for Oromo language. Happy to share that our work on Sagalee is accepted for IEEE ICASSP 2025! 🎉 I will be attending the conference in April. Link to paper and dataset👇

7000 Languages (@7000languages) 's Twitter Profile Photo

We are seeking applicants for our 2025 Language Revitalization Cohort! Work within your community to design and implement online language learning lessons using free language learning technology tools. Learn more at our link in bio!

We are seeking applicants for our 2025 Language Revitalization Cohort! 
Work within your community to design and implement online language learning lessons using free language learning technology tools. 

Learn more at our link in bio!