Ona de Gibert (@onadegibert) 's Twitter Profile
Ona de Gibert

@onadegibert

PhD Student @HelsinkiNLP / Low-resource, Machine Translation, Knowledge Distillation, Multilinguality

ID: 1466431835303518212

linkhttps://www.linkedin.com/in/onadegibertbonet/ calendar_today02-12-2021 15:39:50

182 Tweet

424 Followers

502 Following

Viquipèdia (@viquipedia) 's Twitter Profile Photo

☑️ Gràcies a l'esforç comunitari, la Viquipèdia en català ja ha arribat a 40.000 articles sobre dones! La biografia que ha culminat aquesta fita és la de Miriam Tey. Ara mateix, el 21% de biografies de l'enciclopèdia són sobre dones. El 0,22% són sobre persones no binàries. 📈

☑️ Gràcies a l'esforç comunitari, la Viquipèdia en català ja ha arribat a 40.000 articles sobre dones!

La biografia que ha culminat aquesta fita és la de Miriam Tey.

Ara mateix, el 21% de biografies de l'enciclopèdia són sobre dones. El 0,22% són sobre persones no binàries. 📈
Transducens (@transducens) 's Twitter Profile Photo

🚀 Attention MT scientists and developers! We announce the #WMT2024 shared task on machine translation into these underrepresented Romance languages of Spain: Aragonese, Aranese, and Asturian. Joint effort Universidad de Alicante UA+UOCuniversidad. Details here: www2.statmt.org/wmt24/romance-…

HPLT (@hplt_eu) 's Twitter Profile Photo

Will you be at LREC COLING 2024 next week? HPLT will! 🥳 Don't miss: - our poster on Thursday 23, 15:30, about FastSpell, one of the langID technologies of our dataset pipeline. (paper 1571) - our presentation on Friday 24, 9:20 for all details about HPLT massive dataset (paper 2199)

Ona de Gibert (@onadegibert) 's Twitter Profile Photo

Nothing says Italy like 𝐺𝑒𝑙𝑎𝑡𝑜! Spending the week in beautiful Torino for LREC COLING 2024 #LRECCOLING2024 #lreccoling Can't wait to listen to great research, reunite with old friends and make new ones!

Nothing says Italy like 𝐺𝑒𝑙𝑎𝑡𝑜! Spending the week in beautiful Torino for <a href="/LrecColing/">LREC COLING 2024</a> #LRECCOLING2024 #lreccoling Can't wait to listen to great research, reunite with old friends and make new ones!
Institute of Formal and Applied Linguistics (@ufal_cuni) 's Twitter Profile Photo

The MT Marathon continues on its third day! We already had great talks by Ondrej Bojar, @prajdabre1, Vilém Zouhar, and Elizabeth Salesky 👏 and a poster session with 10 posters 🖼️. Today, we continue with more talks, and of course, the week-long hackathon continues with interesting projects.

The MT Marathon continues on its third day! We already had great talks by <a href="/OndrejBojar/">Ondrej Bojar</a>, @prajdabre1, <a href="/zouharvi/">Vilém Zouhar</a>, and <a href="/esalesk/">Elizabeth Salesky</a> 👏 and a poster session with 10 posters 🖼️. Today, we continue with more talks, and of course, the week-long hackathon continues with interesting projects.
Helsinki-NLP (@helsinkinlp) 's Twitter Profile Photo

🚀 Excited to introduce EMMA-500! 🌍✨ A multilingual model continue-trained on 546 languages, enhancing coverage for low-resource languages. With the MaLA corpus and Llama 2 7B, we're pushing boundaries in cross-lingual transfer. Check it out: huggingface.co/MaLA-LM

HPLT (@hplt_eu) 's Twitter Profile Photo

🚀 INTRODUCING THE LATEST HPLT MONOLINGUAL DATASETS! TL;DR: 🔍 4.5 PB of web crawls 📄 21 billion documents 💝 careful extraction, dedup, annotation and cleaning 💥 193 languages! Explore and download the new HPLT Monolingual Datasets NOW! hplt-project.org/datasets/v2.0 #HPLT

HPLT (@hplt_eu) 's Twitter Profile Photo

We are happy to announce the second release of HPLT bilingual datasets: - 50 English-centric language pairs = 380M parallel sentences (HPLT) 🤩 - 1,275 non-English-centric language pairs = 16.7B parallel sentences (MultiHPLT) 😮 Available at the HPLT dataset catalogue and OPUS.