Ona de Gibert (@onadegibert) Twitter Tweets • TwiCopy

Ona de Gibert

@onadegibert

+ Follow

PhD Student @HelsinkiNLP / Low-resource, Machine Translation, Knowledge Distillation, Multilinguality

ID: 1466431835303518212

linkhttps://www.linkedin.com/in/onadegibertbonet/ calendar_today02-12-2021 15:39:50

182 Tweet

424 Followers

502 Following

Ona de Gibert

@onadegibert

2 years ago

Just hosted my first ever BoF session, I hope the first of many more to come! Engaging discussions and new ideas at #ALPS24

thumb_up_off_alt14

chat_bubble_outline0

repeat0

shareShare

☑️ Gràcies a l'esforç comunitari, la Viquipèdia en català ja ha arribat a 40.000 articles sobre dones! La biografia que ha culminat aquesta fita és la de Miriam Tey. Ara mateix, el 21% de biografies de l'enciclopèdia són sobre dones. El 0,22% són sobre persones no binàries. 📈

thumb_up_off_alt163

chat_bubble_outline4

repeat51

shareShare

Transducens

@transducens

2 years ago

🚀 Attention MT scientists and developers! We announce the #WMT2024 shared task on machine translation into these underrepresented Romance languages of Spain: Aragonese, Aranese, and Asturian. Joint effort Universidad de Alicante UA+UOCuniversidad. Details here: www2.statmt.org/wmt24/romance-…

thumb_up_off_alt12

chat_bubble_outline1

repeat3

shareShare

HPLT

@hplt_eu

2 years ago

Will you be at LREC COLING 2024 next week? HPLT will! 🥳 Don't miss: - our poster on Thursday 23, 15:30, about FastSpell, one of the langID technologies of our dataset pipeline. (paper 1571) - our presentation on Friday 24, 9:20 for all details about HPLT massive dataset (paper 2199)

thumb_up_off_alt7

chat_bubble_outline0

repeat5

shareShare

Ona de Gibert

@onadegibert

2 years ago

Nothing says Italy like 𝐺𝑒𝑙𝑎𝑡𝑜! Spending the week in beautiful Torino for LREC COLING 2024 #LRECCOLING2024 #lreccoling Can't wait to listen to great research, reunite with old friends and make new ones!

Nothing says Italy like 𝐺𝑒𝑙𝑎𝑡𝑜! Spending the week in beautiful Torino for <a href="/LrecColing/">LREC COLING 2024</a> #LRECCOLING2024 #lreccoling Can't wait to listen to great research, reunite with old friends and make new ones!

thumb_up_off_alt34

chat_bubble_outline2

repeat1

shareShare

Ona de Gibert

@onadegibert

a year ago

Today I'm excited to present at @wikimania 2024! Let's talk about minority languages, language technology and open-source initiatives :) Wikimedia Foundation HPLT Helsinki-NLP University of Helsinki

Today I'm excited to present at @wikimania 2024! Let's talk about minority languages, language technology and open-source initiatives :)

<a href="/Wikimedia/">Wikimedia Foundation</a> <a href="/hplt_eu/">HPLT</a> <a href="/HelsinkiNLP/">Helsinki-NLP</a> <a href="/helsinkiuni/">University of Helsinki</a>

thumb_up_off_alt16

chat_bubble_outline0

repeat2

shareShare

Ona de Gibert

@onadegibert

a year ago

Let's meet in Prague!

thumb_up_off_alt3

chat_bubble_outline1

repeat0

shareShare

Institute of Formal and Applied Linguistics

@ufal_cuni

a year ago

The MT Marathon continues on its third day! We already had great talks by Ondrej Bojar, @prajdabre1, Vilém Zouhar, and Elizabeth Salesky 👏 and a poster session with 10 posters 🖼️. Today, we continue with more talks, and of course, the week-long hackathon continues with interesting projects.

The MT Marathon continues on its third day! We already had great talks by <a href="/OndrejBojar/">Ondrej Bojar</a>, @prajdabre1, <a href="/zouharvi/">Vilém Zouhar</a>, and <a href="/esalesk/">Elizabeth Salesky</a> 👏 and a poster session with 10 posters 🖼️. Today, we continue with more talks, and of course, the week-long hackathon continues with interesting projects.

thumb_up_off_alt36

chat_bubble_outline1

repeat9

shareShare

Ona de Gibert

@onadegibert

a year ago

Today Joseph Attieh and I presented KD4MT in the MT Marathon in Prague organized by Institute of Formal and Applied Linguistics, one of my favourite events of the year! Helsinki-NLP HPLT

Today <a href="/josephnlp/">Joseph Attieh</a> and I presented KD4MT in the MT Marathon in Prague organized by <a href="/ufal_cuni/">Institute of Formal and Applied Linguistics</a>, one of my favourite events of the year!

<a href="/HelsinkiNLP/">Helsinki-NLP</a> <a href="/hplt_eu/">HPLT</a>

thumb_up_off_alt29

chat_bubble_outline2

repeat4

shareShare

Institute of Formal and Applied Linguistics

@ufal_cuni

a year ago

Ondrej Bojar @prajdabre1 Vilém Zouhar Elizabeth Salesky We continued with great talks by Ricardo Rei, Julius Cheng, @onadegibert, Joseph Attieh, Laurie Burchell, @khetamalsharou, and Tsz Kin. Thanks for coming to Prague 🙌. ➕ Throughout the week, people worked on 👉7 🤩 exciting hacking projects.

<a href="/OndrejBojar/">Ondrej Bojar</a> @prajdabre1 <a href="/zouharvi/">Vilém Zouhar</a> <a href="/esalesk/">Elizabeth Salesky</a> We continued with great talks by <a href="/RicardoRei7/">Ricardo Rei</a>, <a href="/julius_gulius/">Julius Cheng</a>, @onadegibert, <a href="/josephnlp/">Joseph Attieh</a>, <a href="/very_laurie/">Laurie Burchell</a>, @khetamalsharou, and <a href="/Lam19Tk/">Tsz Kin</a>. Thanks for coming to Prague 🙌. ➕ Throughout the week, people worked on 👉7 🤩 exciting hacking projects.

thumb_up_off_alt17

chat_bubble_outline0

repeat4

shareShare

Helsinki-NLP

@helsinkinlp

a year ago

🚀 Excited to introduce EMMA-500! 🌍✨ A multilingual model continue-trained on 546 languages, enhancing coverage for low-resource languages. With the MaLA corpus and Llama 2 7B, we're pushing boundaries in cross-lingual transfer. Check it out: huggingface.co/MaLA-LM

thumb_up_off_alt102

chat_bubble_outline2

repeat25

shareShare

HPLT

@hplt_eu

a year ago

🚀 INTRODUCING THE LATEST HPLT MONOLINGUAL DATASETS! TL;DR: 🔍 4.5 PB of web crawls 📄 21 billion documents 💝 careful extraction, dedup, annotation and cleaning 💥 193 languages! Explore and download the new HPLT Monolingual Datasets NOW! hplt-project.org/datasets/v2.0 #HPLT

thumb_up_off_alt37

chat_bubble_outline2

repeat15

shareShare

Ona de Gibert

@onadegibert

a year ago

A tragedy in two acts:

thumb_up_off_alt14

chat_bubble_outline1

repeat0

shareShare

Ona de Gibert

@onadegibert

a year ago

It's happening! Will you join us? 😉

thumb_up_off_alt11

chat_bubble_outline0

repeat0

shareShare

HPLT

@hplt_eu

10 months ago

We are happy to announce the second release of HPLT bilingual datasets: - 50 English-centric language pairs = 380M parallel sentences (HPLT) 🤩 - 1,275 non-English-centric language pairs = 16.7B parallel sentences (MultiHPLT) 😮 Available at the HPLT dataset catalogue and OPUS.

thumb_up_off_alt16

chat_bubble_outline0

repeat13

shareShare