Maite Melero (@maitemelero1) 's Twitter Profile
Maite Melero

@maitemelero1

ID: 809736524887683072

Joined: 16-12-2016 12:26:46

636 Tweets

186 Followers

232 Following

Marta Villegas (@martavillegasm) 's Twitter Profile Photo

📢 FLOR-6.3B, a new generative model for Catalan, Spanish & English based on BLOOM-7.1B. We modified the vocabulary and embedding layer, and continued pre-training the model on 140B tokens in our target languages 🚀huggingface.co/projecte-aina/… BSC-CNS Aina SomosNLP 👇
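
The vocabulary-and-embedding swap described above can be sketched with the Hugging Face transformers API roughly as follows; the new tokenizer path and the training step are hypothetical placeholders, not the actual FLOR recipe.

```python
# Rough sketch of swapping the vocabulary/embedding layer of BLOOM-7.1B before
# continued pre-training. The tokenizer path is a hypothetical placeholder.
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("bigscience/bloom-7b1")
new_tokenizer = AutoTokenizer.from_pretrained("path/to/ca-es-en-tokenizer")  # hypothetical

# Resize the input/output embedding matrices to the new vocabulary size; added
# rows start from a fresh initialisation and are learned during continued pre-training.
model.resize_token_embeddings(len(new_tokenizer))

# Continued pre-training on ~140B tokens of Catalan/Spanish/English text would
# follow here, e.g. with transformers.Trainer or a custom training loop.
```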

Maite Melero (@maitemelero1) 's Twitter Profile Photo

Screening of the documentary All Static and Noise and a conversation with the poet and activist Abduweli Ayup about the Uyghur people's struggle to preserve their language and culture. cccb.org/ca/activitats/… CCCB

Maite Melero (@maitemelero1) 's Twitter Profile Photo

The civilian population in the Gaza Strip is trapped in the middle of the bombardments. They need urgent support. Please make your donation for emergency humanitarian aid here: ayudagaza.com

Carlos Escolano (@carlosep93) 's Twitter Profile Photo

[1/7] Introducing "Investigating the translation capabilities of Large Language Models trained on parallel data only". To our knowledge, this is the first work studying translation with #LLMs trained exclusively on parallel data. arXiv paper: arxiv.org/abs/2406.09140

Carlos Escolano (@carlosep93) 's Twitter Profile Photo

[2/7] Along with the paper we release PLUME, a family of three 2B #LLMs based on the Gemma architecture. Each model uses a different vocabulary size, from 32k up to 256k tokens. PLUME 32k: huggingface.co/projecte-aina/… PLUME 128k: huggingface.co/projecte-aina/… PLUME 256k: huggingface.co/projecte-aina/…
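
The exact repository names are truncated in the tweet above, so this loading sketch uses a placeholder id; substitute whichever projecte-aina PLUME checkpoint you want to try.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "projecte-aina/<plume-checkpoint>"  # placeholder, not a verified repo name
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)
print("vocabulary size:", len(tokenizer))  # 32k, 128k or 256k depending on the variant
```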

Carlos Escolano (@carlosep93) 's Twitter Profile Photo

[3/7] Our results show that these models can perform comparably to previous Encoder-Decoder methods and that larger vocabularies lead to better performance on both supervised and zero-shot translation directions.

Carlos Escolano (@carlosep93) 's Twitter Profile Photo

[4/7] Further analysis shows that different layers specialize in different parts of the prompt. Two clear patterns we observe are the presence of sink heads that attend to the <BOS> token, and a small amount of attention to the source language tag.
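
A minimal sketch of how such sink heads can be spotted, assuming a decoder-only transformers model (the model id and prompt below are placeholders, not the paper's setup): measure, per head, how much attention mass lands on the first position.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "gpt2"  # placeholder; any decoder-only model exposing attentions works
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("eng_Latn cat_Latn Hello world", return_tensors="pt")  # hypothetical prompt format
with torch.no_grad():
    out = model(**inputs, output_attentions=True)

# out.attentions: one tensor per layer, shaped (batch, heads, tgt_len, src_len).
for layer, attn in enumerate(out.attentions):
    bos_mass = attn[0, :, :, 0].mean(dim=-1)  # per-head average attention to position 0
    print(f"layer {layer}: strongest head puts {bos_mass.max():.2f} of its attention on the first token")
```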

Carlos Escolano (@carlosep93) 's Twitter Profile Photo

[5/7] Given the previous findings, we masked the heads with the least attention coverage. Results show that more than 47% of model heads can be removed without losing more than 2 BLEU points, the 256k model being the most resilient one, with 64.7% of heads masked.
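
Head masking of this kind can be approximated with the head_mask argument that several transformers decoder models accept; a sketch under that assumption follows. The head selection below is arbitrary, whereas the paper picks heads by attention coverage and scores the result with BLEU.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "gpt2"  # placeholder for a model whose forward pass accepts head_mask
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# One row per layer, one column per head; 0.0 silences a head, 1.0 keeps it.
head_mask = torch.ones(model.config.n_layer, model.config.n_head)
head_mask[0, :4] = 0.0  # arbitrary example: drop the first four heads of layer 0

inputs = tokenizer("Translate: Hola món", return_tensors="pt")
with torch.no_grad():
    out = model(**inputs, head_mask=head_mask)
# Translation quality with and without the mask would then be compared, e.g. with BLEU.
```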

Carlos Escolano (@carlosep93) 's Twitter Profile Photo

[6/7] Finally, we study how the cross-lingual space is learned through the model layers. We observe that larger vocabulary sizes show smaller distances between languages at the early and middle layers.
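
One simple way to probe that cross-lingual space layer by layer (a sketch, not the paper's exact method): mean-pool the hidden states of a sentence and its translation at every layer and compare them with cosine distance. The model id and sentence pair are illustrative only.

```python
import torch
from transformers import AutoModel, AutoTokenizer

model_id = "gpt2"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

def layer_representations(text):
    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        out = model(**inputs, output_hidden_states=True)
    # One mean-pooled sentence vector per layer (index 0 is the embedding layer).
    return [h[0].mean(dim=0) for h in out.hidden_states]

catalan = layer_representations("El gat dorm al sofà.")
english = layer_representations("The cat sleeps on the sofa.")
for i, (a, b) in enumerate(zip(catalan, english)):
    distance = 1 - torch.nn.functional.cosine_similarity(a, b, dim=0)
    print(f"layer {i}: cosine distance = {distance.item():.3f}")
```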

Carlos Escolano (@carlosep93) 's Twitter Profile Photo

[7/7] This work was conducted at BSC-CNS thanks to funding from Aina and Proyecto Ilenia. Also, thanks to my co-authors Javier García Gilabert, Aleix Sant Savall, Francesca De Luca Fornaciari, Audrey Mash, Xixian Liao and Maite Melero.

Maite Melero (@maitemelero1) 's Twitter Profile Photo

Championing Catalan as a scientific language: the first untranslated abstract published in an international journal ara.cat/1_4d68e2?utm_s… via diariARA

Linguapax (@infolinguapax) 's Twitter Profile Photo

We'll be there! We take the opportunity to thank Emili Boix for the work he does at Linguapax as a member of the Board, and for the good humour he brings to it!

Linguapax (@infolinguapax) 's Twitter Profile Photo

Today is the International Day of Sign Languages. Did you know that there are around 300 sign languages in the world? They are part of the world's precious LINGUISTIC DIVERSITY, but they are also minoritized languages. youtu.be/_qO-ybCQQFI?si…

Linguapax (@infolinguapax) 's Twitter Profile Photo

📢 ATTENTION! The call for candidacies for the 🏆#PremiLinguapax 2024 is now open. You have until 21 February 2025 to submit nominations. More information: linguapax.org/convocatoria-d…

Maite Martín (@maite_martin) 's Twitter Profile Photo

🤬 An absolute disgrace! This call has been a disaster since it was announced, but the fact that they are not even going to resolve it really is something out of a third-world country. Politicians don't care about science (or AI) 😢 #SinCienciaNoHayFuturo - elpais.com/tecnologia/202…