Iker García-Ferrero (@iker_garciaf) Twitter Tweets • TwiCopy

Julen Etxaniz

2 years ago

In our new paper, we introduce Latxa, a family of LLMs for Basque from 7B to 70B parameters that outperform open models and GPT-3.5. Models and datasets on Hugging Face: hf.co/collections/Hi… Code: github.com/hitz-zentroa/l… Blog: hitz.eus/en/node/343 Paper: arxiv.org/abs/2403.20266

Hugh Zhang

@hughbzhang

2 years ago

Data contamination is a huge problem for LLM evals right now. At Scale, we created a new test set for GSM8k *from scratch* to measure overfitting and found evidence that some models (most notably Mistral and Phi) do substantially worse on this new test set compared to GSM8k.

Eneko Agirre @eagirre.bsky.social

@eagirre

2 years ago

Goodbye #ICLR2024, and congrats to the organizers: they managed to make a 5,000-person conference a pleasure to attend!

Alon Jacovi

@alon_jacovi

2 years ago

The CONDA Data Contamination Workshop deadline was delayed to May 31st to give more buffer following ACL notifications - consider submitting if this gives you the chance to! conda-workshop.github.io

Julen Etxaniz

@juletxara

a year ago

Our paper about Latxa has been accepted at #ACL2024NLP 🎉 Congratulations to all the coauthors and see you in Bangkok!

Iker García-Ferrero

@iker_garciaf

a year ago

Today I presented Medical mT5 at LREC COLING 2024!!! We have released a lot of multilingual models, training data and evaluation benchmarks for the medical domain here, check them out!! huggingface.co/collections/Hi…

Iker García-Ferrero

@iker_garciaf

a year ago

Back in the day, XLM-RoBERTa was too big for many people to run. It doesn't matter if you cannot run a 400B model today; in a few years, we will all laugh remembering how we thought that a "400B model was big" in the same way we laugh about how we thought that BERT was big.

HiTZ zentroa (UPV/EHU)

@hitz_zentroa

a year ago

Our colleagues accompanied by @orai_nlp and Vicomtech are working hard at #lreccolling2024

Iñigo Alonso

@alonsonlp

a year ago

Reimagining table representation! In our new #ACL2024NLP paper we introduce PixT3: a family of image-based Table-to-Text Generation models that scale better at generating text from large tables, outperforming traditional text-based baselines. arxiv.org/abs/2311.09808

Julen Etxaniz

@juletxara

a year ago

How Much Do Language Models Know About Local Culture? We introduce BertaQA, a multiple-choice trivia dataset parallel in English and Basque. It consists of a local subset about the Basque culture, and a global subset with questions of broader interest. arxiv.org/abs/2406.07302

Iker García-Ferrero

@iker_garciaf

a year ago

Great work!!! It's nice to see GoLLIE being useful for the community 😀

Yanai Elazar

@yanaiela

a year ago

Concerned about data contamination? We asked the community for known contamination in different datasets and models, and summarized these findings in this report. arxiv.org/pdf/2407.21530

Julen Etxaniz

@juletxara

a year ago

Thanks to everyone who came to our poster session! It was nice to talk about Latxa. I gave out many stickers! #ACL2024NLP

Anna Rogers

@annargrs

a year ago

I'll be discussing 'emergent properties' at 11am in this lovely workshop tomorrow. I found even more definitions for what this means during this ACL and also ICML!

Clémentine Fourrier 🍊

@clefourrier

a year ago

There is now an LLM Leaderboard for one of the most spoken languages worldwide: Spanish! 🚀 (+ Catalan, Basque and Galician) Congrats to María Grandury for setting it up, and to SomosNLP for gathering super high quality datasets from many partners! huggingface.co/spaces/la-lead…

HiTZ zentroa (UPV/EHU)

@hitz_zentroa

a year ago

HiTZ's work on Latxa has received an international award, increasing the weight of Basque in research. ehu.eus/es/-/centro-hi…

HiTZ zentroa (UPV/EHU)

@hitz_zentroa

a year ago

In addition, Iker García-Ferrero has presented the NoticIA dataset and systems for clickbait detection.

Rodrigo Agerri

@ragerri

a year ago

Join us in the First Shared Task on Multilingual Counterspeech Generation against Hate Speech, to be held at COLING 2025! Info: rb.gy/jfm481 Aim: given a hate speech message (HS), and any additional knowledge the participants may like to use, generate a counter-narrative (CN) to counteract the HS.
