Iker García-Ferrero (@iker_garciaf) 's Twitter Profile
Iker García-Ferrero

@iker_garciaf

PhD student. @IxaGroup @Hitz_zentroa

ID: 1377243105682976769

linkhttps://ikergarcia1996.github.io/Iker-Garcia-Ferrero/ calendar_today31-03-2021 12:55:28

195 Tweet

1,1K Followers

278 Following

Julen Etxaniz (@juletxara) 's Twitter Profile Photo

In our new paper, we introduce Latxa, a family of LLMs for Basque from 7 to 70B parameters that outperform open models and GPT3.5. Models and datasets Hugging Face hf.co/collections/Hi… Code: github.com/hitz-zentroa/l… Blog: hitz.eus/en/node/343 Paper: arxiv.org/abs/2403.20266

In our new paper, we introduce Latxa, a family of LLMs for Basque from 7 to 70B parameters that outperform open models and GPT3.5.
Models and datasets <a href="/huggingface/">Hugging Face</a> hf.co/collections/Hi…
Code: github.com/hitz-zentroa/l…
Blog: hitz.eus/en/node/343
Paper: arxiv.org/abs/2403.20266
Hugh Zhang (@hughbzhang) 's Twitter Profile Photo

Data contamination is a huge problem for LLM evals right now. At Scale, we created a new test set for GSM8k *from scratch* to measure overfitting and found evidence that some models (most notably Mistral and Phi) do substantially worse on this new test set compared to GSM8k.

Data contamination is a huge problem for LLM evals right now. At Scale, we created a new test set for GSM8k *from scratch* to measure overfitting and found evidence that some models (most notably Mistral and Phi) do substantially worse on this new test set compared to GSM8k.
Alon Jacovi (@alon_jacovi) 's Twitter Profile Photo

The CONDA Data Contamination Workshop deadline was delayed to May 31st to give more buffer following ACL notifications - consider submitting if this gives you the chance to! conda-workshop.github.io

Iker García-Ferrero (@iker_garciaf) 's Twitter Profile Photo

Today I presented Medical mT5 at LREC COLING 2024!!! We have released a lot of multilingual models, training data and evaluation benchmarks for the medical domain here, check them out!! huggingface.co/collections/Hi…

Today I presented Medical mT5 at <a href="/LrecColing/">LREC COLING 2024</a>!!! We have released a lot of multilingual models, training data and evaluation benchmarks for the medical domain here, check them out!!  huggingface.co/collections/Hi…
Iker García-Ferrero (@iker_garciaf) 's Twitter Profile Photo

Back in the day, XLM-RoBERTa was too big for many people to run. It doesn't matter if you cannot run a 400B model today; in a few years, we will all laugh remembering how we thought that a "400B model was big" in the same way we laugh about how we thought that BERT was big.

Iñigo Alonso (@alonsonlp) 's Twitter Profile Photo

Reimagining table representation! In our new #ACL2024NLP paper we introduce PixT3: a family of image-based Table-to-Text Generation models that scale better at generating text from large tables, outperforming traditional text-based baselines. arxiv.org/abs/2311.09808

Julen Etxaniz (@juletxara) 's Twitter Profile Photo

How Much Do Language Models Know About Local Culture? We introduce BertaQA, a multiple-choice trivia dataset parallel in English and Basque. It consists of a local subset about the Basque culture, and a global subset with questions of broader interest. arxiv.org/abs/2406.07302

How Much Do Language Models Know About Local Culture?

We introduce BertaQA, a multiple-choice trivia dataset parallel in English and Basque. It consists of a local subset about the Basque culture, and a global subset with questions of broader interest.

arxiv.org/abs/2406.07302
Yanai Elazar (@yanaiela) 's Twitter Profile Photo

Concerned about data contamination? We asked the community for known contamination in different datasets and models, and summarized these finding in this report. arxiv.org/pdf/2407.21530

Concerned about data contamination?
We asked the community for known contamination in different datasets and models, and summarized these finding in this report.
arxiv.org/pdf/2407.21530
Anna Rogers (@annargrs) 's Twitter Profile Photo

I'll be discussing 'emergent properties' at 11am in this lovely workshop tomorrow. I found even more definitions for what this means during this ACL and also ICML!

Clémentine Fourrier 🍊 (@clefourrier) 's Twitter Profile Photo

There is now an LLM Leaderboard for one of the most spoken language worldwide: Spanish! 🚀 (+ Catalan, Basque and Galician) Congrats to María Grandury for setting it up, and to SomosNLP for gathering super high quality datasets from many partners! huggingface.co/spaces/la-lead…

HiTZ zentroa (UPV/EHU) (@hitz_zentroa) 's Twitter Profile Photo

El trabajo de HiTZ sobre Latxa ha recibido un premio internacional, aumentando el peso del euskera en la investigación. ehu.eus/es/-/centro-hi…

El trabajo de HiTZ sobre Latxa ha recibido un premio internacional, aumentando el peso del euskera en la investigación.

ehu.eus/es/-/centro-hi…
Rodrigo Agerri (@ragerri) 's Twitter Profile Photo

Join us in the First Shared Task on the Multilingual Counterspeech Generation against Hate Speech! To be held COLING 2025 Info: rb.gy/jfm481 - Aim: given a HS (and any additional knowledge the participants may like to use), generate a CN to counteract the HS.

Join us in the First Shared Task on the Multilingual Counterspeech Generation against Hate Speech! To be held <a href="/coling2025/">COLING 2025</a>  
Info: rb.gy/jfm481
- Aim: given a HS (and any additional knowledge the participants may like to use), generate a CN to counteract the HS.