Simone Tedeschi (@simonetedeschi_) Twitter Tweets • TwiCopy

Matt Shumer

@mattshumer_

2 years ago

The dataset is everything. Great read: nonint.com/2023/06/10/the…

thumb_up_off_alt2,2K

chat_bubble_outline111

repeat558

shareShare

Iacopo Ghinassi just presented our paper on Latin word sense disambiguation LREC COLING 2024 : we used language pivoting on English to boost the task on Latin. More research on this to come, watch this space!

Iacopo Ghinassi just presented our paper on Latin word sense disambiguation <a href="/LrecColing/">LREC COLING 2024</a> : we used language pivoting on English to boost the task on Latin. More research on this to come, watch this space!

thumb_up_off_alt12

chat_bubble_outline1

repeat4

shareShare

Hoyeon Chang

@hoyeon_chang

a year ago

🚨 New paper 🚨 How Large Language Models Acquire Factual Knowledge During Pretraining? I’m thrilled to announce the release of my new paper! 🎉 This research explores how LLMs acquire and retain factual knowledge during pretraining. Here are some key insights:

thumb_up_off_alt523

chat_bubble_outline12

repeat119

shareShare

Babelscape

@babelscape

a year ago

We are proud to share that our paper, "CNER: Concept and Named Entity Recognition", a joint work with SapienzaNLP, has been presented at #NAACL24! 🥳 Looking forward to engaging with the community. #NAACL2024 #AI #NLProc #Research #NER

We are proud to share that our paper, "CNER: Concept and Named Entity Recognition", a joint work with <a href="/SapienzaNLP/">SapienzaNLP</a>, has been presented at #NAACL24! 🥳 Looking forward to engaging with the community. #NAACL2024 #AI #NLProc #Research #NER

thumb_up_off_alt5

chat_bubble_outline0

repeat1

shareShare

Hitesh Patel

@hitesh_lpatel

a year ago

ADVSCORE: A Metric for the Evaluation and Creation of Adversarial Benchmarks This paper introduces ADVSCORE, a metric to evaluate and create high-quality adversarial datasets. ADVQA, a robust question answering dataset effectively fools models while not humans. This approach

thumb_up_off_alt9

chat_bubble_outline0

repeat1

shareShare

Hitesh Patel

@hitesh_lpatel

a year ago

ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming The paper introduces ALERT, a benchmark for assessing the safety of LLMs. It employs a fine-grained risk taxonomy to evaluate LLMs propensity to generate harmful content and

thumb_up_off_alt13

chat_bubble_outline1

repeat2

shareShare

Steffi Chern

@steffichern

a year ago

🚀How can we effectively evaluate and prevent superintelligent LLMs from deceiving others? We introduce 🤝BeHonest, a pioneering benchmark specifically designed to assess the honesty in LLMs comprehensively. Paper 📄: [arxiv.org/abs/2406.13261] Code 👨🏻‍💻: [github.com/GAIR-NLP/BeHon…]

thumb_up_off_alt59

chat_bubble_outline1

repeat22

shareShare

Anka Reuel | @ankareuel.bsky.social

@ankareuel

a year ago

Our new paper "Open Problems in Technical AI Governance" led by Ben Bucknall & me is out! We outline 89 open technical issues in AI governance, plus resources and 100+ research questions that technical experts can tackle to help AI governance efforts🧵 t.ly/Y-mQ1

thumb_up_off_alt184

chat_bubble_outline11

repeat46

shareShare

Rongwu Xu

@rongwu_xu

a year ago

☕️New paper 👉Our latest paper delves into LLMs' ability to perform safety self-correction, namely COURSE-CORRECTION. In this paper, we: - Benchmark course-correction ability - Improving using synthetic preferences. Paper: arxiv.org/pdf/2407.16637 Code: github.com/pillowsofwind/…

thumb_up_off_alt38

chat_bubble_outline4

repeat20

shareShare

Babelscape

@babelscape

10 months ago

Four of our industrial #PhD students, Stefan Bejgu, Pere-Lluís Huguet Cabot, Alessandro Scirè and Simone Tedeschi, were awarded their #PhD in #AI last Friday with the best grades (and two cum laude)! Congrats all! 👏 🎉 With Roberto Navigli, their advisor and Babelscape's scientific director, in the photo

Four of our industrial #PhD students, <a href="/SBejgu/">Stefan Bejgu</a>, <a href="/PereLluisHC/">Pere-Lluís Huguet Cabot</a>, <a href="/alescire94/">Alessandro Scirè</a> and <a href="/SimoneTedeschi_/">Simone Tedeschi</a>, were awarded their #PhD in #AI last Friday with the best grades (and two cum laude)! Congrats all! 👏 🎉 With <a href="/RNavigli/">Roberto Navigli</a>, their advisor and Babelscape's scientific director, in the photo

thumb_up_off_alt12

chat_bubble_outline0

repeat5

shareShare

SapienzaNLP

@sapienzanlp

10 months ago

Last week 5 in our group received their #PhD in #AI & #Engineering in #ComputerScience! Stefan Bejgu, Pere-Lluís Huguet Cabot, Riccardo Orlando, Alessandro Scirè, and Simone Tedeschi, all with the highest grade (+2 cum laude)! Congrats all: we are very proud of you! Four of them were/are @Babelscape

Last week 5 in our group received their #PhD in #AI & #Engineering in #ComputerScience! <a href="/SBejgu/">Stefan Bejgu</a>, <a href="/PereLluisHC/">Pere-Lluís Huguet Cabot</a>, <a href="/RiccardoRicOrl/">Riccardo Orlando</a>, <a href="/alescire94/">Alessandro Scirè</a>, and <a href="/SimoneTedeschi_/">Simone Tedeschi</a>, all with the highest grade (+2 cum laude)! Congrats all: we are very proud of you! Four of them were/are @Babelscape

thumb_up_off_alt18

chat_bubble_outline1

repeat4

shareShare

Emmy Liu

@_emliu

8 months ago

What design decisions in LLM training affect the final performance of LLMs? Scaling model size and training data is important, but it's not the only thing. We performed an analysis of 90+ open-weights models to answer this question. 🧵 arxiv.org/abs/2503.03862 (1/12)

thumb_up_off_alt213

chat_bubble_outline5

repeat53

shareShare

Simone Tedeschi

Matt Shumer

Barbara McGillivray

Hoyeon Chang

Babelscape

Hitesh Patel

Hitesh Patel

Steffi Chern

Anka Reuel | @ankareuel.bsky.social

Rongwu Xu

Babelscape

SapienzaNLP

Emmy Liu