Karim Kamal (@karimash14) 's Twitter Profile
Karim Kamal

@karimash14

ID: 266492936

calendar_today15-03-2011 09:44:16

14,14K Tweet

308 Followers

2,2K Following

Rada Mihalcea (@radamihalcea) 's Twitter Profile Photo

An interesting argument by Ev (like in 'evidence', not Eve) Fedorenko 🇺🇦 in her #EMNLP2021 keynote: humans use language for communication, not for thought—and so language models should not be judged for their ability to reason. I would add: we should also not use lang.models as proxies for cognitive abilities.

Alfredo Canziani (@alfcnz) 's Twitter Profile Photo

What a pleasure it's been to teach 3 hours of self-supervised energy-based models at Sapienza Università di Roma with a super warm audience, split between physically present and remotely connected students. Thanks, Simone Scardapane, for hosting me today. 😋😋😋

Michael Auli (@michaelauli) 's Twitter Profile Photo

New work! Humans appear to learn similarly for different modalities and so should machines! data2vec uses the same self-supervised algorithm to train models for vision, speech, and nlp. Paper: ai.facebook.com/research/data2… Blog: ai.facebook.com/blog/the-first… Code: github.com/pytorch/fairse…

Michele Bevilacqua (@michelebevila20) 's Twitter Profile Photo

New work on autoregressive language models for retrieval! We train our model, SEAL (Search Engines with Autoregressive LMs) to produce text snippets occurring somewhere in the corpus, in relevant documents. We use the generated ngrams to rank documents in the corpus. Thread 👇

Sebastian Riedel (@riedelcastro@sigmoid.social) (@riedelcastro) 's Twitter Profile Photo

Differentiable Search Indices showed that autoregressive transformers can operate as “search engines” to some extent, but not for full scale corpora. FM-Indices change this! Really enjoyed this work led by the amazing Michele Bevilacqua!

Alessandro Scirè (@alescire94) 's Twitter Profile Photo

Exciting strides in text summarization with LLMs 🚀but verifying their factual accuracy is still an open challenge 🤔 We introduce FENICE, a factuality-oriented metric for summarization with a strong focus on interpretability🔍arxiv.org/abs/2403.02270 #NLProc #LLMs #Factuality

Alessandro Scirè (@alescire94) 's Twitter Profile Photo

By leveraging NLI-based alignments, FENICE matches claims extracted from the summary to passages in the source document. This allows identifying the specific sections that support or contradict each claim, providing deeper insights into the factual accuracy of a generated summary

By leveraging NLI-based alignments, FENICE matches claims extracted from the summary to passages in the source document. This allows identifying the specific sections that support or contradict each claim, providing deeper insights into the factual accuracy of a generated summary
Alessandro Scirè (@alescire94) 's Twitter Profile Photo

FENICE achieves state of the art performance on AGGREFACT, the de-facto benchmark for this task, and excels in our set of human annotations for long-form summarization evaluation. arxiv.org/abs/2403.02270 Joint work between SapienzaNLP and Babelscape Karim Kamal Roberto Navigli

SapienzaNLP (@sapienzanlp) 's Twitter Profile Photo

FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction - Findings Poster Session 4 📅14th August @ 12:15 🗣️ Alessandro Scirè Karim Kamal