Goran Glavaš (@gg42554) Twitter Tweets • TwiCopy

Goran Glavaš

@gg42554

+ Follow

Professor for #NLProc @Uni_WUE. Moving to Bluesky: bsky.app/profile/gglava…

ID: 382302464

linkhttps://sites.google.com/view/goranglavas calendar_today29-09-2011 20:48:47

405 Tweet

1,1K Followers

257 Following

Andreea Iana

a year ago

🤔 If you're interested in more #multilingual news data for other #NLProc tasks, check out PolyNews 📰 on Hugging Face ! w/ 77 low & high-resource languages in 19 scripts 🌍 🤗 huggingface.co/datasets/aiana… 📃 arxiv.org/abs/2406.12634 w/ Fabian David Schmidt Goran Glavaš Heiko Paulheim Data and Web Science Group

thumb_up_off_alt12

chat_bubble_outline0

repeat3

shareShare

Goran Glavaš

a year ago

Check out our massively multilingual and (partially) multi-parallel news dataset PolyNews! Great work by Andreea Iana on compiling this massively multilingual domain-specific data as well as on using it to improve multilingual sentence encoders for news recommendation!

thumb_up_off_alt12

chat_bubble_outline0

repeat0

shareShare

Goran Glavaš

a year ago

Great work by Andreea Iana who put an immense effort to collect and clean such massively multi-parallel news dataset. I reckon that that such a domain-specific multi-parallel corpus is of quite some interest for the MT folks :)!

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

Goran Glavaš

a year ago

Great effort by Gregor Geigle: we test if explicit grounding objectives reduce hallucination of Large Vision-Language Models. We confirm that they yield better fine-grained image understanding performance, but this does not propagate to less hallucination in open captioning!

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Goran Glavaš

a year ago

Can your Large Vision-Language Model differentiate tell a Keeshond from a Samoyed? We show that fine-grained object classification is a skill quite complementary to image understanding tested by existing benchmarks and that LVLMs don't excel on the task, to say the least.

thumb_up_off_alt1

chat_bubble_outline0

repeat1

shareShare

Goran Glavaš

a year ago

You can now get our multilingual multi-parallel news recommendation dataset from HuggingFace!

thumb_up_off_alt5

chat_bubble_outline0

repeat0

shareShare

Goran Glavaš

a year ago

I really enjoyed working with Valentin Hofmann on this! The highlight of this work for me is Figure 6: rendering toponym names from their embeddings obtained from the LM after geoadaptation, we basically obtained the map (for the BCMS area)!

thumb_up_off_alt6

chat_bubble_outline1

repeat0

shareShare

UKP Lab

a year ago

Code LMs are improving fast 📈, but they are limited in low-resource programming languages (PLs). 😬 In this #ACL2024NLP paper, we pre-train code LMs on source-compiler IR pairs for low-resource PLs💪 – 🧵 (1/7) Poster: Mon 4 PM - Oral: Wed 10:30 AM 📄: arxiv.org/abs/2403.03894

Code LMs are improving fast 📈, but they are limited in low-resource programming languages (PLs). 😬

In this #ACL2024NLP paper, we pre-train code LMs on source-compiler IR pairs for low-resource PLs💪 – 🧵 (1/7)
Poster: Mon 4 PM - Oral: Wed 10:30 AM
📄: arxiv.org/abs/2403.03894

thumb_up_off_alt19

chat_bubble_outline2

repeat5

shareShare

Goran Glavaš

a year ago

Intermediate code representations like LLVM can indeed be a great facilitator of cross-programming-language transfer for Code-LLMs! Well deserved Oustanding Paper Award for Indraneil Paul for this great work! It was a pleasure to be part of the effort!

thumb_up_off_alt20

chat_bubble_outline1

repeat2

shareShare

Goran Glavaš

a year ago

If you're looking on the fly customization of your news recommendation function, then MANNeR is the framework for you! Great work by Andreea Iana!

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Andreea Iana

a year ago

🔎 What's beneath the surface of encoder architectures in news #recsys? 🤔 Our latest work w/ Goran Glavaš Heiko Paulheim goes beyond recommendation accuracy to shed💡on how news & user encoders behave w.r.t. representational similarity! 🔗 Read more: arxiv.org/abs/2410.01470 👇

thumb_up_off_alt9

chat_bubble_outline1

repeat2

shareShare

Goran Glavaš

a year ago

Yes, come to Fabian David Schmidt's poster on Tuesday! (even I will be there and I haven't been to a conference in 2.5 years :))

thumb_up_off_alt8

chat_bubble_outline0

repeat1

shareShare

Goran Glavaš

a year ago

If you're into Vision-LLMS, come check Gregor Geigle's amazing work! See you in Miami ;)

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Goran Glavaš

a year ago

Tired of work that probes LLMs or uses them as agents? Andreea Iana will present something cool and different: come check her great work on flexible news recommendation.

thumb_up_off_alt4

chat_bubble_outline0

repeat1

shareShare

Goran Glavaš

a year ago

Great work by fschmidt! Afaik, it's the first massively multilingual benchmark for spoken language understanding (and not just topical classification of speech utterances :). Ready "out-of-the-box" on HF datasets. Paper coming soon (but all important details already described).

thumb_up_off_alt4

chat_bubble_outline1

repeat0

shareShare

Goran Glavaš

10 months ago

If you're looking for a good recipe for training a multilingual LVLM or a just a very strong multilingual LVLM to use, supporting 100 languages (built following the identifed "optimal" recipe), check our latest work! Gregor Geigle and Florian Schneider as lead authors!

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Goran Glavaš

10 months ago

Great new work on multilingual news recommendation (NR) by Andreea Iana! New datasets for multilingual and cross-lingual NR as well as a SotA NR model, new domain-adapted from a multilingual sentence encoder!

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Fabian David Schmidt

9 months ago

Joint work with Florian Schneider, Chris Biemann, and Goran Glavaš My first paper on multilingual vision-language, and couldn't be happier how this work turned out!🙂

thumb_up_off_alt5

chat_bubble_outline0

repeat1

shareShare

Goran Glavaš

8 months ago

youtu.be/wcfm2Zn-IpQ?si…

thumb_up_off_alt0

chat_bubble_outline0

repeat0

shareShare

Andreea Iana

6 months ago

📢 Introducing Walk&Retrieve, a simple yet effective zero-shot #RAG framework based on #knowledgegraph walks! Arxiv : arxiv.org/abs/2505.16849 GitHub: github.com/MartinBoecklin… Joint work w/ Martin Böckling Heiko Paulheim Data and Web Science Group IR-RAG #SIGIR2025 Details 👇

📢 Introducing Walk&Retrieve, a simple yet effective zero-shot #RAG framework based on #knowledgegraph walks!

Arxiv : arxiv.org/abs/2505.16849
GitHub: github.com/MartinBoecklin…

Joint work w/ Martin Böckling <a href="/heikopaulheim/">Heiko Paulheim</a> <a href="/dwsunima/">Data and Web Science Group</a>

<a href="/ir_rag_sigir/">IR-RAG</a> #SIGIR2025

Details 👇

thumb_up_off_alt11

chat_bubble_outline1

repeat4

shareShare