Sebastian Lapuschkin (@slapuschkin) Twitter Tweets • TwiCopy

Fraunhofer HHI

a year ago

How can we use AI to automate our #rail transportation, when we all know how bad mobile data service can be on the train? In #BerDiBa, our scientists are using efficient computing to work on the future of #mobility. Learn more here: hhi.fraunhofer.de/en/departments…

thumb_up_off_alt4

chat_bubble_outline0

repeat2

shareShare

Dilyara Bareeva

@di_lya

a year ago

Check out our paper "Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression" at the #CVPR2024 #SAIAD workshop on June 18! We introduce a targeted, training-free model correction method that minimizes collateral damage within models.

thumb_up_off_alt7

chat_bubble_outline1

repeat3

shareShare

Sebastian Lapuschkin

@slapuschkin

a year ago

New paper alert! With EMPRT and SMPRT we fix some glaring issues with the original Model Parameter Randomization Test (MPRT). Visit our talk at the 2nd XAI World Conference in Valetta, Malta, on July 17th! more info & paper: tinyurl.com/lnkdn-smprt-em…

thumb_up_off_alt14

chat_bubble_outline0

repeat3

shareShare

Anna Hedström

@anna_hedstroem

a year ago

Our latest paper is out! On Tuesday, 17 July, we will present our latest work "A Fresh Look at Sanity Checks for Saliency Maps" at the 2nd XAI World Conference in Valetta, Malta. I hope to see you there! Read the paper: link.springer.com/chapter/10.100… Code: github.com/annahedstroem/…

thumb_up_off_alt15

chat_bubble_outline0

repeat3

shareShare

Sebastian Lapuschkin

@slapuschkin

a year ago

🚨 AttnLRP is at #ICML! Explain your transformers in a fast, efficient and best possible way! 🚨 more info: tinyurl.com/lnkdn-attnlrp paper: proceedings.mlr.press/v235/achtibat2… code: github.com/rachtibat/LXT

thumb_up_off_alt13

chat_bubble_outline2

repeat4

shareShare

Sebastian Lapuschkin

@slapuschkin

a year ago

Go meet our #AttnLRP - Team at #ICML -- Reduan Achtibat, (Sayed) Erfan Hatefi and Aakriti Jain, right now! Discuss how to best (ie, fast, cheap, faithful!) explain your transformer-based model during Poster Session 2!

thumb_up_off_alt17

chat_bubble_outline1

repeat2

shareShare

Kirill Bykov

@kirill_bykov

a year ago

I'm very excited to share some great news: the paper I supervised has been accepted to #NeurIPS2024. Huge congratulations to my student and soon-to-be PhD candidate Laura Kopf on her first lead-author publication—what an incredible way to start a scientific journey!

thumb_up_off_alt27

chat_bubble_outline1

repeat3

shareShare

Understandable Machine Intelligence Lab

@umi_lab_ai

a year ago

🚨 New paper alert! 🚨 Excited to share Quanda—a toolkit for evaluating data attribution methods, accepted at #NeurIPS ATTRIB workshop!🎉 A collaboration between Fraunhofer HHI & BIFOLD. 🔗 Paper: arxiv.org/abs/2410.07158 🔗 Code: github.com/dilyabareeva/q…

thumb_up_off_alt12

chat_bubble_outline1

repeat4

shareShare

Anna Hedström

@anna_hedstroem

a year ago

We (Dilyara Bareeva, Galip Ümit Yolcu, Niklas Schmolenski, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin + me) just launched QUANDA — a training data attribution TDA software Built for researchers curious to apply/ develop/ evaluate TDA methods GitHub repo: github.com/dilyabareeva/q…

thumb_up_off_alt14

chat_bubble_outline1

repeat2

shareShare

Dilyara Bareeva

@di_lya

a year ago

✨Introducing quanda: an open-source library for benchmarking training data attribution (TDA) methods in PyTorch! Quanda offers a user-friendly interface for ⚖️ evaluating attributions and 📊 benchmarking TDA methods across diverse metrics.

thumb_up_off_alt31

chat_bubble_outline1

repeat8

shareShare

Fraunhofer HHI

@fraunhoferhhi

a year ago

Congrats to Fraunhofer HHI #XAI experts Wojciech Samek & Sebastian Lapuschkin! @Handelsblatt has listed them among the most cited #AI researchers in Germany, and Fraunhofer HHI as one of the most important #AIhubs in the country. 👉hhi.fraunhofer.de/kiforschungfra…

Congrats to Fraunhofer HHI #XAI experts <a href="/WojciechSamek/">Wojciech Samek</a> & <a href="/SLapuschkin/">Sebastian Lapuschkin</a>!

@Handelsblatt has listed them among the most cited #AI researchers in Germany, and Fraunhofer HHI as one of the most important #AIhubs in the country.
👉hhi.fraunhofer.de/kiforschungfra…

thumb_up_off_alt18

chat_bubble_outline0

repeat4

shareShare

Understandable Machine Intelligence Lab

@umi_lab_ai

9 months ago

🚨 New paper alert! 🚨 We’re excited to share our latest work on interpretability evaluation: "Evaluating Interpretable Methods via Geometric Alignment of Functional Distortions" 📜 Accepted at TMLR 🎉 🔥 Survey certification 🔥 📖 Read: openreview.net/pdf?id=ukLxqA8…

thumb_up_off_alt5

chat_bubble_outline1

repeat5

shareShare

Understandable Machine Intelligence Lab

@umi_lab_ai

9 months ago

Huge thanks again to our co-authors! Anna Hedström Philine Bommer Tom Burns Sebastian Lapuschkin Wojciech Samek Marina M.-C. Höhne (née Vidovic) TU Berlin Fraunhofer HHI BIFOLD Brown University Cornell University OIST Leibniz-Institut für Agrartechnik und Bioökonomie Universität Potsdam

thumb_up_off_alt4

chat_bubble_outline0

repeat2

shareShare

Patrick Kahardipraja

@pkhdipraja

5 months ago

ICL allows LLMs to adapt to new tasks and at the same time enables them to access external knowledge through RAG. How does the latter work? TL;DR we find that certain attention heads perform various, distinct operations on the input prompt for QA! arxiv.org/abs/2505.15807 1/

thumb_up_off_alt5

chat_bubble_outline1

repeat2

shareShare

Sebastian Lapuschkin

@slapuschkin

5 months ago

Have had enough of the fake "sources" "cited" by ChatGPT? We have the solution in the form of low-cost causal citations for LLMs: arxiv.org/abs/2505.15807 Thanks to my amazing co-authors Patrick Kahardipraja , Reduan Achtibat, Thomas Wiegand and Wojciech Samek from Fraunhofer HHI

thumb_up_off_alt4

chat_bubble_outline0

repeat1

shareShare

Egor Zverev @ICLR 2025

@egor_zverev_ai

4 months ago

🚀 We’ve released the source code for 𝗔𝗦𝗜𝗗𝗘 (presented as an 𝗢𝗿𝗮𝗹 at the #ICLR2025 BuildTrust workshop)! 🔍ASIDE boosts prompt injection robustness without safety-tuning: we simply rotate embeddings of marked tokens by 90° during instruction-tuning and inference 👇code

thumb_up_off_alt9

chat_bubble_outline1

repeat6

shareShare