Stefan Grafberger (@sgrafberger) Twitter Tweets • TwiCopy

Gate.io

5 hours ago

🔥The 9th Round of Easy Loan, Earn $40 Reward is in progress❗️ ⏰ Promotion Period: January 15th - Feburary 15th, 2025 👉 Register now and check more details at gate.io/campaigns/358

thumb_up_off_alt34

chat_bubble_outline39

repeat6

shareShare

We just open-sourced our prototype StreamDQ, a library built on top of Apache Flink for defining "unit tests for data", which measure data quality in large data streams. github.com/stefan-grafber… Joint work with Sebastian and Paul Groth.

thumb_up_off_alt24

chat_bubble_outline0

repeat4

shareShare

Ce Zhang

@ce_zhang

2 years ago

Congrats Sebastian Stefan Grafberger! Data quality for ML is becoming increasingly important, excited to see ArgusEyes brings PTIME Data Shapely into practice -- to help improve the data to improve model! - system: ssc.io/publication/pr… - PTME Shapley: arxiv.org/abs/2204.11131

thumb_up_off_alt24

chat_bubble_outline0

repeat4

shareShare

Maximilian Kuschewski

@maxikuschewski

2 years ago

Excited to present BtrBlocks, our new columnar compression format for data lakes at the SIGMOD/PODS 2025 compression and fairness session at ~5pm PDT github.com/maxi-k/btrbloc…

Excited to present BtrBlocks, our new columnar compression format for data lakes at the <a href="/SIGMODConf/">SIGMOD/PODS 2025</a> compression and fairness session at ~5pm PDT github.com/maxi-k/btrbloc…

thumb_up_off_alt6

chat_bubble_outline2

repeat2

shareShare

Stefan Grafberger

@sgrafberger

2 years ago

Today I started my research internship in Redmond with the Microsoft Microsoft Gray Systems Lab. Looking forward to an amazing summer!

Today I started my research internship in Redmond with the Microsoft <a href="/GraySystemsLab/">Microsoft Gray Systems Lab</a>. Looking forward to an amazing summer!

thumb_up_off_alt63

chat_bubble_outline2

repeat0

shareShare

TruLens

@trulensml

2 years ago

Awesome paper by xiaozhong lyu Stefan Grafberger @ce__zhang Sebastian shows how #RAG can be improved through data importance learning. The approach learns weights for data sources based on their performance on a validation set and then re-weights or prunes the corpus. 1/3

thumb_up_off_alt5

chat_bubble_outline1

repeat4

shareShare

Stefan Grafberger

@sgrafberger

a year ago

Our paper "Towards Interactively Improving ML Data Preparation Code via 'Shadow Pipelines'" has been accepted for the DEEM Workshop @ SIGMOD at SIGMOD! 🎉 In this vision paper, we present our initial ideas for my next research project. Joint work with Sebastian and Paul Groth.

Our paper "Towards Interactively Improving ML Data Preparation Code via 'Shadow Pipelines'" has been accepted for the <a href="/deem_workshop/">DEEM Workshop @ SIGMOD</a> at SIGMOD! 🎉

In this vision paper, we present our initial ideas for my next research project.

Joint work with <a href="/sscdotopen/">Sebastian</a> and <a href="/pgroth/">Paul Groth</a>.

thumb_up_off_alt54

chat_bubble_outline5

repeat6

shareShare

DEEM Workshop @ SIGMOD

@deem_workshop

a year ago

We can't wait for DEEM Workshop @ SIGMOD 2024 to get started @sigmodconf! Join us tomorrow Sunday 9 June from 9am in the Tupungato room!! Check out the full program at: deem-workshop.github.io 🇨🇱

thumb_up_off_alt10

chat_bubble_outline1

repeat5

shareShare

Stefan Grafberger

@sgrafberger

a year ago

Looking forward to presenting our vision "Towards Interactively Improving ML Data Preparation Code via 'Shadow Pipelines'" at the DEEM Workshop @ SIGMOD today! The talk will be around 10:40 a.m. in the Tupungato room. stefan-grafberger.com/shadow-pipelin…

thumb_up_off_alt17

chat_bubble_outline0

repeat3

shareShare

Stefan Grafberger

@sgrafberger

a year ago

Life update: After three amazing years in Amsterdam, I moved to Berlin to finish my PhD with Sebastian at BIFOLD. Very excited to join the data management community in Berlin!

thumb_up_off_alt43

chat_bubble_outline5

repeat0

shareShare

Olga Ovcharenko

@o_ovcharenko

a year ago

📢 Excited to share Feature Clock, an open-source library and paper accepted at IEEE VIS! Feature Clock enhances the explainability and compactness of visualizations of high-dimensional effects in two-dimensional plots. Big thanks to my co-authors Valentina Boeva and Rita Sevastjanova!

📢 Excited to share Feature Clock, an open-source library and paper accepted at <a href="/ieeevis/">IEEE VIS</a>! Feature Clock enhances the explainability and compactness of visualizations of high-dimensional effects in two-dimensional plots.

Big thanks to my co-authors <a href="/val_boeva/">Valentina Boeva</a> and <a href="/RSevastjanova/">Rita Sevastjanova</a>!

thumb_up_off_alt15

chat_bubble_outline1

repeat7

shareShare

BIFOLD

@bifoldberlin

10 months ago

"Snapcase" allows users to regain control over their recommendations in online shopping platforms. #VLDB24: a.o. BIFOLD researchers introduced "Snapcase," a demo paper that addresses the concept of machine unlearning. bifold.berlin/news-events/ne… Sebastian Stefan Grafberger Maarten de Rijke

thumb_up_off_alt6

chat_bubble_outline0

repeat1

shareShare

Sebastian

@sscdotopen

7 months ago

Interested in a *PhD in Data Engineering* in Berlin? Our institute has several openings for PhD positions as part of its graduate school, see the post below! Here is how to work with the DEEM Lab as part of the graduate school deem.berlin/#jobs-189196

thumb_up_off_alt13

chat_bubble_outline0

repeat4

shareShare

DEEM Workshop @ SIGMOD

@deem_workshop

6 months ago

The Data Management for End-to-End Machine Learning workshop (DEEM Workshop @ SIGMOD) will be back at #SIGMOD2025! ✨ 🔗 Check out the CfP: deem-workshop.github.io 📝 Submission deadline: March 21 📢 Notifications: April 25 Join us for the 9th edition in Berlin! #DEEM2025

thumb_up_off_alt7

chat_bubble_outline1

repeat6

shareShare

Stefan Grafberger

@sgrafberger

5 months ago

Our vision paper "Towards Regaining Control over Messy ML Pipelines" was accepted for the DAIS@ICDE2025 at IEEE ICDE Conference! Initial experiments show LLMs are promising for extracting declarative query plans from messy ML code. Joint work w/ Hao Chen, Olga Ovcharenko, Sebastian

Our vision paper "Towards Regaining Control over Messy ML Pipelines" was accepted for the <a href="/DAIS_workshop/">DAIS@ICDE2025</a> at <a href="/icdeconf/">IEEE ICDE Conference</a>!

Initial experiments show LLMs are promising for extracting declarative query plans from messy ML code.

Joint work w/ <a href="/guangchen811/">Hao Chen</a>, <a href="/o_ovcharenko/">Olga Ovcharenko</a>, <a href="/sscdotopen/">Sebastian</a>

thumb_up_off_alt15

chat_bubble_outline1

repeat2

shareShare

Stefan Grafberger

@sgrafberger

5 months ago

Thanks a lot for the invitation! I really enjoyed the workshop!

thumb_up_off_alt7

chat_bubble_outline0

repeat0

shareShare

DEEM Workshop @ SIGMOD

@deem_workshop

4 months ago

📢 Deadline extension for DEEM 2025 SIGMOD/PODS 2025! Following requests, we're extending the submission deadline to April 1, 5pm Pacific Time. More info at: deem-workshop.github.io

thumb_up_off_alt4

chat_bubble_outline0

repeat3

shareShare

Stefan Grafberger

@sgrafberger

2 months ago

Our demo "mlidea: Interactively Improving ML Data Preparation Code via 'Shadow Pipelines'" was accepted at VLDB! 🥳 We demo suggestions for ML pipelines, similar to IntelliJ code inspections or Grammarly suggestions youtu.be/ePGm1J6S2qk Joint work w/ Sebastian Paul Groth

thumb_up_off_alt16

chat_bubble_outline2

repeat1

shareShare