Simone Balloccu (@simoneballoccu) Twitter Tweets • TwiCopy

Simone Balloccu

@simoneballoccu

+ Follow

(he/him)
Leading the ExpNLP lab @TUDarmstadt. Researching AI w.r.t human evaluation, behaviour change, safety and controllability, expert domains.

ID: 1285850193615884289

linkhttps://uccollab.github.io/ calendar_today22-07-2020 08:13:15

707 Tweet

303 Followers

223 Following

INLG 2025

@inlgmeeting

8 months ago

First #CallForPapers for #INLG2025! Submit work on any aspect of #NaturalLanguagGeneration, incl. but not limited to: rule-based data-to-text systems, summarisation and simplification with the latest LLMs, or new evaluation methods :) Deadline: 14 July 2025.inlgmeeting.org

thumb_up_off_alt12

chat_bubble_outline0

repeat5

shareShare

Mike Burnham

@ml_burn

8 months ago

Me after defending my dissertation:

thumb_up_off_alt14,14K

chat_bubble_outline24

repeat1,1K

shareShare

Edu

@educa_nlp

8 months ago

Excited to fly to Albuquerque to present my latest piece at NAACL! If you want to learn how to design human-centered #NLProc evaluation UIs, visit my poster on May 1, Hall 3, 14:00-15:30. We can also chat about NLG, hallucinations, lexical change, and anything in between! :)

Excited to fly to Albuquerque to present my latest piece at <a href="/naacl/">NAACL</a>!

If you want to learn how to design human-centered #NLProc evaluation UIs, visit my poster on May 1, Hall 3, 14:00-15:30.

We can also chat about NLG, hallucinations, lexical change, and anything in between! :)

thumb_up_off_alt7

chat_bubble_outline2

repeat1

shareShare

(((ل()(ل() 'yoav))))👾

@yoavgo

8 months ago

"LLM on way to replace doctors" gets published in Nature. meanwhile "LLM judgement not as good as human MDs" gets a spot in "Physical Therapy and Rehabilitation Journal".

thumb_up_off_alt680

chat_bubble_outline24

repeat54

shareShare

Ehud Reiter

@ehudreiter

7 months ago

New blog: Key messages from my NLG book Its been 6 months since my NLG book was released. I summarise what I think are its key messages, for rule-based NLG, ML and neural NLG, requirements, evaluation, safety/testing/maintainability, and applications. ehudreiter.com/2025/05/14/key…

thumb_up_off_alt4

chat_bubble_outline0

repeat1

shareShare

vas

@vasumanmoza

7 months ago

Claude 4 just refactored my entire codebase in one call. 25 tool invocations. 3,000+ new lines. 12 brand new files. It modularized everything. Broke up monoliths. Cleaned up spaghetti. None of it worked. But boy was it beautiful.

thumb_up_off_alt39,39K

chat_bubble_outline1,1K

repeat2,2K

shareShare

Amanda Hatwell

@awrites116

7 months ago

Best writing advice

thumb_up_off_alt1,1K

chat_bubble_outline44

repeat374

shareShare

Simone Balloccu

@simoneballoccu

6 months ago

Opinion: Authors should be allowed to add hidden prompts for LLMs in papers. If you lazily paste the paper you're supposed to judge on ChatGPT etc., you don't belong in peer reviewing.

thumb_up_off_alt4

chat_bubble_outline1

repeat0

shareShare

INLG 2025

@inlgmeeting

6 months ago

The Second #CallForPapers just went out and announces two of our keynote speakers: Verena Rieser (Google DeepMind) & Minlie Huang (Tsinghua University)! Submit your work on NLG, whether LLM or rule-based :D Deadline: 14 July 2025.inlgmeeting.org (first posted elsewhere)

thumb_up_off_alt2

chat_bubble_outline0

repeat2

shareShare

Simone Balloccu

@simoneballoccu

6 months ago

We just received some reviews for EMNLP and I'm filled with an immense amount of rage.

thumb_up_off_alt12

chat_bubble_outline2

repeat0

shareShare

Neil Renic

@nc_renic

6 months ago

Reviewing Getting reviewed

thumb_up_off_alt3,3K

chat_bubble_outline7

repeat410

shareShare

Mathieu

@miniapeur

6 months ago

thumb_up_off_alt1,1K

chat_bubble_outline23

repeat159

shareShare

Simone Balloccu

@simoneballoccu

6 months ago

Remember my tweet from the other day? Well, this is not what I meant.

thumb_up_off_alt3

chat_bubble_outline1

repeat0

shareShare

Dr. Dominic Ng

@drdominicng

6 months ago

Microsoft claims their new AI framework diagnoses 4x better than doctors. I'm a medical doctor and I actually read the paper. Here's my perspective on why this is both impressive AND misleading ... 🧵

thumb_up_off_alt8,8K

chat_bubble_outline273

repeat1,1K

shareShare

Mickey Friedman

@mickeyxfriedman

6 months ago

as a parent, i will never push a career path onto my kids. i would give them full freedom to decide which AI lab they want to join for $100 mil

thumb_up_off_alt11,11K

chat_bubble_outline74

repeat633

shareShare

Marco Guerini

@m_guerini

6 months ago

I love this analysis of the limitations of the experimental setting/design. This is the kind of expert insight and methodological rigor we need when evaluating LLMs!

thumb_up_off_alt4

chat_bubble_outline0

repeat1

shareShare

Jia-Bin Huang

@jbhuang0604

6 months ago

Writing a rebuttal is 30% technical and 70% reviewers' psychology.

thumb_up_off_alt300

chat_bubble_outline13

repeat11

shareShare

Leon Derczynski ✍🏻 🍂🍏

@leonderczynski

5 months ago

did people get greedy and sloppy and ruin it like with almost everything ever? you tell me

thumb_up_off_alt8

chat_bubble_outline2

repeat3

shareShare

Vilém Zouhar

@zouharvi

5 months ago

You have a budget to human-evaluate 100 inputs to your models, but your dataset is 10,000 inputs. Do not just pick 100 randomly!🙅 We can do better. "How to Select Datapoints for Efficient Human Evaluation of NLG Models?" shows how.🕵️ (random is still a devilishly good baseline)

thumb_up_off_alt72

chat_bubble_outline2

repeat14

shareShare

Ehud Reiter

@ehudreiter

5 months ago

Motivated by recent discussion with my group: Ignore subjective statements such as "I find LLMs to be incredibly useful for XX", especially when made by people (such as AI companies or gurus) who have strong biases/incentives/COI .

thumb_up_off_alt3

chat_bubble_outline0

repeat1

shareShare