Marzena Karpinska (@mar_kar_) Twitter Tweets • TwiCopy

Abhilasha Ravichander

4 months ago

✈️ I'm in Vienna for #ACL2025NLP! Would love to meet and chat about training data, factuality, transparency, doing a PhD in AI🤖, or anything else. Please say hi if you see me!☕️🍰 I am hiring PhD students + interns (shorturl.at/fZnOq), let's chat if you are looking!

Likes: 96 · Replies: 9 · Reposts: 8

Satya Nadella

@satyanadella

4 months ago

Today we’re introducing Copilot Mode in Edge, our first step in reinventing the browser for the AI age. My favorite feature is multi-tab RAG. You can use Copilot to analyze your open tabs, like I do here with papers our team has published in nature journals over the last year.

Likes: 4.4K · Replies: 311 · Reposts: 587

Hita K

@_hitakam

4 months ago

Are you a researcher in CS or a CS-adjacent field who could use help in refining your research ideas? Want to try our new AI-powered tool that helps with just that in a paid user study? Details and sign up here! forms.gle/UPFjyJ59uuZ5Zb…

Likes: 19 · Replies: 2 · Reposts: 6

Niloofar (on faculty job market!)

@niloofar_mire

4 months ago

This is why there are years of research on usable privacy. This is not clear to lay users. We are building technology for “people”, not ourselves.

Likes: 56 · Replies: 2 · Reposts: 3

Mohit Iyyer

@mohitiyyer

4 months ago

GPT-5 lands first place on NoCha, our long-context book understanding benchmark. That said, this is a tiny improvement (~1%) over o1-preview, which was released almost one year ago. Have long-context models hit a wall? Accuracy of human readers is >97%... Long way to go!

Likes: 41 · Replies: 1 · Reposts: 10

Marzena Karpinska

@mar_kar_

4 months ago

We've added #gpt5 to #NoCha (a benchmark testing how well models can process long input). While it is doing well, it is ONLY a 1% improvement over a period of ONE YEAR. On the bright side, open-weights models, which started much lower (below random), are now getting closer.

Likes: 11 · Replies: 0 · Reposts: 1

鴨井遼

@ryokamoi_ja

4 months ago

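If you are interested in graduate school in the US for NLP, please reach out! Those applying this year are of course welcome, and so are those planning to apply a few years from now. I have done language study abroad, an exchange program, a master's, and a PhD, so I am happy to talk about any of them. You can email me through my website → ryokamoi.github.io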

Likes: 60 · Replies: 0 · Reposts: 11

Marzena Karpinska

@mar_kar_

4 months ago

It's all about WHICH humans and in what circumstances...

Likes: 4 · Replies: 0 · Reposts: 0

Mosh Levy

@mosh_levy

3 months ago

Producing reasoning texts boosts the capabilities of AI models, but do we humans correctly understand these texts? Our latest research suggests that we do not. This highlights a new angle on the "Are they transparent?" debate: they might be, but we misinterpret them. 🧵

Likes: 125 · Replies: 7 · Reposts: 27

Marzena Karpinska

@mar_kar_

3 months ago

Happy to see these papers accepted to #EMNLP2025! 🎊

Likes: 80 · Replies: 0 · Reposts: 2

rishanth rajendhran

@rishanthrajendh

3 months ago

Accepted to Findings of #EMNLP2025!

Likes: 17 · Replies: 0 · Reposts: 3

Najoung Kim 🫠

@najoungkim

3 months ago

Pulling this opportunity on research agent evaluation up one more time! The official title of the position will be "Senior research technician". Feel free to email either Sebastian Schuster or me directly if you have any questions. Link for more detailed info and where to apply in 🧵

Likes: 18 · Replies: 2 · Reposts: 7

Sebastian Schuster

@sebschu

3 months ago

Work with me and Najoung Kim 🫠 on a cool project evaluating the latest agents. Joining us at BU would be preferable but if you make a strong case, we **may** also be able to hire you in Vienna. #nlproc

Likes: 31 · Replies: 0 · Reposts: 7

EMNLP 2025

@emnlpmeeting

3 months ago

#EMNLP2025 is offering Virtual Registration Subsidies for those who would otherwise be unable to attend. Note that these are only available for participants who are NOT registering any paper. To apply, read the details here, and fill out the linked form: 2025.emnlp.org/calls/virtual_…

Likes: 14 · Replies: 1 · Reposts: 4

Clémentine Fourrier 🍊

@clefourrier

2 months ago

Updated the evaluation guidebook with a new deep dive! 2025 panorama of all the important and next level evaluations that you need to know to build *actually impactful and useful* models! (Assistant tasks, games, forecasting, and more) Tell me wyt! :) github.com/huggingface/ev…

Likes: 163 · Replies: 3 · Reposts: 26

Dayeon (Zoey) Ki

@zoeykii

2 months ago

1/ 🌍 Do #LLMs really treat all languages equally when citing evidence? 📑 In our new work, we uncover linguistic nepotism: models often trade off citation quality for language preference 👇

Likes: 42 · Replies: 2 · Reposts: 16

Chantal

@chantalshaib

2 months ago

"AI slop" seems to be everywhere, but what exactly makes text feel like slop? In our new work (w/ Tuhin Chakrabarty, Diego Garcia-Olano, byron wallace) we provide a systematic attempt at measuring AI slop in text! arxiv.org/abs/2509.19163 🧵 (1/7)

Likes: 221 · Replies: 14 · Reposts: 36

Tuhin Chakrabarty

@tuhinchakr

2 months ago

Low-quality AI-generated text is often referred to as #AISlop !! But how do humans quantify Slop? Are they consistent in their judgments? Is GPT5-thinking absolutely wrong in "thinking" its text is not Slop ⛳️ ? Look at Chantal's new work addressing these questions 👇

Likes: 16 · Replies: 0 · Reposts: 1

Jessy Li

@jessyjli

2 months ago

All of us (Kyle Mahowald, Kanishka Misra 🌊 and me) are looking for PhD students this cycle! If computational linguistics/NLP is your passion, join us at UT Austin! For my areas see jessyli.com

Likes: 179 · Replies: 4 · Reposts: 36

Abhilasha Ravichander

@lasha_nlp

2 months ago

It is PhD application season again 🍂 For those looking to do a PhD in AI, these are some useful resources 🤖: 1. Examples of statements of purpose (SOPs) for computer science PhD programs: cs-sop.org [1/4]

Likes: 369 · Replies: 6 · Reposts: 74
