Dan Goldstein (@dggoldst) 's Twitter Profile
Dan Goldstein

@dggoldst

Senior Principal Research Manager at Microsoft Research NYC. Economics and Computation Group. Distinguished Scholar at Wharton.

ID: 9541832

linkhttp://www.dangoldstein.com calendar_today19-10-2007 12:59:13

9,9K Tweet

10,10K Followers

1,1K Following

Econometrica (@ecmaeditors) 's Twitter Profile Photo

Why fund highly novel research? It can improve the evolution of knowledge by guiding future researchers. We propose a model in which researchers decide which questions to address and at what intensity to search for the answer based on existing knowledge. econometricsociety.org/publications/e…

Why fund highly novel research? It can improve the evolution of knowledge by guiding future researchers. We propose a model in which researchers decide which questions to address and at what intensity to search for the answer based on existing knowledge. econometricsociety.org/publications/e…
matt hardy (@mdahardy) 's Twitter Profile Photo

New test: Operator vs Claude taking a B2B CEO survey. Both give sensible answers, but Operator dumps text all at once and Claude streams in chunks. Typing like a human is hard!

Stefanie Stantcheva s-stantcheva.bsky.social (@s_stantcheva) 's Twitter Profile Photo

🚨 New data alert! Curious about how people really understand inflation—its causes, impacts & what governments should do about it? 📊 You can dive into the data from our project "People's understanding of inflation" here: socialeconomicslab.org/research/publi… Excited to see new analyses!

Andrew Ng (@andrewyng) 's Twitter Profile Photo

Today's "DeepSeek selloff" in the stock market -- attributed to DeepSeek V3/R1 disrupting the tech ecosystem -- is another sign that the application layer is a great place to be. The foundation model layer being hyper-competitive is great for people building applications.

Dan Goldstein (@dggoldst) 's Twitter Profile Photo

It gets harder and harder to identify experts through tests because of cheating with AI. Forecasting performance will become the primary metric on which experts compete. Those who consistently outforecast AI are either psychic or possess valuable knowledge and judgment.

Dan Goldstein (@dggoldst) 's Twitter Profile Photo

While there are big crowds on the "AI hurts learning" train, let's realize that's like saying "computers hurt learning" Two papers where unaided learners do better after AI interaction: arxiv.org/abs/2502.02880 ssrn.com/abstract=46416…

jake hofman (@jakehofman) 's Twitter Profile Photo

Attention NYC undergrads: Applications are open for our 12th annual Data Science Summer school at Microsoft Research NYC! Apply here by April 15th: bit.ly/3pCQENh

Brian Guay (@brianmguay) 's Twitter Profile Photo

🚨Out today in PNAS PNASNews🚨 pnas.org/doi/10.1073/pn… Why do people overestimate the size of politically relevant groups (immigrant, LGBTQ, Jewish) and quantities (% of budget spent on foreign aid, % of refugees that are criminals)? We analyze 100k estimates to find out🧵👇

🚨Out today in PNAS <a href="/PNASNews/">PNASNews</a>🚨

pnas.org/doi/10.1073/pn…

Why do people overestimate the size of politically relevant groups (immigrant, LGBTQ, Jewish) and quantities (% of budget spent on foreign aid, % of refugees that are criminals)?

We analyze 100k estimates to find out🧵👇
Serina Chang (@serinachang5) 's Twitter Profile Photo

What happens when a static benchmark comes to life? ✨Introducing ChatBench, a large-scale user study where we *converted* MMLU questions into thousands of user-AI conversations. Then, we trained a user simulator on ChatBench to generate user-AI outcomes on unseen questions. 1/

What happens when a static benchmark comes to life? ✨Introducing ChatBench, a large-scale user study where we *converted* MMLU questions into thousands of user-AI conversations. Then, we trained a user simulator on ChatBench to generate user-AI outcomes on unseen questions. 1/
Dan Goldstein (@dggoldst) 's Twitter Profile Photo

🚨 Predoc at Microsoft Research 🚨 Do you have strong front-end programming and statistics skills but need research experience before heading off to graduate school for a computational social science PhD? Apply now for this summer! microsoft.com/en-us/research…

Dan Goldstein (@dggoldst) 's Twitter Profile Photo

A nice discussion of why it is so hard to outpredict models. References one of my favorite papers: "Improving out-of-population prediction. The complementary effects of model assistance and judgmental bootstrapping" dangoldstein.com/papers/Hardy_e…

Dan Goldstein (@dggoldst) 's Twitter Profile Photo

The Society for Judgment and Decision Making is pleased to announce that the latest newsletter is ready for download: sjdm.org/newsletters/ This issue contains announcements, conferences, jobs, and a new section entitled HotFresh Research News! decisionsciencenews.com/?p=7220