Zico Kolter (@zicokolter) Twitter Tweets • TwiCopy

Zico Kolter

@zicokolter

+ Follow

Professor and Head of Machine Learning Department at @CarnegieMellon. Board member @OpenAI. Chief Technical Advisor @GraySwanAI. Chief Expert @BoschGlobal.

ID: 841499391508779008

linkhttp://zicokolter.com calendar_today14-03-2017 04:01:04

617 Tweet

21,21K Followers

645 Following

Fahim Tajwar

@fahimtajwar10

8 months ago

Interacting with the external world and reacting based on outcomes are crucial capabilities of agentic systems, but existing LLMs’ ability to do so is limited. Introducing Paprika 🌶️, our work on making LLMs general decision makers than can solve new tasks zero-shot. 🧵 1/n

thumb_up_off_alt451

chat_bubble_outline5

repeat92

shareShare

Pratyush Maini

@pratyushmaini

7 months ago

1/Being in academia is such a privilege: You get to collaborate with insanely talented & passionate students on their journey to upskill themselves. Very excited to share *OpenUnlearning*: a unified, easily extensible framework for unlearning led by Anmol Mekala Vineeth🧵

thumb_up_off_alt141

chat_bubble_outline4

repeat26

shareShare

Christina Baek

@_christinabaek

6 months ago

Are current reasoning models optimal for test-time scaling? 🌠 No! Models make the same incorrect guess over and over again. We show that you can fix this problem w/o any crazy tricks 💫 – just do weight ensembling (WiSE-FT) for big gains on math! 1/N

thumb_up_off_alt478

chat_bubble_outline6

repeat103

shareShare

CMU School of Computer Science

@scsatcmu

6 months ago

Huge thank you to NVIDIA Data Center for gifting a brand new #NVIDIADGX B200 to CMU’s Catalyst Research Group! This AI supercomputing system will afford Catalyst the ability to run and test their work on a world-class unified AI platform.

thumb_up_off_alt143

chat_bubble_outline3

repeat29

shareShare

Zico Kolter

@zicokolter

6 months ago

Thanks NVIDIA Data Center for the DGX B200 machine for the CMU Catalyst group! I'm perhaps already a bit too enthralled by it in the photos...

thumb_up_off_alt103

chat_bubble_outline3

repeat14

shareShare

Kyunghyun Cho

@kchonyc

6 months ago

spicy Zico Kolter

spicy <a href="/zicokolter/">Zico Kolter</a>

thumb_up_off_alt242

chat_bubble_outline7

repeat9

shareShare

Christina Baek

@_christinabaek

6 months ago

When we train models to do QA, are we robustly improving context dependency? No! In our ICLR Oral (Fri 11 AM), we show that if the base model knows the facts already, it shortcuts and learns to ignore the context completely! Visit us to learn more about knowledge conflicts 😀

thumb_up_off_alt102

chat_bubble_outline3

repeat16

shareShare

Yutong (Kelly) He

@electronickale

6 months ago

✨ Love 4o-style image generation but prefer to use Midjourney? Tired of manual prompt crafting from inspo images? PRISM to the rescue! 🖼️→📝→🖼️ We automate black-box prompt engineering—no training, no embeddings, just accurate, readable prompts from your inspo images! 1/🧵

thumb_up_off_alt83

chat_bubble_outline2

repeat31

shareShare

Pratyush Maini

@pratyushmaini

6 months ago

Looking forward to giving a talk this Friday OpenAI with Zhili Feng on some of our privacy & memorization research + how it applies to production LLMs! We've been gaining momentum on detecting, quantifying & erasing memorization; excited to explore its real-world impact!

Looking forward to giving a talk this Friday <a href="/OpenAI/">OpenAI</a> with <a href="/zhilifeng/">Zhili Feng</a> on some of our privacy & memorization research + how it applies to production LLMs!

We've been gaining momentum on detecting, quantifying & erasing memorization; excited to explore its real-world impact!

thumb_up_off_alt101

chat_bubble_outline0

repeat10

shareShare

Runtian Zhai

@runtianzhai

6 months ago

A shorter version of the first three chapters of my thesis is accepted by ICML 2025. It provides a quick start for those interested in learning about the contexture theory. Check it out: arxiv.org/abs/2505.01557

thumb_up_off_alt36

chat_bubble_outline1

repeat2

shareShare

Pratyush Maini

@pratyushmaini

6 months ago

Excited to be talking today about how research into memorization provides a fundamentally different lens on safety!

thumb_up_off_alt99

chat_bubble_outline3

repeat9

shareShare

Zhengyang Geng

@zhengyanggeng

5 months ago

Excited to share our work with my amazing collaborators, Goodeat, Xingjian Bai, Zico Kolter, and Kaiming. In a word, we show an “identity learning” approach for generative modeling, by relating the instantaneous/average velocity in an identity. The resulting model,

Excited to share our work with my amazing collaborators, <a href="/Goodeat258/">Goodeat</a>, <a href="/SimulatedAnneal/">Xingjian Bai</a>, <a href="/zicokolter/">Zico Kolter</a>, and Kaiming.

In a word, we show an “identity learning” approach for generative modeling, by relating the instantaneous/average velocity in an identity. The resulting model,

thumb_up_off_alt111

chat_bubble_outline4

repeat28

shareShare

YixuanEvenXu

@yixuanevenxu

5 months ago

✨ Did you know that NOT using all generated rollouts in GRPO can boost your reasoning LLM? Meet PODS! We down-sample rollouts and train on just a fraction, delivering notable gains over vanilla GRPO. (1/7)

$✨ Did you know that NOT using all generated rollouts in GRPO can boost your reasoning LLM? Meet PODS! We down-sample rollouts and train on just a fraction, delivering notable gains over vanilla GRPO. (1/7)$

thumb_up_off_alt135

chat_bubble_outline4

repeat16

shareShare

Vaishnavh Nagarajan

@_vaishnavh

5 months ago

Wrote my first blog post! I wanted to share a powerful yet under-recognized way to develop emotional maturity as a researcher: making it a habit to read about the ✨past ✨ and learn from it to make sense of the present

thumb_up_off_alt96

chat_bubble_outline1

repeat13

shareShare

Maksym Andriushchenko @ ICLR

@maksym_andr

4 months ago

🚨Excited to release OS-Harm! 🚨 The safety of computer use agents has been largely overlooked. We created a new safety benchmark based on OSWorld for measuring 3 broad categories of harm: 1. deliberate user misuse, 2. prompt injections, 3. model misbehavior.

thumb_up_off_alt94

chat_bubble_outline3

repeat26

shareShare

Zhengyang Geng

@zhengyanggeng

4 months ago

now the code is up here: github.com/Gsunshine/mean…

thumb_up_off_alt70

chat_bubble_outline2

repeat17

shareShare

Yiding Jiang

@yidingjiang

4 months ago

A mental model I find useful: all data acquisition (web scrapes, synthetic data, RL rollouts, etc.) is really an exploration problem 🔍. This perspective has some interesting implications for where AI is heading. Wrote down some thoughts: yidingjiang.github.io/blog/post/expl…

thumb_up_off_alt412

chat_bubble_outline5

repeat56

shareShare