Natasha Jaques (@natashajaques)'s Twitter Profile
Natasha Jaques

@natashajaques

Assistant Professor @uwcse and Senior Research Scientist at @GoogleAI. Let's get off this app: bsky.app/profile/natash…

ID: 51257255

Link: http://natashajaques.ai · Joined: 26-06-2009 22:36:02

1.1K Tweets

28.28K Followers

1.1K Following

Inductive Biases in RL (@ibrlworkshop):

Announcing our first keynote! 🎤 Natasha Jaques (natashajaques.ai), Assistant Professor at the University of Washington and Senior Research Scientist at Google DeepMind, will speak on “Social Reinforcement Learning” — exploring multi-agent and human-AI interactions.

Eric Jang (@ericjang11):

Revoking visas of Chinese students studying in critical fields like AI and Robotics is incredibly short-sighted and harmful to America’s long-term prosperity. We want the best from every country to work for team America.

hardmaru (@hardmaru):

New Paper! Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents. A longstanding goal of AI research has been the creation of AI that can learn indefinitely. One path toward that goal is an AI that improves itself by rewriting its own code, including any code…

Jeff Clune (@jeffclune):

Excited to introduce the Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents. We harness the power of open-ended algorithms to search for agentic systems that get better at coding, including improving their own code. It’s the Automated Design of Agentic Systems…
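
The tweet names the ingredients (open-ended search, self-modifying agents, a coding benchmark), so a toy sketch of that loop is easy to write; the placeholder functions below stand in for the LLM agent and the benchmark and are my assumptions, not the paper's implementation.

```python
import random

def propose_modification(agent_code: str) -> str:
    """Hypothetical stand-in: in the real system, the agent (an LLM) edits
    its own source code. Here we append a random marker to stay runnable."""
    return agent_code + f"\n# tweak {random.randint(0, 9999)}"

def evaluate(agent_code: str) -> float:
    """Hypothetical coding-benchmark score in [0, 1]."""
    return random.random()

# Open-ended archive: new variants are kept even when they don't beat the
# current best, so the search can branch instead of pure hill-climbing.
archive = [{"code": "# seed agent", "score": 0.0}]
for step in range(100):
    parent = random.choice(archive)
    child_code = propose_modification(parent["code"])
    archive.append({"code": child_code, "score": evaluate(child_code)})

best = max(a["score"] for a in archive)
print(f"archive size: {len(archive)}, best score: {best:.3f}")
```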

Natasha Jaques (@natashajaques):

Currently, reinforcement learning from human feedback (RLHF) is the predominant method for ensuring LLMs are safe and aligned. And yet it provides no guarantees that they won’t say something harmful, copyrighted, or inappropriate. In our latest paper, we use online adversarial…
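
The tweet is cut off before the method, but generic online adversarial training for LLM safety usually looks like the loop below; every name here is a placeholder of mine, not the paper's actual approach.

```python
import random

PROMPTS = ["prompt A", "prompt B", "prompt C"]   # hypothetical attack pool

def red_team_prompt() -> str:
    """Hypothetical attacker; in practice an LLM trained with RL to find
    prompts that elicit unsafe completions."""
    return random.choice(PROMPTS)

def target_respond(prompt: str) -> str:
    """Hypothetical target LLM being aligned."""
    return f"response to {prompt}"

def safety_score(response: str) -> float:
    """Hypothetical safety classifier: 1.0 = safe, 0.0 = harmful."""
    return random.random()

for step in range(1_000):
    prompt = red_team_prompt()
    score = safety_score(target_respond(prompt))
    attacker_reward = 1.0 - score   # attacker is paid for breaking the model
    defender_reward = score         # defender is paid for staying safe
    # ...in a real system, both models are updated online here, e.g. with a
    # policy-gradient step on their respective rewards...
```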

Animesh Garg (@animesh_garg):

Natasha Jaques I am surprised this needed to be shown empirically! I always felt it was obvious. Perhaps ideas such as error propagation & accumulation and covariate shift are a way of thinking for folks in sequential decision making, but not for supervised learning 🤷

Abhishek Gupta (@abhishekunique7):

Learned visuomotor policies are notoriously fragile: they break with changes in conditions like lighting, clutter, or object variations, among other things. In Yunchu's latest work, we asked whether we could get these policies to be robust and generalizable with a clever…

Andrej Karpathy (@karpathy):

Sully Media will trend to drugs: highly addictive, brain-rotting. It's early enough that it's not yet obvious to most, but late enough that it's already real.

Natasha Jaques (@natashajaques):

How can you train an adversarial cooperator? It would be great to use adversarial training to get robust human-AI cooperation, but if you directly train the cooperation partner with an adversarial objective, it will just sabotage the task. Our latest work, GOAT, uses a…
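
The tweet cuts off before the mechanism, but the setup it describes suggests constraining the adversary so it can only choose among competent cooperators. One common way to do that is to search adversarially over the latent space of a generative model of partners; here is a toy sketch under that assumption, with all names hypothetical.

```python
import random

def partner_from_latent(z: float) -> dict:
    """Hypothetical pre-trained generator mapping a latent code to a
    *competent* cooperative partner policy: every z yields a partner that
    still plays the task, so the adversary cannot simply sabotage it."""
    return {"latent": z}

def team_return(partner: dict) -> float:
    """Hypothetical rollout of the ego agent teamed with this partner."""
    return random.random()

# Adversarial step: search the latent space for the partner that makes the
# team do worst, then train the ego agent against that partner.
candidates = [random.uniform(-1.0, 1.0) for _ in range(32)]
worst_z = min(candidates, key=lambda z: team_return(partner_from_latent(z)))
print(f"next training partner drawn from latent z = {worst_z:+.3f}")
```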

Natasha Jaques (@natashajaques):

In our latest paper, we discovered a surprising result: training LLMs with self-play reinforcement learning on zero-sum games (like poker) significantly improves performance on math and reasoning benchmarks, zero-shot. Whaaat? How does this work? We analyze the results and find…
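
Self-play on a zero-sum game is easy to make concrete. Below is a toy version using matching pennies and a one-parameter policy per player in place of poker and an LLM; the paper's actual training recipe is not reproduced here.

```python
import random

# P(play heads) for each of the two self-play copies.
probs = [0.5, 0.5]
lr = 0.01

def play(p: float) -> int:
    return 0 if random.random() < p else 1    # 0 = heads, 1 = tails

for step in range(10_000):
    a1, a2 = play(probs[0]), play(probs[1])
    r1 = 1.0 if a1 == a2 else -1.0            # zero-sum: r2 = -r1
    # Crude REINFORCE-style sign update for each copy; naive self-play on
    # matching pennies tends to cycle around the Nash mix of 0.50.
    probs[0] = min(0.99, max(0.01, probs[0] + lr * (1 if a1 == 0 else -1) * r1))
    probs[1] = min(0.99, max(0.01, probs[1] + lr * (1 if a2 == 0 else -1) * -r1))

print(f"final mixes: {probs[0]:.2f}, {probs[1]:.2f} (Nash is 0.50 each)")
```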

Natasha Jaques (@natashajaques):

Excited to release our latest paper on a new multi-turn RL objective for training LLMs to learn how to learn to adapt to the user. By optimizing for intrinsic curiosity, the LLM learns how to ask a series of questions over the course of the conversation to improve the accuracy of
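
"Optimizing for intrinsic curiosity" can be made concrete by rewarding each question for how much it reduces the model's uncertainty about the user. Here is a toy information-gain version; the setup and names are my illustration, not the paper's objective.

```python
import math
import random

secret = random.randrange(8)          # hidden user preference (1 of 8)
candidates = list(range(8))           # agent's current belief set

def entropy_bits(n: int) -> float:
    """Entropy of a uniform belief over n candidates."""
    return math.log2(n) if n > 0 else 0.0

total_reward = 0.0
while len(candidates) > 1:
    # "Ask a question": split the belief set in half; the user's truthful
    # answer tells the agent which half the preference is in.
    mid = len(candidates) // 2
    before = entropy_bits(len(candidates))
    half = candidates[:mid]
    candidates = half if secret in half else candidates[mid:]
    total_reward += before - entropy_bits(len(candidates))  # bits gained

print(f"identified preference {candidates[0]} (truth: {secret}); "
      f"curiosity reward = {total_reward:.1f} bits")
```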