Lilian Weng (@lilianweng) Twitter Tweets • TwiCopy

Lilian Weng

@lilianweng

Co-founder of Thinking Machines Lab @thinkymachines; Ex-VP, AI Safety & robotics, applied research @OpenAI; Author of Lil'Log

ID: 96999384

Link: https://lilianweng.github.io

Joined: 15-12-2009 15:17:40

187 Tweets

140K Followers

158 Following

Rule-based rewards (RBRs) use a model to provide RL signals based on a set of safety rubrics, making it easier to adapt to changing safety policies without heavy dependency on human data. It also enables us to look at safety and capability through a more unified lens as a more capable

311 likes, 13 replies, 44 retweets

Lilian Weng

@lilianweng

a year ago

Join us for a chat at Defcon & the AI Security Forum if you are interested! 🙌

81 likes, 2 replies, 5 retweets

Lilian Weng

@lilianweng

a year ago

Iterative deployment for maximizing AI safety learning needs to be built on top of rigorous science and process. We are learning and improving through each launch.

133 likes, 12 replies, 10 retweets

Mira Murati

@miramurati

10 months ago

All Plus and Team users in ChatGPT

3,3K likes, 213 replies, 229 retweets

Lilian Weng

@lilianweng

10 months ago

🩵🩵🩵

336 likes, 5 replies, 9 retweets

Lilian Weng

@lilianweng

9 months ago

📢 We are hiring Research Scientists and Engineers for safety research at OpenAI, ranging from safe model behavior training, adversarial robustness, AI in healthcare, frontier risk evaluation and more. Please fill in this form if you are interested: jobs.ashbyhq.com/openai/form/oa…

768 likes, 20 replies, 73 retweets

Lilian Weng

@lilianweng

9 months ago

After working at OpenAI for almost 7 years, I have decided to leave. I learned so much and now I'm ready for a reset and something new. Here is the note I just shared with the team. 🩵

6,6K likes, 269 replies, 345 retweets

Lilian Weng

@lilianweng

8 months ago

🦃 At the end of Thanksgiving holidays, I finally finished the piece on reward hacking. Not an easy one to write, phew. Reward hacking occurs when an RL agent exploits flaws in the reward function or env to maximize rewards without learning the intended behavior. This is imo a

1,1K likes, 67 replies, 228 retweets

Mira Murati

@miramurati

5 months ago

I started Thinking Machines Lab alongside a remarkable team of scientists, engineers, and builders. We're building three things:
- Helping people adapt AI systems to work for their specific needs
- Developing strong foundations to build more capable AI systems
- Fostering a

9,9K likes, 635 replies, 900 retweets

Lilian Weng

@lilianweng

4 months ago

👩‍🍳Actively cooking the next blog post. Tiny teaser: It is spiritually related to our new company.

600 likes, 16 replies, 11 retweets

Lilian Weng

@lilianweng

3 months ago

See you at #ICLR2025 soon. Excited about chatting with many of you about Thinking Machines and what we have been up to!

230 likes, 7 replies, 8 retweets

Lilian Weng

@lilianweng

3 months ago

Nope what’s that?

230 likes, 15 replies, 5 retweets

Lilian Weng

@lilianweng

2 months ago

When a new dataset comes out, I get excited and check it out, only to realize that this is another meta-mixed dataset combining a collection of other existing datasets. My brain immediately acts like "oh fork ... contamination!" No meta-meta-mixed dataset plzzzz :lolsob:

585 likes, 27 replies, 28 retweets

Lilian Weng

@lilianweng

2 months ago

Probably the first product Thinky will build is a full panel of dials that researchers can use to physically adjust all the hparams during training. We gonna do hardware one day and it is the time 😂

433 likes, 21 replies, 11 retweets