Ben Lipkin (@ben_lipkin) Twitter Tweets • TwiCopy

Ben Lipkin

@ben_lipkin

phd @mit. cogsci, probml, nlp. he/him.

ID: 565036478

Link: http://benlipkin.github.io

Joined: 28-04-2012 00:50:33

177 Tweets

628 Followers

1.1K Following

Current KL estimation practices in RLHF can generate high variance and even negative values! We propose a provably better estimator that only takes a few lines of code to implement.🧵👇 w/ Tim Vieira and Ryan Cotterell paper: arxiv.org/pdf/2504.10637 code: github.com/rycolab/kl-rb

113 Likes

4 Replies

28 Retweets
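The tweet above does not spell out the estimator, so the following is only a minimal sketch under stated assumptions. It assumes the improvement is a Rao-Blackwellized estimator: instead of summing the sampled log-ratio at each step, it sums the exact KL between the two next-token distributions given the sampled prefix. The toy first-order Markov "language models" (`P_TABLE`, `Q_TABLE`) and all function names are hypothetical and are not taken from the linked repository.

```python
import math
import random
import statistics

# Hypothetical toy models: row s is the next-token distribution given
# previous token s. P plays the role of the policy, Q the reference model.
P_TABLE = [[0.7, 0.2, 0.1], [0.1, 0.6, 0.3], [0.3, 0.3, 0.4]]
Q_TABLE = [[0.5, 0.3, 0.2], [0.2, 0.5, 0.3], [0.25, 0.35, 0.4]]


def token_kl(p, q):
    """Exact KL(p || q) between two next-token distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def estimate_kl(p_table, q_table, length, num_samples, seed=0):
    """Return per-sample values of two estimators of sequence-level KL(P || Q).

    mc: sum of sampled log-ratios (the common practice; can be negative).
    rb: sum of exact per-step KLs along the sampled prefix (assumed
        Rao-Blackwellized variant; every term is non-negative).
    """
    rng = random.Random(seed)
    mc, rb = [], []
    for _ in range(num_samples):
        prev = 0  # fixed start token (an assumption of this toy setup)
        mc_val = 0.0
        rb_val = 0.0
        for _ in range(length):
            p, q = p_table[prev], q_table[prev]
            tok = rng.choices(range(len(p)), weights=p, k=1)[0]
            mc_val += math.log(p[tok] / q[tok])
            rb_val += token_kl(p, q)
            prev = tok
        mc.append(mc_val)
        rb.append(rb_val)
    return mc, rb


def exact_kl(p_table, q_table, length):
    """Exact sequence-level KL for the toy Markov models, for reference."""
    n = len(p_table)
    state = [1.0] + [0.0] * (n - 1)  # distribution over the previous token
    total = 0.0
    for _ in range(length):
        total += sum(w * token_kl(p_table[s], q_table[s])
                     for s, w in enumerate(state))
        state = [sum(state[s] * p_table[s][t] for s in range(n))
                 for t in range(n)]
    return total


if __name__ == "__main__":
    mc, rb = estimate_kl(P_TABLE, Q_TABLE, length=5, num_samples=2000)
    print("exact:", exact_kl(P_TABLE, Q_TABLE, length=5))
    print("mc: mean", statistics.mean(mc), "var", statistics.variance(mc))
    print("rb: mean", statistics.mean(rb), "var", statistics.variance(rb))
```

Both estimators draw the same samples from P and target the same quantity by the chain rule for KL. In this toy setup the sampled log-ratio sum can dip below zero for individual samples, while the per-step exact KL sum cannot.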

Ben Lipkin

@ben_lipkin

2 months ago

Life news: I moved to SF for the next few months. Excited to connect with old friends and meet new ones. Get in touch if you're around these days :)

16 Likes

0 Replies

0 Retweets

Ahmad Beirami @ ICLR 2025

@abeirami

2 months ago

As we go through a lot of excitement about RL recently with lots of cool work/results, here is a reminder that RL with a reverse KL-regularizer to the base model cannot learn new skills that were not already present in the base model. It can only amplify the existing weak skills.

475 Likes

12 Replies

52 Retweets
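A short worked equation may help show where the claim above comes from. This is a sketch of the standard closed-form solution of the KL-regularized objective, not something stated in the tweet. The symbols $r$ (reward), $\beta > 0$ (regularization strength), and $\pi_{\mathrm{ref}}$ (base model) are notation assumed here for illustration.

```latex
\[
\pi^{*} \;=\; \arg\max_{\pi}\; \mathbb{E}_{y \sim \pi}\big[r(y)\big]
\;-\; \beta\, \mathrm{KL}\big(\pi \,\|\, \pi_{\mathrm{ref}}\big)
\quad\Longrightarrow\quad
\pi^{*}(y) \;=\;
\frac{\pi_{\mathrm{ref}}(y)\, \exp\!\big(r(y)/\beta\big)}
     {\sum_{y'} \pi_{\mathrm{ref}}(y')\, \exp\!\big(r(y')/\beta\big)}
\]
```

Under these assumptions (finite reward, exact optimum), any output with $\pi_{\mathrm{ref}}(y) = 0$ also has $\pi^{*}(y) = 0$: the reward only reweights outputs the base model already assigns probability to.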

Morph

@morph_labs

2 months ago

We are excited to announce Trinity, an autoformalization system for verified superintelligence that we have developed at Morph. We have used it to automatically formalize in Lean a classical result of de Bruijn that the abc conjecture is true almost always.

375 Likes

10 Replies

50 Retweets

Kevin Ellis

@ellisk_kellis

2 months ago

New paper: World models + Program synthesis by Wasu Top Piriyakulkij
1. World modeling on-the-fly by synthesizing programs w/ 4000+ lines of code
2. Learns new environments from minutes of experience
3. Positive score on Montezuma's Revenge
4. Compositional generalization to new environments

556 Likes

14 Replies

100 Retweets

Jacob Andreas

@jacobandreas

2 months ago

👉 New preprint on a new family of Transformer-type models whose depth scales logarithmically with sequence length. Enables:
- fast training
- fast decoding
- large memory capacity in associative recall
- strong length generalization on state tracking

77 Likes

1 Reply

9 Retweets

Ben Lipkin

Gate.io

Afra Amini

Ben Lipkin

Ahmad Beirami @ ICLR 2025

Morph

Kevin Ellis

Jacob Andreas